How to retrieve all the intron sequences and information (e.g. location) if I have a genome. Thanks in advance!
How to retrieve all the intron sequences and information (e.g. location) if I have a genome. Thanks in advance!
combine your exon and UTR files, then do a "bedtools intersect -v" against the gene file. everything remaining is intron (or at least this would be a good first guess).
If it is one of the genomes available at UCSC/Ensembl you could use Table browser/BioMart (click on BioMart is top navigation bar) respectively to get that information.
Look for a gff or gft file of your genome.
If no present, you can try some annotation using programs like genemark, August, et
@biomoon: If you have a GTF file then can you see annotations labelled as "introns"? You can use the answer in this thread to get the sequence: convert bed file to fasta file
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Thanks. I have gtf file and a genome sequence. Bedtools can do that, right?