How to retrieve all intron sequences and their information (e.g. location) from a genome?
8.7 years ago
biomoon ▴ 10

How can I retrieve all the intron sequences and their information (e.g. location) if I have a genome? Thanks in advance!

next-gen sequence genome • 3.8k views
8.7 years ago
dlyons • 0

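Combine your exon and UTR files, then run "bedtools subtract" with the gene file as -a and the combined file as -b ("bedtools intersect -v" would drop any gene that overlaps an exon at all, rather than trimming it). Everything remaining is intron (or at least this would be a good first guess).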
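To make the idea concrete (gene span minus exon and UTR blocks leaves the introns), here is a minimal sketch of the interval arithmetic in plain Python. The function name `subtract_intervals` and the example coordinates are hypothetical, and the coordinates are assumed to be 0-based, half-open as in BED.

```python
def subtract_intervals(gene, blocks):
    """Return the parts of `gene` not covered by any interval in `blocks`.

    `gene` is a (start, end) tuple and `blocks` is a list of (start, end)
    tuples (exons and UTRs), all 0-based, half-open as in BED.
    """
    gene_start, gene_end = gene
    gaps = []
    cursor = gene_start  # leftmost position not yet accounted for
    for start, end in sorted(blocks):
        if start >= gene_end:
            break  # block lies entirely after the gene
        if end <= cursor:
            continue  # block lies entirely in already-covered territory
        if start > cursor:
            gaps.append((cursor, start))  # uncovered stretch = candidate intron
        cursor = max(cursor, end)
    if cursor < gene_end:
        gaps.append((cursor, gene_end))
    return gaps


# Hypothetical gene with three exon/UTR blocks
introns = subtract_intervals((100, 1000), [(100, 200), (400, 500), (900, 1000)])
print(introns)  # [(200, 400), (500, 900)]
```

Note that this works per gene; with overlapping genes or alternative transcripts the result is only a first guess, as said above.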

Thanks. I have a GTF file and a genome sequence. Bedtools can do that, right?

8.7 years ago
GenoMax 147k

If it is one of the genomes available at UCSC or Ensembl, you could use the Table Browser or BioMart, respectively (click on BioMart in the top navigation bar), to get that information.

Thank you. That is a good idea for a genome that is available on those websites. What I want is to get intron sequences from a new genome.

8.7 years ago

Look for a GFF or GTF file for your genome.

If none is present, you can try annotating the genome with programs like GeneMark, AUGUSTUS, etc.

Thanks. I have got a GTF file using AUGUSTUS. Can you tell me how to put all intron sequences into a single file? Thank you.

Can you be more precise about what exactly you want to get?

@biomoon: If you have a GTF file, can you see annotations labelled as "intron"? If so, you can use the answer in this thread to get the sequences: convert bed file to fasta file
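If you would rather script it, here is a rough sketch in plain Python that collects every GTF line whose feature column is "intron" and writes the matching sequences to one FASTA file. It assumes the GTF really contains such lines, that the sequence names in column 1 match the FASTA headers, and that the genome fits in memory. The function names and the header format are made up for illustration.

```python
# Translation table for complementing a nucleotide string
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def read_fasta(lines):
    """Parse FASTA lines into {sequence_name: sequence}."""
    genome, name, chunks = {}, None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                genome[name] = "".join(chunks)
            # Keep only the first word of the header as the sequence name
            name, chunks = line[1:].split()[0], []
        elif line:
            chunks.append(line)
    if name is not None:
        genome[name] = "".join(chunks)
    return genome


def intron_records(gtf_lines, genome):
    """Yield (header, sequence) for every GTF line with feature 'intron'."""
    for line in gtf_lines:
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "intron":
            continue
        chrom, strand = fields[0], fields[6]
        start, end = int(fields[3]), int(fields[4])
        seq = genome[chrom][start - 1:end]  # GTF is 1-based, inclusive
        if strand == "-":
            seq = seq.translate(COMPLEMENT)[::-1]  # reverse complement
        yield f"{chrom}:{start}-{end}({strand})", seq


def write_intron_fasta(gtf_path, genome_path, out_path):
    """Write all intron sequences to a single FASTA file."""
    with open(genome_path) as fh:
        genome = read_fasta(fh)
    with open(gtf_path) as gtf, open(out_path, "w") as out:
        for header, seq in intron_records(gtf, genome):
            out.write(f">{header}\n{seq}\n")
```

Calling `write_intron_fasta("augustus.gtf", "genome.fa", "introns.fa")` (placeholder file names) would then give one file with all introns, with the location kept in each header.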
