Extract transcript fasta sequences using gtf
0
0
Entering edit mode
2.6 years ago
otieno43 ▴ 30

Hello, I am trying to extract fasta sequences of transcripts in genome fasta file with gtf annotation file using gffread that used the command: gffread -w transcripts.fa -g /path/to/genome.fa transcripts.gtf. However, when I execute this, I get 0k of trranscript.fasta (empty). I am unable to figure out what could be the problem. Anyone who can help please.

Here is the command I used and the results:

./gffread/gffread -w transcripts.fasta -g /Users/ee/Desktop/genome/VectorBase-56_GfuscipesIAEA2018_Genome.fasta /Users/ee/Desktop/genome/Gfus_x10_vNew.gtf

Results
290K VectorBase-56_GfuscipesIAEA2018_Genome.fasta
0B transcripts.fa
2.1K gclib

Thanks Erick

transcript using fasta gtf sequences • 2.8k views
ADD COMMENT
1
Entering edit mode

Are the fasta and gtf obtained from the same source? You may want to include a link to them. It's important that the gtf corresponds to the correct assembly, and that formatting such as chromosome names are the same.

ADD REPLY
1
Entering edit mode

On top of the important checks mentioned by rpolicastro you can check what type of feature is present in the gff file ( You need transcript feature for that command to work). You may use agat_sp_extract_sequences.pl from AGAT that will list at the beginning of the parsing log, the feature types found in your file.

ADD REPLY
0
Entering edit mode

The gtf file has transcript feature in it.

ADD REPLY
0
Entering edit mode

Yes the fasta and gtf files are from the same source. The gtf features correspond correctly with the genome assembly. Everything seems correct, I can't just figure out what is the issue. I can share with you the gtf and genome fasta file.

ADD REPLY
0
Entering edit mode

Yea, please share the files and I can check them.

ADD REPLY

Login before adding your answer.

Traffic: 1966 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6