gffread "Error (GFaSeqGet): end coordinate (76134) cannot be larger than sequence length 76132"
7.6 years ago
qwzhang0601 ▴ 80

Hello, I am trying to extract coding sequences from a GFF file and the genome FASTA file using gffread. It works well with "gffread my.gff -g genome.assembly.fasta -x cds.fa". But after I add the "-J" parameter, it reports the error "Error (GFaSeqGet): end coordinate (76134) cannot be larger than sequence length 76132". I manually checked the annotation of the contig with length 76132 and found no feature with the end coordinate 76134. The maximum end coordinate on that contig is 76132.
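The manual check described above could be scripted roughly as follows. This is only a sketch: `sequence_lengths` and `features_near_end` are hypothetical helper names, the input is assumed to be plain tab-separated GFF with nine columns, and `margin` is an illustrative parameter for also listing features that end within a few bases of the contig end. It does not reproduce what gffread does internally.

```python
def sequence_lengths(fasta_lines):
    """Return {sequence_id: length} from an iterable of FASTA lines."""
    lengths, name = {}, None
    for line in fasta_lines:
        line = line.strip()
        if line.startswith(">"):
            parts = line[1:].split()
            name = parts[0] if parts else ""  # ID is the first word of the header
            lengths[name] = 0
        elif line and name is not None:
            lengths[name] += len(line)
    return lengths


def features_near_end(gff_lines, lengths, margin=0):
    """List (seqid, type, end, seq_length) for features with end + margin > seq_length.

    margin=0 reports only features that run past the end of their sequence.
    """
    hits = []
    for line in gff_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines, comments and directives
        cols = line.rstrip("\n").split("\t")
        if len(cols) < 9:
            continue  # not a feature line (assumption: 9-column GFF)
        seqid, ftype, end = cols[0], cols[2], int(cols[4])
        if seqid in lengths and end + margin > lengths[seqid]:
            hits.append((seqid, ftype, end, lengths[seqid]))
    return hits
```

Both functions accept any iterable of lines, so open file handles for my.gff and genome.assembly.fasta can be passed directly.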

About the "-J" parameter

-J discard any mRNAs that either lack initial START codon or the terminal STOP codon, or have an in-frame stop codon (only print mRNAs with a full, valid CDS)

Has anybody run into the same problem?

Thanks

gff Cufflink gffread • 4.7k views

Hi. I have the same issue. Did you find any solution?


Please use ADD COMMENT/ADD REPLY when responding to existing posts to keep threads logically organized.


I had the same problem. For me the "easiest" way is to run gffread without the -J parameter, then use a custom script to check that each sequence starts with "ATG" and ends with a stop codon. Additionally, you have to look for in-frame stop codons inside the sequence.
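A minimal sketch of such a custom script is given here. It assumes the standard genetic code (stop codons TAA, TAG, TGA) and that the FASTA written by "-x" holds one spliced CDS per record; `read_fasta` and `cds_problems` are hypothetical helper names, not part of gffread.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def read_fasta(handle):
    """Parse FASTA lines from a file-like object into {record_id: sequence}."""
    records, name, chunks = {}, None, []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                records[name] = "".join(chunks)
            parts = line[1:].split()
            name, chunks = (parts[0] if parts else ""), []
        elif line and name is not None:
            chunks.append(line)
    if name is not None:
        records[name] = "".join(chunks)
    return records


def cds_problems(seq):
    """Return a list of reasons why seq is not a full, valid CDS (empty if none)."""
    seq = seq.upper()
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length not a multiple of 3")
    if not seq.startswith("ATG"):
        problems.append("no ATG start codon")
    if seq[-3:] not in STOP_CODONS:
        problems.append("no terminal stop codon")
    # Codons read in frame from the first base; incomplete trailing bases are ignored.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    # Exclude the final codon only when it really is the end of the sequence.
    internal = codons[:-1] if len(seq) % 3 == 0 else codons
    if any(codon in STOP_CODONS for codon in internal):
        problems.append("in-frame stop codon")
    return problems
```

To filter the gffread output, one could open cds.fa, pass the handle to `read_fasta`, and keep only the records for which `cds_problems` returns an empty list.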
