Entering edit mode
6.5 years ago
worarado.kan
▴
20
Hello everyone,
I try to change GFF file to GTF for analysing in HISAT.
Why when I run following this command;
gffread Blueberry.gff -T -o Blueberry.gtf
Showed this error
GFF Error: overlapping duplicate gene feature (ID=CUFF.71) GFF Error: overlapping duplicate gene feature (ID=CUFF.168) GFF Error: overlapping duplicate gene feature (ID=CUFF.279) GFF Error: overlapping duplicate gene feature (ID=CUFF.330) ....................................
What I am wrong? Please give me the guidance.
Thank you so much
Hello, Please run:
Then paste the output here.
i have the same issue while trying to convert gff3 files to gtf file using gffread. i am new to RNAseq data analysis , i am trying to convert my bam files to raw count so could use DEseq2 to find DEGs, below is the error message after i run the command line,
hope i can get some help.
i did "
grep -e "mRNA20294
" my file" got run info below:thanks,
Hey, well, just looking at your output, I can see that mRNA20294 is indeed duplicated:
How did you produce this data?
thanks for the quick reply, i downloaded the reference file from JGI website for Daphnia pulex. The only option is either a .gff file or . gff3 file. The data used downloaded .gff3 file with command line gffread -E my file.gff3 -T -o my file. gtf, which i got nothing in my output. this is the gff3 file looks like :
gff-version 3
scaffold_1 . gene 62188 73952 . + . ID=gene1;Name=gw1.1.25.1 scaffold_1 . mRNA 62188 73952 . + . ID=mRNA1;Parent=gene1;Name=gw1.1.25.1 scaffold_1 . exon 62188 62346 . + . ID=exon1;Parent=mRNA1 scaffold_1 . exon 62647 62736 . + . ID=exon2;Parent=mRNA1 scaffold_1 . exon 67111 67169 . + . ID=exon3;Parent=mRNA1 scaffold_1 . exon 67365 67525 . + . ID=exon4;Parent=mRNA1
but if i use .gff file , scaffold_1 JGI exon 264824 264890 . + . name "estExt_fgenesh1_kg.C_10002"; transcriptId 230065 scaffold_1 JGI CDS 264882 264890 . + 0 name "estExt_fgenesh1_kg.C_10002"; proteinId 230065; exonNumber 1 scaffold_1 JGI start_codon 264882 264884 . + 0 name "estExt_fgenesh1_kg.C_10002" scaffold_1 JGI exon 265102 265158 . + . name
then i got no error for the same command line with gffread, but i still got nothing on my output file. i don't know what should i suppose to do. I used the fasta and gff3 to generate the genome index in STAR and mapped my reads to generate the BAM file, i tried to use HTseq to convert bam file to raw counts so i could do the DEG analysis , i know HTseq except the gtf file, but no such files available, so i am trying to use gffread to generate a gtf file form gff or gff3 file. thanks
i tried gff file with gffread-0.9.12.OSX_x86_64, i got lots of warning on the same command:
Warning: invalid GTF record, transcript_id not found: scaffold_999 JGI CDS 40436 40726 . - 0 name "SNAP_00037760"; proteinId 269702; exonNumber 2 Warning: invalid GTF record, transcript_id not found: scaffold_999 JGI stop_codon 40436 40438 . - 0 name "SNAP_00037760" Warning: invalid GTF record, transcript_id not found: scaffold_999 JGI exon 41056 41220 . - . name "SNAP_00037760"; transcriptId 269702 Warning: invalid GTF record, transcript_id not found: scaffold_999 JGI CDS 41056 41220 . - 0 name "SNAP_00037760"; proteinId 269702; exonNumber 1 Warning: invalid GTF record, transcript_id not found: scaffold_999 JGI start_codon 41218 41220 . - 0 name "SNAP_00037760"