Entering edit mode
7.5 years ago
oma219
▴
40
Hello,
I having second thoughts about how I made my annotation file I just wanted to get second thoughts. So originally, I had an annotation file in the .gff3 format that whenever I ran it with htseq-count or featureCounts would give me an error since not every line had a gene_id tag? So for getting gene counts, are the only lines I need the ones that are type exon and have a gene_id tag?
Here are the top lines of my annotation:
##gff-version 3
#!genome-build JGI ASM20922v1
#!genome-version GCA_000209225.1
#!genome-date 2007-09
#!genome-build-accession GCA_000209225.1
#!genebuild-last-updated 2007-09
NEMVEscaffold_1 JGI supercontig 1 3256212 . . . ID=supercontig:NEMVEscaffold_1;Alias=DS469507.1
NEMVEscaffold_1 JGI gene 25970 37222 . - . ID=gene:NEMVEDRAFT_v1g196074;biotype=protein_coding;description=Predicted protein [Source:UniProtKB/TrEMBL%3BAcc:A7REN5];gene_id=NEMVEDRAFT_v1g196074;logic_name=jgi_nemve
NEMVEscaffold_1 JGI gene 40959 43943 . - . ID=gene:NEMVEDRAFT_v1g196075;biotype=protein_coding;description=Predicted protein [Source:UniProtKB/TrEMBL%3BAcc:A7REN6];gene_id=NEMVEDRAFT_v1g196075;logic_name=jgi_nemve
NEMVEscaffold_1 JGI pseudogene 46093 47031 . + . ID=gene:NEMVEDRAFT_v1g5443;biotype=pseudogene;gene_id=NEMVEDRAFT_v1g5443;logic_name=jgi_nemve
NEMVEscaffold_1 JGI gene 47232 48589 . + . ID=gene:NEMVEDRAFT_v1g237689;biotype=protein_coding;description=Predicted protein [Source:UniProtKB/TrEMBL%3BAcc:A7REN7];gene_id=NEMVEDRAFT_v1g237689;logic_name=jgi_nemve
NEMVEscaffold_1 JGI gene 48731 52946 . - . ID=gene:NEMVEDRAFT_v1g43493;biotype=protein_coding;description=Predicted protein [Source:UniProtKB/TrEMBL%3BAcc:A7REN8];gene_id=NEMVEDRAFT_v1g43493;logic_name=jgi_nemve
I added markup to your post for increased readability. You can do this by selecting the text and clicking the 101010 button. When you compose or edit a post that button is in your toolbar.
Can you copy / paste the errors and commands used? Which versions?
That's a warning. Not an error.