Hello,
I was running HTSeq count script and I keep getting an error saying "Error: Feature EDO49941-1 does not contain a 'gene_id' attribute"? I'm guessing its because I used a .gff3 instead of .gtf. So if I were to convert the file, should I use the method makeTxDdfromGFF() from the genomicFeatures package or is there a better way?
UPDATED QUESTION: I tried using the -i command line option but keep getting the error below, so I was hoping some one just help me figure out how to use the -i command line option so it works with my .gff3 file? Thanks.
Error: Feature EDO49941-1 does not contain a 'ID' attribute
Here is the command I used:
htseq-count -i ID /home/ubuntu/data/rnaseq/nematostella/venus/alignment/1_STARAligned.out.sam /home/ubuntu/data/rnaseq/nematostella/venus/genome/Nematostella_vectensis.GCA_000209225.1.33.gff3
Here are a couple lines of my .gff3 file:
NEMVEscaffold_1 JGI gene 25970 37222 . - . ID=gene:NEMVEDRAFT_v1g196074;biotype=protein_coding;description=Predicted protein [Source:UniProtKB/TrEMBL%3BAcc:A7REN5];gene_id=NEMVEDRAFT_v1g196074;logic_name=jgi_nemve
NEMVEscaffold_1 JGI gene 40959 43943 . - . ID=gene:NEMVEDRAFT_v1g196075;biotype=protein_coding;description=Predicted protein [Source:UniProtKB/TrEMBL%3BAcc:A7REN6];gene_id=NEMVEDRAFT_v1g196075;logic_name=jgi_nemve
NEMVEscaffold_1 JGI pseudogene 46093 47031 . + . ID=gene:NEMVEDRAFT_v1g5443;biotype=pseudogene;gene_id=NEMVEDRAFT_v1g5443;logic_name=jgi_nemve
NEMVEscaffold_1 JGI gene 47232 48589 . + . ID=gene:NEMVEDRAFT_v1g237689;biotype=protein_coding;description=Predicted protein [Source:UniProtKB/TrEMBL%3BAcc:A7REN7];gene_id=NEMVEDRAFT_v1g237689;logic_name=jgi_nemve
Thanks.
Have you tried
-i gene_id
which is the default.Yeah I started with that, -i gene_id. But it says "gene_id" is not found so I wasn't confused? I thought I may need to change it since it was .gff3 file.