FeatureCounts dont read all features
1
0
Entering edit mode
3.4 years ago
David • 0

Dear all,

I think I may have a problem importing GTF file into featureCounts cause it only import 121 features out of 3K+ features. I got my GTF files from genbank assembly (https://www.ncbi.nlm.nih.gov/assembly/GCF_000021625.1) without any modification;

> fc <- featureCounts(bam.files, isGTFAnnotationFile=TRUE, annot.ext="/Users/RNAseq/GCF_009646135.1_ASM964613v1_feature_table.gtf")

Load annotation file GCA_000016245.1_ASM1624v1_genomic.gtf ...             ||
||    Features : 121                                                          ||
||    Meta-features : 121                                                     ||
||    Chromosomes/contigs : 2    

Any idea what the problem is?

Thanks

featureCounts • 1.1k views
ADD COMMENT
4
Entering edit mode
3.4 years ago

By default, featureCounts considers only exons as features.

 -t <string>         Specify feature type in GTF annotation. `exon' by 
                      default. Features used for read counting will be 
                      extracted from annotation using the provided value.

In this annotation, there is only 127 exons... so it considers only those.

 cut -f 3 GCF_000021625.1_ASM2162v1_genomic.gtf | sort | uniq -c
3733 CDS
 127 exon
3854 gene
3727 start_codon
3727 stop_codon

To correct this behaviour, use option -t 'gene" which make more sense with prokaryotes anyway.

ADD COMMENT
0
Entering edit mode

Yes, that makes sense, and it works now... Thank you very much for your help! Cheers

ADD REPLY
0
Entering edit mode

If an answer was helpful, you should upvote it; if the answer resolved your question, you should mark it as accepted. You can accept more than one if they work.
upvote_bookmark_accept

ADD REPLY

Login before adding your answer.

Traffic: 1862 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6