Question about exon/intron counting in RNA-seq dataset analysis
2
0
Entering edit mode
5.6 years ago
nanoide ▴ 120

Hi, So I'm currently analyzing some RNA-seq datasets and wanted to get expression at the gene level. So I got raw counts with featureCounts and then got normalized counts using DESeq2.

My question is, I have huge genes with very high introns. I'm afraid the normalization is taking into account the whole gene length, so expression in exons of genes with huge introns would be underrated. I believe DESeq-2 normalizes depending on the total on reads but I don't know if the fact that I have huge introns might be confounding in oter parts of the analysis. Maybe I can perform the analyses at the exon level and then somehow combine the counts per gene? Don't know if that's crazy

Any thoughts? Thanks for any help

RNA-Seq htseq-count featureCounts • 2.3k views
ADD COMMENT
5
Entering edit mode
5.6 years ago
caggtaagtat ★ 1.9k

I think you can use featureCount to counts reads on exons and then summarize data of all exons of one gene. So intron lengths should not be a problem.

Edit: as long as there are exons annotated in your gtf file

Each entry in the provided annotation file is taken as a feature (e.g. an exon). A meta-feature is the aggregation of a set of features (e.g. a gene). The featureCounts program uses the gene_id attribute available in the GTF format annotation (or the GeneID column in the SAF format annotation) to group features into meta-features, ie. features belonging to the same meta-feature have the same gene identifier.

ADD COMMENT
0
Entering edit mode

Thank you for your useful answer! Will take it into account. Regards

ADD REPLY
1
Entering edit mode
5.6 years ago

DESeq knows nothing about your features except the names. How could it? You never give it that information. If the gtf you gave to FeatureCounts had proper exon and intron information, you are fine.

ADD COMMENT

Login before adding your answer.

Traffic: 1918 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6