Ensembl GTF for mouse that includes introns, exons, UTRs, etc.
7.9 years ago
achamess ▴ 90

I have some RNA-seq data that I aligned with STAR against the Ensembl GRCm38 genome, so for counting with HTSeq I was going to use the corresponding Ensembl GTF. My data is polyA-selected, but it comes from nuclei, so there is a lot of unspliced RNA and many reads will fall in introns. If I count only exons, I'll miss a lot. I want to do gene-level counting. The Ensembl GTF file has transcript and gene features. I'm assuming gene means the coordinates spanning exons plus introns, but if I wanted to get just introns, how could I do this? I don't see an Ensembl GTF that has introns as a feature.

RNA-Seq • 3.4k views

Try setting -t transcript, which I expect will result in htseq-count using intronic alignments.


Thank you. What, then, does the feature type 'gene' signify? Is that just exons?


Do you mean -t or -i? You could also set -t gene, I presume, and have the same result.

BTW, featureCounts is much faster.
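
For reference, a roughly comparable featureCounts call might look like the sketch below. The `-t`, `-g`, `-a`, and `-o` options are standard featureCounts flags; the wrapper name `count_genes` and its argument order are hypothetical. Strandedness and paired-end options are left out and would need to match the library: featureCounts defaults to unstranded counting (`-s 0`), whereas htseq-count defaults to `--stranded=yes`, and there is no direct equivalent of htseq-count's `intersection-strict` mode.

```shell
# Hypothetical wrapper: count reads over whole "gene" features (exons + introns).
# Usage: count_genes ANNOTATION.gtf OUTPUT.txt ALIGNMENTS.bam
count_genes() {
  # -t gene    : use GTF records whose feature type is "gene"
  # -g gene_id : summarize counts by the gene_id attribute
  featureCounts -t gene -g gene_id -a "$1" -o "$2" "$3"
}
```
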


Here is what I did:

htseq-count --format bam --order pos --mode intersection-strict --type gene --idattr gene_id $ALIGN/STAR-alignments/AC3-Aligned.sorted.bam $REFS/ensembl/Mus_musculus.GRCm38.87.gtf > $RNASEQ/expression/htseq-ensembl/AC3-counts.tsv

My expectation is that that'll do what you want.
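
On the intron-only part of the original question: Ensembl GTFs do not include intron records, but introns can be derived as the gaps between consecutive exons of each transcript. Below is a minimal sketch of that idea, assuming a standard tab-separated GTF with a `transcript_id` attribute on every exon line. The function name `gtf_introns` and the BED-like output layout (chromosome, 0-based start, end, transcript ID, strand) are illustrative choices, not part of any tool.

```shell
# Hypothetical helper: print intron intervals per transcript from a GTF file.
# Usage: gtf_introns ANNOTATION.gtf > introns.bed
gtf_introns() {
  awk -F'\t' '$3 == "exon" {
        # extract the transcript_id value from the attribute column (column 9)
        if (match($9, /transcript_id "[^"]+"/)) {
          tid = substr($9, RSTART + 15, RLENGTH - 16)
          print $1 "\t" $4 "\t" $5 "\t" tid "\t" $7
        }
      }' "$1" |
  # group exons by transcript, ordered by start coordinate within each transcript
  LC_ALL=C sort -t "$(printf '\t')" -k4,4 -k2,2n |
  awk -F'\t' -v OFS='\t' '
      # a gap between consecutive exons of one transcript is an intron;
      # GTF is 1-based inclusive, so BED start = previous exon end, BED end = next exon start - 1
      $4 == prev_tid && $2 > prev_end + 1 { print $1, prev_end, $2 - 1, $4, $5 }
      { prev_tid = $4; prev_end = $3 }'
}
```

Note that an intron of one transcript can overlap an exon of another transcript of the same gene, so these intervals would need further filtering before being used as strictly intronic regions.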

