Hi guys,
I want to calculate repeats expression in my RNAseq data. I've obtained bam files using TopHat and now I need gtf file for repeats to calculate the counts. Where can I download it?
Thanks!
Hi guys,
I want to calculate repeats expression in my RNAseq data. I've obtained bam files using TopHat and now I need gtf file for repeats to calculate the counts. Where can I download it?
Thanks!
Go to UCSC
and under Tools > Table Browser
Choose your genome and track (ideally RefSeq genes), and select "Output format: GTF"
Actually I downloaded an archive with genome, Bowtie2 indexes and other files here: ftp://ussd-ftp.illumina.com/Mus_musculus/UCSC/mm10/ So it is UCSC as far as I understand
Hi Seta,
I simply use tophat2 to map the reads to reference genome. Then I sort my reads using samtools and use htseq-count to obtain counts from bam file. On this stage I needed gtf file we discussed here. Then you can the apply any normalization to counts, I prefer DeSeq. Let me know if still you have questions.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Remember to get a GTF file that matches your genome. If your genome came from Ensembl then you need to get the GTF from ensembl. Chromosome identifiers may otherwise not match.
BTW: Repeat tracks are under "Variation and Repeats" group in UCSC table browser.