Tophat Gene Models Merging
1
0
Entering edit mode
12.6 years ago
Bioinfosm ▴ 620

Am looking to use multiple human gene models for tophat, like ensembl, refseq and custom gtf files. Can one simply concatenate all these gtf (from the same genome build) and use the final file to guide tophat for rna-seq data?

These would obviously contain exact duplicates and overlaps. One can remove the exact duplicates if that helps tophat performance and efficiency, but overlaps become tricky...

thanks in advance..

tophat rna-seq • 3.3k views
ADD COMMENT
0
Entering edit mode
12.6 years ago
Wen.Huang ★ 1.2k

cufflinks has a utility called "cuffmerge" that does perhaps what you want.

ADD COMMENT
0
Entering edit mode

THanks Wen! But I think cuffmerge is at the latter end of analysis to merge the cufflinks results and annotate. What am looking for is the front end merging of different annotation sources like refseq, ensembl, etc. so I can have all the possible models to guide tophat analysis. If its not necessary to merge and no loss of efficiency, I can simply use a concatenate of all those annotations

ADD REPLY
0
Entering edit mode

cuffmerge DOES give you a union of all possible models. it takes ANY gtf files. what is better, it filters out redundant ones. but if you don't care about duplicates, then yes concatenation is the simplest way.

ADD REPLY

Login before adding your answer.

Traffic: 1920 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6