Genomic Bam To Transcriptome Bam
3
1
Entering edit mode
11.0 years ago
dfernan ▴ 770

Hi,

I was looking for a tool to convert a bam file in genomic coordinates to transcriptomic coordinates (using a GTF reference). Is there a fast way to do this for large bam files? Such tool would filter out reads that do not map to any transcript.

Let me know if you have any suggestions,

Thanks,
Daniel F.

bam samtools • 8.3k views
ADD COMMENT
2
Entering edit mode

You'll probably get the best answer by simply mapping your reads to the transcripts directly if having transcript-based BAM files is really your goal. If, instead, you want read counts for your genomic BAM files, that can be done with several tools. If the latter is what you want, then perhaps you can clarify your question.

ADD REPLY
0
Entering edit mode

Hi Sean, thanks. The problem is that if I map directly to the transcripts I loose the ability to build new transcripts, find splice junctions, etc. Also in many cases I am dealing with totalRNA experiments so mapping only to transcripts I loose many reads. Let us say I already mapped all my data and just want to transform coordinates, there is no way to do this? Maybe you are right and it does not make much sense that's why it does not exist...

ADD REPLY
4
Entering edit mode
2.6 years ago
opplatek ▴ 300

For others finding this thread, mudskipper is designed to do exactly what you are asking. It takes genomic BAM, gtf, and outputs transcriptomic BAM with all the alignments translated into transcriptomic coordinates. It will output multi-alignments in case genomic mapping overlaps multiple transcripts.

ADD COMMENT
1
Entering edit mode
ADD COMMENT
1
Entering edit mode
9.0 years ago

I believe you want this tool sam-xlate from UBU: https://github.com/mozack/ubu/wiki

It doesn't take a GTF, but rather a BED file in refFlat (I believe) format.

ADD COMMENT

Login before adding your answer.

Traffic: 1744 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6