Codons from rna seq bam file?
1
0
Entering edit mode
5.0 years ago
protprot • 0

I am trying to identify codons from certain regions of interest in my bam files. Are there any tools to do this?

Linux • 1.1k views
ADD COMMENT
0
Entering edit mode

Is the reference a genome or transcriptome? Is the reference annotated? By "identify codons from certain regions", you mean you want to infer the most likely codon from the reads?

Can you provide a more concrete example?

ADD REPLY
2
Entering edit mode
5.0 years ago
Eric Lim ★ 2.2k

Not sure if there's a tool that would turn bed into codons or amino acids. If you were to do this yourself, bedtools getfasta (https://bedtools.readthedocs.io/en/latest/content/tools/getfasta.html) is one of the many ways to turn coordinates into sequences. Unless you're translating in all 6 frames, you'll need strand and reading frame. Both can be found in columns 7 and 8 of a gff file. Also, be aware of 0- or 1-based coordinates to make sure you're not off by 1. One last thing. If your coordinates contain spliced junctions, you'd need to parse and get only coding sequences, which could quickly complicate the process. This isn't likely a trivial exercise for someone who is very new to python, or programming in general.

ADD COMMENT

Login before adding your answer.

Traffic: 2111 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6