Can anyone help me unalign BAM files that have been aligned using TMAP? I want to run my raw reads through GATK best practices but the BAM files we received from our core facility are already aligned using TMAP. Thanks!
Updated 2.6 years ago by Ram • written 9.8 years ago by Renesh
Basically I want to:
1. Mark and remove PCR duplicates with the Picard package.
2. Realign indels around known sites and perform base quality score recalibration with GATK (Genome Analysis Toolkit), in line with current best practices in the next-generation sequencing field for variant detection, to produce variant-caller-ready reads in BAM format.
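As a rough sketch only, the two steps listed above could be assembled as command lines like this. It assumes the legacy Picard `KEY=value` argument style and the GATK 3.x walkers (`RealignerTargetCreator`, `IndelRealigner`, `BaseRecalibrator`, `PrintReads`); newer GATK releases handle these steps differently. The jar locations and all file names are placeholders, not anything given in this thread.

```python
import shlex

# Placeholder jar locations (assumptions): adjust to your installation.
PICARD_JAR = "picard.jar"
GATK_JAR = "GenomeAnalysisTK.jar"


def preprocessing_commands(bam, ref, known_indels, known_sites, prefix):
    """Build argument lists for duplicate removal, indel realignment and BQSR."""
    dedup = f"{prefix}.dedup.bam"
    intervals = f"{prefix}.intervals"
    realigned = f"{prefix}.realigned.bam"
    table = f"{prefix}.recal.table"
    final = f"{prefix}.recal.bam"
    gatk = ["java", "-jar", GATK_JAR]
    return [
        # 1. Mark and remove PCR duplicates (Picard, legacy KEY=value syntax).
        ["java", "-jar", PICARD_JAR, "MarkDuplicates",
         f"I={bam}", f"O={dedup}", f"M={prefix}.dup_metrics.txt",
         "REMOVE_DUPLICATES=true", "CREATE_INDEX=true"],
        # 2a. Find intervals that need realignment around known indels.
        gatk + ["-T", "RealignerTargetCreator", "-R", ref, "-I", dedup,
                "-known", known_indels, "-o", intervals],
        # 2b. Realign reads inside those intervals.
        gatk + ["-T", "IndelRealigner", "-R", ref, "-I", dedup,
                "-known", known_indels, "-targetIntervals", intervals,
                "-o", realigned],
        # 2c. Build the base quality recalibration table.
        gatk + ["-T", "BaseRecalibrator", "-R", ref, "-I", realigned,
                "-knownSites", known_sites, "-o", table],
        # 2d. Write the recalibrated, variant-caller-ready BAM.
        gatk + ["-T", "PrintReads", "-R", ref, "-I", realigned,
                "-BQSR", table, "-o", final],
    ]


if __name__ == "__main__":
    # Hypothetical inputs, printed rather than executed.
    for cmd in preprocessing_commands("sample.sorted.bam", "ref.fasta",
                                      "known_indels.vcf", "dbsnp.vcf", "sample"):
        print(shlex.join(cmd))
```

Nothing is executed here; the commands are only printed so they can be checked against the installed tool versions before running them.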
For these analyses, you don't need the raw read files. You can use the BAM files you already have. Sort them if they are not already sorted, then use the sorted files for MarkDuplicates and further analysis.
So you don't need to extract the reads from your BAM files. If you still want to obtain raw reads, use the bedtools bamtofastq conversion tool.
Note that the FASTQ files you obtain from the BAM files may not contain the unaligned reads.
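A minimal sketch of the sorting and conversion steps described above, assuming `samtools` and `bedtools` are installed and on the PATH. The file names and the helper `run_if_available` are made up for illustration, and single-end reads are assumed; paired-end data would need a name-sorted BAM and the `-fq2` option of `bedtools bamtofastq`.

```python
import shutil
import subprocess


def sort_command(bam, out_bam, by_name=False):
    """Build a samtools sort command (coordinate order unless by_name is set)."""
    cmd = ["samtools", "sort"]
    if by_name:
        cmd.append("-n")
    return cmd + ["-o", out_bam, bam]


def bamtofastq_command(bam, fastq):
    """Build a bedtools bamtofastq command for single-end reads."""
    return ["bedtools", "bamtofastq", "-i", bam, "-fq", fastq]


def run_if_available(cmd):
    """Run cmd only if its executable is on PATH; return True if it was run."""
    if shutil.which(cmd[0]) is None:
        print("skipping (tool not found):", " ".join(cmd))
        return False
    subprocess.run(cmd, check=True)
    return True
```

For example, `run_if_available(sort_command("sample.bam", "sample.sorted.bam"))` would coordinate-sort a hypothetical `sample.bam` for MarkDuplicates, and `run_if_available(bamtofastq_command("sample.bam", "sample.fastq"))` would write its reads back out as FASTQ.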
You want to run your reads through "GATK best practices" to do what?
The best thing to do is to ask the sequencing core to provide you with the original FASTQ files. Also, beware of the homopolymer repeat errors in Ion Torrent data.
GATK accepts sorted BAM files. So why do you want to unalign them back to raw reads?
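One way to check whether a BAM file is already sorted is to read the `SO` field on the `@HD` line of its header, which can be printed with `samtools view -H`. The sketch below parses header text that has already been obtained that way; the function name and the example header are illustrative only.

```python
def sort_order(header_text):
    """Return the SO value from the @HD line of a SAM/BAM header, or None."""
    for line in header_text.splitlines():
        if line.startswith("@HD"):
            # Header fields are tab-separated TAG:VALUE pairs.
            for field in line.split("\t")[1:]:
                if field.startswith("SO:"):
                    return field[3:]
    return None


# Hypothetical header, as `samtools view -H sample.bam` might print it.
example_header = "@HD\tVN:1.4\tSO:coordinate\n@SQ\tSN:chr1\tLN:248956422\n"
print(sort_order(example_header))
```

A value of `coordinate` means the file is position-sorted; `unsorted`, `unknown`, `queryname`, or a missing field suggests sorting it first.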
I want to be able to use the same methodology as a previous study on a different type of cancer.
Can you please elaborate in brief?
In the meantime, I have the following program: https://github.com/grenaud/libbam/blob/master/removeTagsMapping.cpp
I am trying to find mutations that are present in diseased tissue versus the patient's blood.
So you want to run base quality score recalibration, indel realignment, and so on? You need mapped reads for this.
I do, but I'm trying to use the Galaxy Project, and that won't work with TMAP-aligned reads.