Hi,
I have two bam files from ICGC. During mutation calling, vardict/mutect/freebayes/varscan split bam to by its chromosomes. Those GL00 parts also appear just like chr1-2-3-4-5-6-7 appear. I check what they are and found they are contigs?!?(forgive my ignorance) How should I treat them ? Will they cause a problem during variant calling output interpretation? I read one previous thread about this but there were not any information about how to treat them. I need your help.
Best,
Tunc.
Thank you very much !
Igor,
I have a question. Can I remove those locations from my bam file ? Would that be feasible ? since they have unknown location would that be valuable for obtaining any data?
Thank you for the information again,
Best,
Tunc.
You may then lose multi-mapping reads or reads that would have aligned to regular chromosomes if the random ones were not present.
If you really want to do that, you should be able to do something like
samtools view -h file.bam | grep -v "_random" ...