Hi all,
I have the information of whole-genome sequencing (BAM files) of about 1100 individuals (human) that should be processed to find all variants of specific genes. I read and browsed a lot, however as it’s my first experience in this filed, I haven’t obtained a final conclusion. Could you please kindly share me your experiences about the following issues?
1) Is it better to do variant calling analysis on the whole BAM file or extract the region of interest from BMA file by a tool such as VariantBam enter link description here. However, I found just one citation for this tool, if you suggest extracting the region of interest from BMA file, please let me know any alternative tool for this purpose?
2) For variant calling analysis, please kindly tell me how to process 1100 BAM files? Processing each BAM file for variant calling, then merge resulting vcf files or merging all BAM files and doing variant calling analysis on this merged BAM file, which one do you suggest?
3) Finally, how I can match the discovered variants with all previously identified variants for genes of my interest? Please share me your suggested tool.
Many thanks in advance for any help
Are you looking for germline or somatic variants ?
If you end up using GATK, regarding your focus on specific genes, you can use an interval_list to restrict the search (and save time).
Thanks, looking for germline variant.