I have targeted sequencing from 2 different projects. One project was sequenced on SOLiD and the other on Illumina. I have independently called variants on both projects using GATK. Is is possible to combine the called variants (.vcf files) to allow for association analysis with a larger sample size? Do you have a recommendation on a tool that can combine files?
Thanks!
Can you describe more what you mean by LD-pruning and PCA checks?
I've used EIGENSTRAT to do PCA, but there are other tools that do this. If you plot your subjects along the first 2 PCs and they are clustered by sequencing technology then you don't want to use the combined data set to do association tests. Before you do PCA you want to do LD-pruning (can be done with EIGENSTRAT).