Hi all,
I am trying to get SNP locations. What I have done is that downloaded the dbsnp database in the ensembl and used the command line
grep -f snps.txt homo_sapiens_snp.vcf > matches.vcf
to get my target SNP info. There are about two million SNPs in my snps.txt, and all snps info in dbsnp database are stored in the file "homo_sapiens_snp.vcf". The problem is that the process got to be Killed
and I cannot get the snps' locations. Do you guys know how to fix this?
Thank you in advance!
Try the suggestions from the thread Question: how to get a subset of vcf file for specific SNPs.
Please tell us if those work or not. If they don't, also tell us how much memory do you have available.