How to check and extract if a SNP is heterozygous or homozygous in a vcf file from the SNP ID
0
0
Entering edit mode
8.1 years ago
akang ▴ 110

I have a vcf file and a list of SNP ids. I checked if SNPs are heterozygous/homozygous in my samples.

vcftools --vcf my vcf file.vcf --snp snp1 --extract-FORMAT-info GT | grep "0/1"

Now, I want to extract the sample ids that are either 0/1 or 1/1 and also i want to run an odds ratio test? Is there a way to do that in vcf tools?

VCF SNP • 5.0k views
ADD COMMENT
0
Entering edit mode

What is the format you need the data in ?

If you do not want to write a simple pysam script, you could do something like:

First clean the VCF such that you will end up with only het SNPs.

vcf-subset --exclude-ref --type SNPs in.vcf.gz > out.vcf

Then you could use the vcftools and no need to grep.

ADD REPLY
0
Entering edit mode

To grep multiple patterns you can use the following command:

egrep "0/1|1/1"
ADD REPLY

Login before adding your answer.

Traffic: 2636 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6