How to subset the mutations that occur just in my interest genes within the vcf file?
1
0
Entering edit mode
2.4 years ago
Zahra ▴ 110

Hi all,

I have a vcf file that I perform the BED file by vcftools to subset the mutations in my interest genes. I have downloaded the BED file from the UCSC (selecting the BED file for the whole gene of my gene list). However, after performing the vcftools, I have found some mutations in genes that weren't in my gene list. How to subset the mutations that occur just in my interest genes?

Thanks for any help.

vcf vcftools BED • 706 views
ADD COMMENT
1
Entering edit mode
2.4 years ago
ATpoint 85k

Extract Sub-Set Of Regions From Vcf File

Use tabix, http://www.htslib.org/doc/tabix.html, and see its -R option to only retrieve VCF entries that overlap the BED file you provide for that argument. That would need to be a BED file with the coordinates you are interested in, e.g. the entire gene interval, or its exons. You could get the coordinates directly from a reference GTF file.

ADD COMMENT

Login before adding your answer.

Traffic: 1830 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6