Doubt on filtering data from .VCF file
1
0
Entering edit mode
9.5 years ago
brunobsouzaa ▴ 830

Hi guys,

I'm new on exome sequencing and bioinformatics analysis so I was wondering if someone could help me. I've generated a .VCF file from my exome data and now I need to see which variant is related to my disease (ocular disease). Are there any package that can perform such analysis? I'm using microsoft excel to make some initial filtering like phred score and segregation but don't know where to go from now on!!!!

Thanks and sorry for any mispelling!

sequencing • 2.5k views
ADD COMMENT
1
Entering edit mode

Excel...? If you want to do bioinformatics analysis you should seriously consider to avoid using excel, and if you are using windows, you should consider even more to change to linux os. On the other hand, there isn't a straight way to know "OK, this variant is the one responsible of my observed phenotype". It's not as simple. First, it is important to discard as much as false positives without loosing too many true positive calls. For this, you can filter the vcf according to some parameters like, quality, coverage ...etc. You can do it using different software like snpsift, vcftools... etc. You maybe want to annotate the variants (using SnpEff, or another tool), to see the effects of those variants in the genes. Furthermore, if you have a list of genes related with the studied disease, you could extract the variants falling within those genes.

ADD REPLY
0
Entering edit mode

ADD REPLY
0
Entering edit mode

Thanks everyone.

Airan, I am using Linux os (Ubuntu) to perform the whole pipeline till I get the .VCF file! Thanks for your answer, I've found those tools on galaxy website, I'll try to use them. Also, I'll try to use R like Dhana said.

ADD REPLY
1
Entering edit mode
9.5 years ago
Dhana ▴ 110

You can try out R language, it is relatively simple to learn and efficient.

Use the package VariantAnnotation and GenomicFeatures from Bioconductor. It will be useful for your analysis.

The documentation and reference can be found in:

ADD COMMENT

Login before adding your answer.

Traffic: 2364 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6