Question

Post filtering analysis for exome data

0

Entering edit mode

3.7 years ago

priya.bmg ▴ 70

Hello

I am following GATK pipeline to process exome data set. I am done with preprocessing step and filtered the dataset by hard filtering method. Now, I am looking for variants shared between the affected individuals. In the vcf file, I get the variant allele position and genotype in the affected individuals. For example, variant is G allele and reference allele is C, genotype is given as CC in individual 1, CG in individual 2, GG in individual 3 and CG in individual 4. So, here except individual 1, all the individuals share the variant allele. This is how I have to look for shared variants between affected individuals?.

After the hard filtering step, I should retain the variants that has high quality score in the vcf for subsequent analysis?. The variants has the label PASS label and use for subsequent annotation analysis?. Please advice

Thanks

shared rare Hard-filtering variants • 834 views

ADD COMMENT • link updated 3.7 years ago by Pierre Lindenbaum 166k • written 3.7 years ago by priya.bmg ▴ 70

0

Entering edit mode

please, stop asking questions without validating the answers or commenting

Decoy sequence for human reference GRCh38 assembly

Read group info

Alignment using bwa-mem2

Base recalibration -Java run time error and no sequence dictionary

Snakemake-Aligment using BWA-MEM2

etc...