How to detect de novo variants with trio-data
2
0
Entering edit mode
2.1 years ago
Shin Taguchi ▴ 40

Hi, all.

I have a trio-dataset of a kind of fish with whole genome sequence: father, mother and offspring. I want to get information of de novo variant sites from this trio-dataset.

How can I check for variants present in descendants that are not present in either parent, and look for variant sites with Mendelian violations?

I'd appreciate any ideas on creating a methodology to go about this analysis.

WGS trio denovo variants • 1.3k views
ADD COMMENT
0
Entering edit mode
2.1 years ago
pragnapcu ▴ 10

Look for KGGSEQ

ADD COMMENT
0
Entering edit mode

Thank you for your advice!

ADD REPLY
0
Entering edit mode
2.1 years ago

gatk: gatk VariantAnnotator -R "ref.fasta" -V fam.vcf -A PossibleDeNovo -O fam.denovo.vcf -ped fam.ped

bcftools https://samtools.github.io/bcftools/howtos/plugin.trio-dnm2.html

ADD COMMENT
0
Entering edit mode

@Pierre Lindenbaum

Thank you for your comment.

When using GATK, is the file specified in option -V a VCF file with the trio (father, mother, child) merged and variants called?

ADD REPLY
0
Entering edit mode

merged and variants called? yes

ADD REPLY
0
Entering edit mode

Thank you very much!

I'm considering the following pipeline in broad terms. How do you think about this?

Raw data

Adapter trim (Trimmomatic)

Mapping (BWA)

Sorting and removing duplicates (samtools and picard)

Gnenotyping each individual (GATK: HaplotypeCaller)

Merging trio (GATK: CombineGVCFs)

Joint genotyping (GATK: GenotypGVCFs)

Select variants (GATK: SelectVariants)

Hard filtering (GATK: VariantFiltration)

Selecting de novo variants (GATK: VariantAnnotator)

ADD REPLY

Login before adding your answer.

Traffic: 2531 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6