Dealing with a large sample for variant calling
1
0
Entering edit mode
6.0 years ago

I have a large number of samples. Like a 100 samples at least (plus these are paired end reads). I aim to call variants on these samples and then predict their effects on protein structure dynamics.

The only way that seems possible for now is to align each sample individually, pre-process them individually, call on them individually and then combine them into a gvcf for analysis.

This however, seems very time intensive and computationally cumbersome. What would be the alternatives to this ?

I'm currently using standard bash script commands and plan to use various tools, viz. GATK, freebayes, varscan 2, pindel, etc.

NGS alignment variant-calling joint-calling • 1.5k views
ADD COMMENT
0
Entering edit mode
6.0 years ago

This however, seems very time intensive and computationally cumbersome. What would be the alternatives to this ?

GATK hapcaller in gvcf mode: https://software.broadinstitute.org/gatk/documentation/article.php?id=3893

enter image description here

ADD COMMENT

Login before adding your answer.

Traffic: 2792 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6