variant calling through gatk
0
0
Entering edit mode
4 weeks ago
ashaneev07 ▴ 40

Hello all,

I have been working with a sample dataset and have developed a workflow that generates a VCF file through GATK. This is typically used for identifying genetic variants in human samples. It would mean a lot to me if anyone could kindly review my commands, as your feedback would help boost my confidence in my approach. Only then can I apply this to the original sample.

Thank you..

    bwa mem -t 4 -R "@RG\tID:sample003\tPL:ILLUMINA\tSM:sample003" /hg38/hg38.fa sample_R1.fastp.fastq.gz sample_R2.fastp.fastq.gz  > s
```
\tPL:ILLUMINA\tSM:sample003" /hg38/hg38.fa sample_R1.fastp.fastq.gz sample_R2.fastp.fastq.gz | samtools view -Sb - > sample003.paired.bam 

samtools view -Sb sample003.paired.sam > sample003.paired.bam
samtools sort sample003.paired.bam -o sample003.paired_sorted.bam

gatk MarkDuplicatesSpark -I sample003.paired_sorted.bam -O sample003.sorted_dedup_reads.bam 

gatk BaseRecalibrator -I sample003.sorted_dedup_reads.bam -R  /hg38/hg38.fa  --known-sites  /hg38/Homo_sapiens_assembly38.dbsnp138.vcf -O recal_data.table 

gatk ApplyBQSR -I sample003.sorted_dedup_reads.bam -R /hg38/hg38.fa --bqsr-recal-file recal_data.table -O sample003_sorted_dedup_bqsr_reads.bam 

gatk CollectAlignmentSummaryMetrics R=/hg38/hg38.fa I=sample003_sorted_dedup_bqsr_reads.bam O=alignment_metrics.txt

gatk HaplotypeCaller -R /hg38/hg38.fa -I sample003_sorted_dedup_bqsr_reads.bam -O 53_raw_variants.vcf 
variant-calling vcf workflow gatk • 226 views
ADD COMMENT

Login before adding your answer.

Traffic: 1740 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6