CNV detection
2
1
Entering edit mode
3 months ago
heureuse ▴ 10

Hello all,

I saw a post about detecting CNV from VCF file. People advised to avoid doing that and use a bam file instead.

My concern is I think that VCF or gVCF can be a good alternative when we have large cohort (400 000 sample for example). Algorithms based on BAM files consume a lot in terms of time and computational resources.

What do you think about that ? Thanks

ressources CNV Large_Cohort computation • 471 views
ADD COMMENT
2
Entering edit mode
3 months ago
cmdcolin ★ 4.2k

VCF files, just plain SNP VCF files for example, can provide several "signals" of information for detecting CNVs including

  • calculating the B-allele frequency ("BAF") from e.g. a normal SNP VCF, and plot this value or use it as input to a CNV detection

see https://cnvkit.readthedocs.io/en/stable/baf.html https://help.dragen.illumina.com/product-guides/dragen-v4.3/dragen-dna-pipeline/small-variant-calling/baf-output

  • calculating the read depth with the DP field from a normal SNP VCF

you can get several results looking up "detect CNV from VCF", and indeed, many pipelines like (from cursory looking: cnvpytor, Illumina/canvas) will use both BAM and VCF for adding the B-allele frequency information

ADD COMMENT
0
Entering edit mode
3 months ago
andrew ▴ 10

Vcf and gvcf are more efficient than bam files

ADD COMMENT

Login before adding your answer.

Traffic: 2493 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6