cnvkit segmetrics on multiple files
2
1
Entering edit mode
7.1 years ago
adoak ▴ 10

Is there a way to run segmetrics on multiple files at once? or incorporate it into batch? Additionally, I have certain segments or chromosomes that keep showing up as having no variation. Is this a problem with mapping?

cnvkit cnv • 1.8k views
ADD COMMENT
0
Entering edit mode
7.1 years ago
Eric T. ★ 2.8k

It's not currently part of batch, so I'd recommend running it in a loop in Bash or a Makefile.

No variation, or no reads? The latter would be an issue with mapping, the former maybe not. Which chromosomes or regions are causing trouble? I'd expect that regions close to centromeres and telomeres, as well as chr6 and chrY, would be noisy. Using more samples in your pooled reference can help to both reduce noise at these regions and improve the sensitivity of calling in the rest of the genome.

ADD COMMENT
0
Entering edit mode
7.0 years ago
adoak ▴ 10

In the .cns data it looks like very low variation, but when mapped to a diagram the gene names do not show up and the sections are blank. I'm missing a section of chromosome 15, 14, and X. The sample size is currently around 30, maybe increasing would help. The missing sections are consistent for each sample.

ADD COMMENT
0
Entering edit mode

Are those missing regions the centromeres?

Some portions of each chromosome are unmappable, i.e. difficult to sequence reliably. In the reference genome sequence (FASTA) these are masked out with 'N' characters, and in your BAM you will see there is no sequencing coverage in these regions. CNVkit excludes these from its output, so you can expect see holes in your data around these regions.

ADD REPLY

Login before adding your answer.

Traffic: 1580 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6