Question

Tumor purity estimation by allele frequency of COSMIC identified somatic mutations

1

Entering edit mode

7.2 years ago

ejoffe ▴ 20

Hi,

This is very possibly a layman question …..

I have a MAF file with sequencing data for lymphoma specimens. I have no data regarding the tumor purity of the samples. There are no matched normal samples. Germline mutations have been filtered out based on mapping to the 1000 genome. Each mutation is mapped to COSMIC and tagged as pathogenic, likely pathogenic or unknown.

I have read about the various tools for estimating sample purity from the sequencing data (e.g., CNVkit, THetA2, FACETS etc.).

However, I was wondering if there is an approach that uses the fact that the COSMIC mapped somatic mutations are supposed to be unique to the tumor cells in order to estimate the tumor purity and to normalize the values of the allele frequencies.

Thanks, E

R sequencing next-gen • 3.3k views

ADD COMMENT • link 7.2 years ago by ejoffe ▴ 20

1

Entering edit mode

Germline mutations have been filtered out based on mapping to the 1000 genome

This only excludes COMMON variants that are present in 1KG. Still, having a matched normal, you would identify thousands of germline mutations that are not covered by 1KG in a WGS sample. Without matched normal, there is no way to discriminate somatic from germline variants.

ADD REPLY • link 7.2 years ago by ATpoint 85k

0

Entering edit mode

Correct. See the recent ISOWN paper, where they tried really hard to distinguish germline from somatic in tumor-only samples and still, lots of germline events slipped through.

ADD REPLY • link 7.2 years ago by Chris Miller 22k

0

Entering edit mode

Partly because they don't adjust for purity and copy number, as they state in the Discussion. With normal contamination, there are ways to discriminate somatic from germline. At least there are ways to calculate those probabilities accurately.

ADD REPLY • link 7.2 years ago by markus.riester ▴ 550

1

Entering edit mode

COSMIC mapped somatic mutations are supposed to be unique to the tumor cells

There are actually many germline variants in COSMIC, since a lot of them have never been validated. COSMIC mutations have a "confirmed somatic" field to distinguish truly somatic from questionable.

ADD REPLY • link 7.2 years ago by igor 13k

score 1 · Answer 1 · 2017-09-09

1

Entering edit mode

7.2 years ago

ejoffe ▴ 20

Thank you all for your answers !!!

ADD COMMENT • link 7.2 years ago by ejoffe ▴ 20

score 0 · Answer 2 · 2017-09-08

And how will you know if those mutations are in the founding clone or a subclonal population? Or if they are copy-number altered, skewing their VAFs up or down, depending on which allele is lost? Or perhaps both CN-altered and Subclonal?

Purity, ploidy, and copy number inference are all inextricably linked in tumor samples, which is why those more complex methods exist.