Question

somatic variant calling without matched normal in long-reads

2

Entering edit mode

19 months ago

eebloom ▴ 110

When calling somatic variants in ONT cancer data (from tumours), is there grounds for using matched short-read (illumina) WGS from normal (blood) to remove germline variants if matched normal sequencing from long-reads isn't available?

Leading on from this, would it be sensible, given a wider cohort of short-read germline variant calls from many individuals, to generate a larger panel of normals to subtract from long-read tumour sequencing variant calls?

The primary reason being the expense of sequencing normal tissue if matched SR-WGS is already available...

Most studies have matched tumour-normal pairs, but I have found an example where a panel of normals is constructed from a mixture of long- and short-read sequencing data.

And another where SVs from multiple sequencing technologies were filtered against 15 healthy genomes sequenced with pacbio.

Some tools such as nanomonsv have a panel of normal function included, which makes use of 30 healthy ONT normal samples from the human pangenome reference consortium.

Then there is the question of population databases such as dbSNP and gnomAD, which are also not based on long-read data (to my knowledge)

variants cancer illumina ONT nanopore • 1.7k views

ADD COMMENT • link 9 months ago by eebloom ▴ 110

0

Entering edit mode

I like this question. Thank you for digging into this!

ADD REPLY • link 19 months ago by Ram 45k

0

Entering edit mode

See also https://github.com/KolmogorovLab/Severus for another tool using tumor-normal pairs.

I guess you could use a PoN based on short reads, but that will inherently be incomplete so you will always miss things that are only detected with long reads.

ADD REPLY • link 19 months ago by WouterDeCoster 48k

0

Entering edit mode

What about using a healthy genome sequenced with long-reads e.g. HG002 ?

ADD REPLY • link 19 months ago by eebloom ▴ 110

score 1 · Answer 1 · 2024-02-29

1

Entering edit mode

18 months ago

trausch ★ 2.0k

We are working on such a panel-of-normal SV set for 1000 Genomes samples: ONT SV analysis. We hope to release delly's, sniffles's and SVarp's long-read SV calls for this in April and then it should be straight forward to filter somatic SVs against this.

ADD COMMENT • link 18 months ago by trausch ★ 2.0k

0

Entering edit mode

Great! I know the latest gnomad release (v4.0) has been benchmarked with long-reads but actual long-read calls is even better!

ADD REPLY • link 17 months ago by eebloom ▴ 110

0

Entering edit mode

Severus have now included this panel in their somatic variant caller for tumour-only samples

ADD REPLY • link 9 months ago by eebloom ▴ 110