Variant calling analysis on phased assembly
1
0
Entering edit mode
17 months ago
pablo ▴ 310

Hi,

I have yeast assemblies : phased assemblies from hifiasm and "haploïd" assemblies from ipa assembler for the same samples. Both method gives good metrics : about 12.5Mb size assembly and < 30 contigs. For information, it is diploïd samples.

I would like to : do some variant calling first, and then, detect if there are some LOH events between my sample generations. I need to use one of my sample as reference genome to align my reads, because it is the "generation 0" sample.

What I did :

  • assembly (both hifiasm and ipa)
  • align my reads using pbmm2 to the reference
  • variant calling using pbsv

Now, should I only the "diploïd" assembly as reference? I think it could better because :

  • if I use the "haploïd" assembly : one heterozygous variant present in my reads should be recovered in one of my haploïd copy assembly. Then, during my reads alignment step, the same het variant from other samples will be detected in any case : for example, for a deletion ; detected once as a deletion and once as an insertion. But for this same homozygous variant (LOH event), I will be able to detect it only if the retained haploïd copy does not contain the variant.
  • it is always the main problem, using as reference genome one copy of a polyploïd genome?
  • any LOH events could be recovered suggesting het variants are correctly separated in my phased reference assembly?

Any suggestion?

Best

variant-calling bam pbsv • 805 views
ADD COMMENT
0
Entering edit mode
17 months ago

This isn't easy. I don't think the world of bioinformatics has a plan at present for performant (small) variant calling on multiple diploid references.

Maybe the toolset which currently comes closest is PGGB, which is intended for pangenomes. You get an odgi pangenome, and VCF with variant calls out of your pangenome (created from multiple fastas). Carefully consider naming of your haplotypes though before starting.

Minigraph or odgi pav might also be useful if you are only interested in larger variations like SVs.

ADD COMMENT

Login before adding your answer.

Traffic: 1808 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6