Strategy to find transgene copy number and integration sites using only Nanopore simplex reads?
1
1
Entering edit mode
14 days ago
Mark ▴ 20

We've recently transformed an organism and have transgenic strains growing. We suspect some strains have higher copy numbers of the whole genome but we don't know how many copies or where these genes are going. Our lab has a Oxford Nanopore Ligation Sequencing Kit v14 (SQK-LSK114) and this is all we'd like to use to answer this research question.

Here are some facts about the organism.

Does anyone have any recommended strategies for how to do this? Could copy number be solved by using a k-mer spectra analysis (I don't know if it's possible with Nanopore singlex reads due to poor read quality)? Would read mapping to a reference be enough or should some sort of draft assembly be built before mapping? Are there any softwares/pipelines out there that specialize in doing this for nanopore? What would be your guy's recommended strategy for this problem?

sequencing transgenes ngs • 359 views
ADD COMMENT
0
Entering edit mode
10 days ago

Just build an assembly, it should work great using Flye.

Dorado correct might help as well if you have R10.4.1 data and want an extremely high QV assembly.

I don't have any great ideas how to solve your whole genome copy number assembly.

  • Maybe look at conserved genes and see if they are duplicated and show mutations ?
  • Do a diploid assembly after dorado correction with hifiasm ?
  • Align reads to your assembly and check for "heterozygote SNPs" eg with the tool longshot

You should be able to find your integration sites using blastn.

ADD COMMENT

Login before adding your answer.

Traffic: 1986 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6