We performed error-prone PCR on a segment of a gene (800 bp) and selected colonies with phage display. To identify the mutations that improve the function of the gene, we decided to use nanopore sequencing, as we are interested in identifying mutation patterns on the same molecule.
I used minimap to align the reads to the genome and then tried variant calling with Clair3. However, the variant caller returns no variants (there is no error; the VCF file is simply empty). How could I resolve this issue (different tools, different options)? I would appreciate your opinion on this.
I have attached the BAM files here.
Thank you so much for your response. I forgot to check the alignment statistics. I assume too many mutations were introduced during the process. I tried realigning with more lenient options (match = 10, gap = -4, mismatch = -2), but still no reads were mapped. Do you have any suggestions?
It is very unlikely that the number of mutations caused the reads to fail to align. The mutation rate needed to cause that would be very high - say above 20-30% - and with long reads you would still have regions that align.
You should not need to be more "lenient", beyond selecting the nanopore-specific setting in minimap, which you have already done.
When reads do not align, it typically means either that the sequenced DNA does not match the reference genome - for example, off-target products, contaminants, or host-related sequences dominate - or that the sequencing has failed.
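A quick way to tell these cases apart is to look at how many reads mapped at all before trying any variant calling. Below is a minimal sketch, assuming the alignments are available as SAM-format text (for example from `samtools view`); the function name `mapping_summary` and the toy records are made up for illustration. It only relies on the FLAG column of the SAM format, where bit 0x4 marks an unmapped read.

```python
def mapping_summary(sam_lines):
    """Count mapped and unmapped primary reads from SAM-format text lines."""
    mapped = unmapped = 0
    for line in sam_lines:
        # Skip blank lines and header lines (headers start with "@").
        if not line.strip() or line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        flag = int(fields[1])  # column 2 of a SAM record is the bitwise FLAG
        # Skip secondary (0x100) and supplementary (0x800) records so that
        # each read is counted once.
        if flag & 0x900:
            continue
        if flag & 0x4:  # 0x4 = read unmapped
            unmapped += 1
        else:
            mapped += 1
    total = mapped + unmapped
    fraction = mapped / total if total else 0.0
    return {"mapped": mapped, "unmapped": unmapped, "mapped_fraction": fraction}


# Toy records with hypothetical read and reference names, not real data.
example = [
    "@SQ\tSN:gene_ref\tLN:800",
    "read1\t0\tgene_ref\t1\t60\t10M\t*\t0\t0\tACGTACGTAC\t*",
    "read2\t4\t*\t0\t0\t*\t*\t0\t0\tTTTTGGGGCC\t*",
    "read3\t16\tgene_ref\t5\t60\t10M\t*\t0\t0\tACGTACGTAC\t*",
    "read4\t256\tgene_ref\t9\t0\t10M\t*\t0\t0\t*\t*",
]
summary = mapping_summary(example)
print(summary)
```

`samtools flagstat` reports the same kind of counts directly from a BAM file. A mapped fraction close to zero points to a mismatch between the reads and the reference rather than to a variant calling problem.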
Thank you so much for your comprehensive response and tips. I reached out to the person who designed the experiment, and it turns out that the sequence was codon-optimized for expression in yeast, so I guess that is why the alignment failed. After aligning to the modified gene sequence, alignment performance was almost perfect.
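For anyone who runs into the same thing, here is a small sketch of why codon optimization can break nucleotide alignment even though the protein is unchanged. The two sequences are invented toy examples, not the actual gene, and the recoded version simply uses arbitrary synonymous codons (no claim that they are the ones preferred in yeast). The codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string (length divisible by 3) into a protein string."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def nucleotide_identity(a, b):
    """Fraction of identical positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


# Hypothetical toy sequences: same protein, different synonymous codons.
original = "ATGCTGAGCCGTGCGCTGAGCGGCCGTCTG"
recoded = "ATGTTATCTAGAGCTTTGTCAGGTAGATTA"

same_protein = translate(original) == translate(recoded)
identity = nucleotide_identity(original, recoded)
print("same protein:", same_protein)
print("nucleotide identity:", round(identity, 2))
```

In this toy case the two sequences encode the same peptide but agree at fewer than half of the nucleotide positions, so the reference used for alignment has to be the recoded sequence that was actually cloned.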