Hi all,
I have recently sequenced the exact same sample (within the space of 24 hours) using an ABI 3130 sanger sequencer to sequence a given region as well as by NGS analyses (Nextera PE on a nextseq 500). The sample was a lyophilized live attenuated vaccine (Highly passaged DNA double stranded virus)
To my surprise (and disappointment), the sequences don't match. I've attached a screenshot of the Sanger sequence aligned to the reference genome (image 1). There don't appear to be any peaks under peaks that suggest a heterogenous sample. I've also attached the mapped reads of the Illumina data, and there are no reads that contain the two variations seen in the Sanger data.
Anyone have any idea about what could be going on?!
Thanks in advance
Show the bit to the left on the top one. Given the other samples in there, it looks like the polyT quality is decreasing toward the end in the sanger reads and you're just missing a T call.
Or do you mean the G->A and T->C SNPs?
Apologies, I mean the the G>A and T>C SNPs. The first columns of both the mapped reads are the positions corresponding to the SNPs in the NGS data (I know it's quite hard to see from the images). But there was no evidence of an Adenine in any of the mapped reads in the first column of image 2 (corresponding to the G>A SNP) or any evidence of a Cytosine in any of the mapped reads in the first column of image 3 (corresponding to the T>C SNP).
In addition to the reply from swbarnes2, please comment on the mapping quality of the Illumina reads. Also, do you have sanger reads in the other orientation or from a separate primer site that cover these positions?
Can you add a bit of information about how you aligned the reads? The Sanger reads were aligned to the genome or to a locus?
I'm not so familiar with viral sequencing though.
The sanger reads have been aligned to the exact same genome (using the function in ugene) used to map the NGS reads (using bowtie2). I also conducted a de novo assembly, and used Bandage to blast search the contigs, but I couldn't find the other sequence anywhere in the NGS reads.