I'm having trouble to understand the meaning of having overlapping HSP after a BLAST.
The thing is: I have a sequence from NGS (Illumina Miseq) and assembled with SPAdes 3.6. When I BLAST it I got overlapping HPS. The contig I get is a bit bigger (around a hundred nucleotides) than the BLAST hit indeed, but since HPSs overlap, shouldn't they have been assembled previously? I mean is that an assembly issue(/error) or something else?
I thought it could be a disassembly because that case happened often (18 out of 25 contigs) and for genomes that don't have a large diversity (I think it should eliminate the assumption of a real duplication).
I may have forgotten that I'm working on one viral family, circular genome, and what baffles me the most is that concerning18 out 25 contigs (having the size of an entire genome from this family) the corresponding overlapping HSPs seem back to front regarding reference genomes (blast hit): the end of the contig is aligned with the beginning of the ref genome and vice versa for the start of the contig and the end of the ref genome.
At first glance I thought it was all about circularity (since I verified the order of the ORFs in the contigs compared to the one from the original genomes, and it was the same (don't know if I'm making myself clear)), but I got rid of the part in the contig that make the genome circular (the biggest repeat located at the extremities of the contig) and after re-assembling I still have this "overlapping-back-to-front-hsps" situation.
I really don't know what to think about this! I'm still torn between misassembles or just a circular issue, that's why I wanted to know more about the meaning of those overlapping HSPs.
It seems you could design a pair of primers to solve this question: make the primers flank the offending region, and design them to make easy differing the expected PCR sizes (with and without duplication) on the gel.