I have not had a chance to try PEAR.
I assume some people would like to merge PE reads into SE reads when most of the inset size is less than 2*read length.
1) How PEAR treat un-overlapped PE reads? discard them?
2) What kind of advantage PEAR brings? I imagine the mapping accuracy won't change much. Structural variants are less accurately identified without pairing information. I can't imagine anything good out of this implementation...
:) Depends on the assembler, but yes, it can be very useful. It's also useful for identifying longer insertions when calling variants.
This is useful for identifying longer insertion only because the insert size is shorter than the sum of paired reads, right? Otherwise, The paired end reads plus the known insert size would provide more information when identifying long insertion. Am I right?
Specifically, a normal aligner can only call insertions in cigar strings if the insertion is shorter than read length, and a variant caller based on cigar strings will only call variants recorded there by the mapper. So the longer the reads, the longer the insertions that can be called. When paired reads have an insert size longer than read length but less than double read length, they can be merged by the overlap to produce a single longer read that allows longer insertions to be called.
I guess in best case the insert size should longer than the sum of paired end reads so that the insert size can server as additional information when calling structure variant (longer insertion).
When the interested region (amplicon) is short and insert size can't provide additional information, merging them into SE would be an alternative choice, right
Yes, it makes analysis much simpler too.