Hi . I am using 'TABLET SEQUENCE VIEWER' now. but I don't really understand about difference between 'clipping' and 'unmapped' and 'skipped' with CIGAR. And if any CIGAR value is '36S76M" , then what's going on 37~75 base?? Thank you.
Hi . I am using 'TABLET SEQUENCE VIEWER' now. but I don't really understand about difference between 'clipping' and 'unmapped' and 'skipped' with CIGAR. And if any CIGAR value is '36S76M" , then what's going on 37~75 base?? Thank you.
36S76M
means 36 nucleotides are soft-clipped (nts that are not part of the alignment but present in the read) + 76 alignment matches.
I suggest reading the SAM specifications document. Page 1 and 2 have a good example that addresses your question about alignment and CIGAR operators.
Unmapped information always shows in FLAG with 0x4, with no information in CIGAR.
clipping(S/H) and skipped(N) information shows in CIGAR.
Taking an example,
if read A is unmapped, it's FLAG includes 4(Bit) and it's CIGAR is '*'.
if read A is mapped, but only part of it is mapped, it's FLAG does not include 4(Bit) and it's CIGAR is like '36S76M'(means 36nts is unmapped to reference but these soft-chipped sequences are stored in SEQ).
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
If a read is unmapped (0x4), the CIGAR can be any legal CIGAR. The SAM specifications do not require a CIGAR of
*
for unmapped reads.