--EDIT-- For a corrected review see next answer below
I had difficulties to retrieve the meaning of R in a cigar string from the annotation pipeline MAKER.
Looking around on internet I realised how confused was the information related to the CIGAR format. Different toosl use different operators
.
Here is the most shared and known one, the one related to the sam format:
I'm using Exonerate and here is its CIGAR format:
--EDIT-- THIS is not CIGAR but VULGAR format
Still no R... so (Helped by the MAKER developer) I finally found an old resource from FlyBase described the CIGAR format like that:
So, in order to gather all the information in one place I did a union of the different operators and end-up with this last table, hoping it would help some lost souls like I was:
--EDIT-- THIS table contains VULGAR format operators and the H from the CIGAR format is missing
I haven't checked carefully if some definitions can be contradictory (e.g for F and I), so any comment or correction is very welcome.
Thank you. Could you please post data as text and not as images, or put it as text file to GitHub and share the link.
Here is a link where you can have access to the tables.
Thank you for the post. Could you add/create a section on how to catch aligned bases according to the CIGAR, like shown in this post or this one. For example to highlight the fact that H and S do not impact the start position of the alignment.
Ref :
Query :
CIGAR :
With a starting position at
1
on the reference, the bases covered on the reference are :(1,7), (13,15)