Sometimes I've interested in highlighting specific positions within the sequence fields of a BAM file using something like IGV. Is there any standardized way to accomplish this with e.g. a SAM tag such as:
YY:B:(array of integers specifying positions to highlight)
In general there seems to be a lack of display and styling metadata for SAM, and given that formats like BED have numerous (sometimes loosely defined) styling options (see UCSC color format). FWIW I'm aware of the UCSC-sanctioned "YC" tag, which sets the color for the entire alignment and is recognized in IGV, but I'm thinking of something that would affect only certain positions in the alignment. Thanks!
Quick question: Can you point me to documentation about the YC tag?
A follow-up: We could probably add a simple plug-in to IGB (bioviz.org) that would enable color-coding bases using a YY:B: attribute. Already there is a mechanism to color-code bases by score, so adding this would not be too hard. IGB also supports SAM. (No need to make an index.)
Certainly. The UCSC documentation for that tag is here. Don't take me literally about designating a "YY" tag with a byte array type code, as this is just a hypothetical example.
Keep in mind that since tags beginning with Y are custom, it's likely that some aligner already uses that (though probably not with a byte array). This might be a good excuse to add a new auxiliary tag to the specification.
:) Yes, I already struggle with other people using X/Y/Z prefix tags. In fact, it seems like some of the X-prefix tags have become standardized enough that they should have been included elsewhere in the tag namespace of the specification.
Yeah, the XS tag is a good example of that.