This week I have had my first experience working with ABI SOLiD data and all of the wonderful subtleties of colorspace data, double encoded Fasta, etc. I know there are tools that support processing data in these formats, which makes me wonder...why? Why would you want to work with colorspace or double-encoded data if you could just convert to the traditional base space? Is this conversion lossy? Is there some benefit of encoding dinucleotides as opposed to single nucleotides?
we have a tutorial for this: http://www.biostars.org/post/show/43855/transforming-and-manipulating-color-space-reads/
Excellent tutorial by the way. BioStar is the first place I came looking for answers, and that tutorial is what gave me my first lesson in colorspace, double encoding, et al. I guess I just missed the significance of errors during my first read-through, and how this is much more manageable if the reads are kept in colorspace.