Bioscope Output Format?
2
0
Entering edit mode
12.6 years ago
khikho ▴ 100

I want to extract reads and ColorQualitys from unmapped reads of bioscope output, here is one line sample :

788_2043_1964    141    *    0    0    *    *    0    0    *    *    RG:Z:20110319033913614    CQ:Z:%+&%%&(%()%&%&%%-).+&'2-0)%485/(%).    CS:Z:G31102022112011103111111111011111111

As I guess 14th and 13th fields should be reads and quality respectively , Shouldn't it !? If so , Can you tell me why quality is not in the numeric format!?

mapping • 1.8k views
ADD COMMENT
1
Entering edit mode

Bioscope is an analytics tool sold by ABI - few people have access to it. Your best bet is to contact ABI support.

ADD REPLY
0
Entering edit mode
12.6 years ago

It's phred scale quality scores. The numeric format that the original .quals files are in takes up too much space and wouldn't fit into the .sam specifications.

These quality scores look like Sanger type phred quality scores as it includes the '%' character which is I think ascii 37.

If you really want numeric data, you'll have to convert the phred scale quality scores. You can use the python 'ord' function to get the numeric value of the ascii character. For example:

ord('%')

Would return 37. Then you just have to subtract 33 (Sanger phred scale is 33-126) from that ascii value to get the numerical score.

ADD COMMENT
0
Entering edit mode
12.6 years ago
GAO Yang ▴ 250

Which results? the mapping result for BioScope is just .bam format,I think~

ADD COMMENT

Login before adding your answer.

Traffic: 1806 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6