Hello,
I have downloaded a published MNase-seq dataset from the SRA archive, unpacked the .sra files, and obtained something like this:
@SRR430055.1 head0094_20100309_1_K562_2_16_40_F3 length=35
T322.2.232..222...3312.03333033333.3
+SRR490055.1 head0094_20100309_1_K562_2_16_40_F3 length=35
!+6-!&!/.5!!(/0!!!'%%+!&(1-,%%(+'%!/
@SRR430055.2 head0094_20100309_1_K562_2_16_106_F3 length=35
T321.2.320..220...3033.22300102021.3
+SRR430055.2 head0094_20100309_1_K562_2_16_106_F3 length=35
!78.!%!%%%!!,3)!!!%*&&!+5%'%)'2-,%!(
@SRR430055.3 head0094_20100309_1_K562_2_16_114_F3 length=35
Then I am trying to map this file with Bowtie, and it does not recognise it. I previously worked with Solexa, this is my first experience with SOLiD reads. Questions: 1) whether the reads above look normal, and if not, do you think there was a problem during unpacking from .sra? 2) If the reads look OK, what is a proper command line to run Bowtie with SOLiD reads? Thank you!
Bowtie or Bowtie2? Bowtie2 doesn't support colorspace AFAIK
Also, AFAIK SOLiD output is normally a combination of two files, a .csfasta file with the read info, and a QV.qual file with quality info. The reads you posted are in colorspace FASTQ format, which I am not sure if bowtie supports (I could be wrong).
Yes, but I meant the reads and quality information being in the same file. Whenever I used colorspace, I had separate files for reads and quality information.
Yes, I am using Bowtie1.
Could it be that I incorrectly unpacked the SRA file? I was using fastq-dump without additional parameters. Is the file listed above OK?
Aren't these symbols strange?
!+6-!&!/.5!!(/0!!!'%%+!&(1-,%%(+'%!/