Hi everyone,
I have a question about bowtie2 in mapping.
I have my paired-end sequencing data, but due to some reason, I got a relatively low read number for the R1 data. But the R2 data have much more reads than R1 data. In that case, if I mapping genome in pair-end way, I think the total reads which can be mapped onto genome will be limited due to the low reads number in R1.
So, can I just use R2 data to do the single end mapping onto the genome? If yes, because R2 contains the sequencing reads for the reverse complementary strand, what parameter should I use when I run bowtie2 to make R2 reads mapping the reverse complementary strand of the genome?
Thanks for kind help!
Zhenhua
Thanks for your answers, which are very helpful. You solved most of my questions. I will check the data and to see if there are more problems come out.
Hi, I got further questions. As I mentioned before. My R1 file contains more reads than R2. So when I run bowtie2 in pair-end way, at the beginning, the program run properly and I can see the output SAM file size increasing, which means the results are written in the SAM file. However nothing was written in the file I designated for putting the unaligned reads. The biggest problem is the program will die when the output SAM file reached around 1GB with the information saying
Error, fewer reads in file specified with -1 than in file specified with -2, terminate called after throwing an instance of 'int' [bam_sort_core] merging from 2 files... bowtie2-align died with signal 6.
I think this error information tell me the problem I know about the data which is the R1 file have more reads than R2. So do you have any idea about how to let bowtie2 run pair-end mapping regardless of my data problem?Many thanks.
You can't mix paired-end and single-end data with bowtie2.
As Devon replied you can't use the mixed data, you will have to create two set of files. First set of files will represent paired-end data and second set of files will represent orphan reads or single end data. I already posted a link in my answer that will help you to create the ordered paired-end fastq files. If your paired-end fastq files are already ordered but the only problem is with the less number of reads in one file, you can simply remove the orphan reads from the other fastq files to create paired-end fastq files. The removed reads can then be aligned as a single-end reads. It should be fairly simple. Basic UNIX commands should be sufficient.