Question

How To Over Come With Unequal File Size Of Fastq [Paired End Reads 100+100] During Bowtie2 Mapping

1

Entering edit mode

11.8 years ago

Raghav ▴ 100

hello Every one,

I have got another serious problem, when I was running bowtie2 align for paired end reads I have got an error which are mentioned bellow:

[cdac@nbri bowtie2-2.1.0]$ ./bowtie2-align -x tair10ref -1 /storage/home/cdac/raghav/sratoolkit.2.3.2-4-centos_linux64/bin/LIBSG306_R1_filtered.fastq -2 /storage/home/cdac/raghav/sratoolkit.2.3.2-4-centos_linux64/bin/LIBSG306_R2_filtered.fastq -S input.sam

Error, fewer reads in file specified with -1 than in file specified with -2 terminate called after throwing an instance of 'int' Aborted.

my read1 fastq file is lesser in size as compair to read2, As per as I am getting here it is just because of unequal reads in in both fastq files.

But, what is best possible way to over come this problem???? is there any technique or syntax or tools which could help me out.

fastq bowtie2 mapping • 15k views

ADD COMMENT • link updated 11.0 years ago by Biostar 20 • written 11.8 years ago by Raghav ▴ 100

1

Entering edit mode

Are these trimmed reads, wherein you trimmed the two fastq files separately? If so, just retrim the original fastq files with a trimmer that can deal properly with paired-end data. If these are fastq files from the sequencer, you might contact whomever sequenced the samples and see if you can redownload the data; presumably one of the files is corrupt or otherwise truncated. Those are the easiest solutions.

ADD REPLY • link 11.8 years ago by Devon Ryan 105k

0

Entering edit mode

Dear dpryan, actually i have got illumina sequenced data as fstq1 and fastq2 file and data provider has mentioned that they have filtered these files at 20 phred quality. I am new one in field of NGS data analysis, so I do not have much idea of technicality, like what did went wrong with data,

No, this not a trimmed data, is there any difference if if we trim fastq1, fastq2 file individually and trim both file together [how do we trim it together???]

ADD REPLY • link 11.8 years ago by Raghav ▴ 100

1

Entering edit mode

It sounds like the sequencing provider screwed things up. With paired-end data, one needs to trim or otherwise process both files at the same time. The reads in them are in the same order, so if you filter out a read from fastq1 but not fastq2, then all subsequent reads in the files will be out of sync. What they should have done is used a program that would filter/trim reads and write unpaired reads to a separate file. This is the way most trimmers work.

In this case, you're best off following the links provided by ashutoshmits. If you use this sequencing provider again, tell them not to do this next time, since they're just making more work for you.

ADD REPLY • link 11.8 years ago by Devon Ryan 105k

0

Entering edit mode

yes..... I trapped here as you said earlier and I tried it at my level best to over cum with such problem like to remove unique reads from either files [fq1 and fq2] and synchronize both reads properly which keep only paired reads data but at the end of day unable got any fruitful result. please help or suggest me if you have any idea to over cum such a problem, thank you

ADD REPLY • link 11.8 years ago by Raghav ▴ 100

0

Entering edit mode

What have you tried and what have the results been? If push comes to shove, you (or I, as a last resort) can write a simple bash/python/perl script to do this sort of thing.

ADD REPLY • link 11.8 years ago by Devon Ryan 105k

0

Entering edit mode

before going to write any script, I tried to find out some existing program and fortunately I have got a python script fastqCombinePairedEnd.py written by Eric Normandeau, which resole it perfectly and it is working properly I tried no. of combinations to find bug, at last it succeeded to resolve my problem. Thnq for guiding me.. :)

ADD REPLY • link 11.8 years ago by Raghav ▴ 100

1

Entering edit mode

Some related posts. There are many that deal with the same topic. So search thoroughly and you should be able to get something.

How to sort two mate pair (fastq) files so that the order of the identifiers is the same?

filtering paired end reads

ADD REPLY • link 11.8 years ago by Ashutosh Pandey 12k

0

Entering edit mode

Thank you for providing such a fruit full link

ADD REPLY • link 11.8 years ago by Raghav ▴ 100

score 2 · Answer 1 · 2013-06-05

2

Entering edit mode

11.8 years ago

Ashutosh Pandey 12k

This post does exactly what you want. See the answer from the user lh3 (Dr. Heng Li)

Combining the paired reads from Illumina run

ADD COMMENT • link 11.8 years ago by Ashutosh Pandey 12k

score 1 · Answer 2 · 2013-06-04

1

Entering edit mode

11.8 years ago

Ashutosh Pandey 12k

I use the following tool to trim my paired end reads. It takes care of the orphan reads.

https://code.google.com/p/ea-utils/wiki/FastqMcf

ADD COMMENT • link 11.8 years ago by Ashutosh Pandey 12k