Alignment of single/paired end reads with rsubread align
0
0
Entering edit mode
4.0 years ago
qstefano ▴ 20

Hi biostars,

align function by Rsubread package allows to set parameters according to reads layout (single or paired), since i download fastq files from SRA, if the layout is paired, i obtain a single fastq files containing both pairs.

The question is: if i don't specify if fastq file is paired or not, align would detect "automatically" this and consequently align reads correctly? or align the reads as if it were a single layout?

any reply will be appreciated, thanks.

RNA-Seq alignment R sequencing genome • 1.1k views
ADD COMMENT
2
Entering edit mode

You can easily split the files using reformat.sh from BBMap suite.

reformat.sh in=input.fastq out1=R1.fq out2=R2.fq

You could have also used --split-files option with fastq-dump to get pair of files.

ADD REPLY
0
Entering edit mode

@genomax just as an aside, would reformat.sh produce an empty R2.fq if the input is unpaired reads?

ADD REPLY
1
Entering edit mode

Yes. If inputs can't be recognized by /1 or /2 (old Illumina data) or 1:N: or 2:N: (current format) at the end of read headers.

ADD REPLY
0
Entering edit mode

That's clear, thanks!

ADD REPLY
0
Entering edit mode

That is for interleaved files. Be sure that fastq-dump did not simply concatenate the mates into a single long read which it sometimes does. Confirm that you have one entry per mate otherwise alignment is going to be a mess.

ADD REPLY

Login before adding your answer.

Traffic: 2332 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6