Use jellyfish with paired end sequencing data
1
0
Entering edit mode
7.4 years ago
db • 0

I am wondering how exactly we should run jellyfish to count k-mers for paired end sequencing data. Is it enough to concatenate the forward and reverse read (using cat) or do we need to merge the reads (using something like PEAR)?

jellyfish • 5.0k views
ADD COMMENT
3
Entering edit mode
7.4 years ago

Concatenating the files works fine (I think you can just give it multiple files, but the syntax in the help message is a little confusing). Merging the reads is also a possibility, but you don't really need to do so in this case, and note that it would change the counts and yield even more files (merged, r1 unmerged, r2 unmerged).

ADD COMMENT
0
Entering edit mode

Thank you Brian. Are you saying it is ok to concatenate based on your experience (not that I don't trust you)? It's just that I could not find any documentation on how to deal with paired end data.

ADD REPLY
1
Entering edit mode

For pure kmer counting, read pairing does not matter, and concatenated fastq files are still valid fastq files. I have never actually concatenated two files and then run Jellyfish on the result, but I have done that with various other programs.

ADD REPLY
0
Entering edit mode

Thanks you very much!

ADD REPLY

Login before adding your answer.

Traffic: 1088 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6