Hi all,
I want to assemble paired-end sequences and mate-pair sequences (HiSeq2000) together using SOAPdenovo.
On SOAPdenovo home page, mate-pair usage is written as follows: "Mate-pair relationship could be indicated in two ways: two sequence files with reads in the same order belonging to a pair, or two adjacent reads in a single file (FASTA only) belonging to a pair." (http://soap.genomics.org.cn/)
How can I convert raw mate-pair FASTQ file into proper format for SOAPdenovo assembly? Is there any converting script?
Thanks.
Thank you for your help. "reverse_seq=1" is a solution for using mate pair sequences! Below is SOAPdenovo home page's comments. "There are two types of paired-end libraries: a) forward-reverse, generated from fragmented DNA ends with typical insert size less than 800 bp; b) reverse-forward, generated from circularizing libraries with typical insert size greater than 2 Kb. User should set parameter for tag "reverse_seq" to indicate this: 0, forward-reverse; 1, reverse-forward."