Dear all, I have several RNAseq paired end datasets that I want to pool for input into a Trinity de novo assembly. The problem is that half of my datasets are of library type RF (first read (/1) of fragment pair is sequenced as anti-sense (reverse(R)), and second read (/2) is in the sense strand (forward(F))) and half of my data are FR: first read (/1) of fragment pair is sequenced as sense (forward), and second read (/2) is in the antisense strand (reverse).
Is there a way to convert the library type? Simply reverse complementing the reads will not work since then they won't be "pointing" at each other. Any ideas?
Good suggestion. He probably knows which libraries are FR and which are RF, so he can follow your suggestion. His assembly should be a bit better using strand information.
Oh, I guess I misinterpreted the question as "half my reads are RF". Yes, if all libraries are purely FR or purely RF, it's easy.
Thanks, Brian. I worked out which libraries were incorrect by aligning to another transcriptome and looking at the flags in the SAM output. With that I could calculate a ratio of "first read mapped to forward strand" to "first read mapped to reverse strand". Your suggestion about swapping the R1 and R2 worked and reversed this ratio so I'm good to assemble!
Glad to hear it! If you think this answer resolved the question, please "accept" it so that other people know it is resolved.