Question

How To Combine Solid Ngs Reads From Technical Replicates

1

Entering edit mode

14.6 years ago

Ian 6.1k

I have four sets of SOLiD NGS reads that are technical replicates of the same biological sample. My instinct is to map the reads separately for each replicate and then combine them for downstream analyses.

However, does anyone know whether it makes any difference combining the reads pre- or post-mapping?

The only thing i can think might be a problem about the former is that read ids might be replicated...

Normally we wouldn't have technical replicates, but this was done during a trial run of the bar-coding work-flow on a SOLiD 4 machine.

solid replicates • 4.5k views

ADD COMMENT • link updated 11.2 years ago by Biostar 20 • written 14.6 years ago by Ian 6.1k

score 4 · Answer 1 · 2010-10-08

4

Entering edit mode

14.6 years ago

Istvan Albert 102k

If you merge your replicates early on you lose the ability to compare them later in the analysis. There does not seem to be any disadvantage to merging them as late as feasible.

Alternatively you can merge early but once you find something of interest you should verify that the data for each of your replicates does indeed support the new results.

ADD COMMENT • link 14.6 years ago by Istvan Albert 102k

3

Entering edit mode

If you're using BAM format to store your data, there's really no reason to lose provenance, even at the resolution of individual reads. Put each tech rep in its own Read Group - the Group is annotated in the header and each read is flagged with a Group identifier. Then you can merge them in files and analysis operations, but still unmerge them later.

ADD REPLY • link 14.6 years ago by biobot 0.0.77.a.1099 6.2k

0

Entering edit mode

As these are technical replicates i am not so much interested in common themes between biological samples, but more like the right way to combine them (as they are from the sample biological sample).

It may be for want i want to do it makes no difference when they are combined.

Sorry if i am not making sense!

ADD REPLY • link 14.6 years ago by Ian 6.1k

0

Entering edit mode

I see, though some of the issues mentioned still apply, you may be able to asses variability with respect to the techniques that the replication covers. The moment you merge you lose the fact that there were replication in your data. That may or may not matter in a particular study.

ADD REPLY • link 14.6 years ago by Istvan Albert 102k

0

Entering edit mode

Keith have you got any pointers for using 'samtools merge'? '-n' does not seem to sort and I am not sure how '-r' works. How are the merged BAM files resolved back into their separate files?

ADD REPLY • link 14.6 years ago by Ian 6.1k