Hi,
I have 40 samples which I have sent for RNA-seq to the depth of 30-60M reads per sample. The sequencing company have sequenced my samples in 3 different batches to achieve the required depth.
My questions are:
1 is there a 'best practice' for how to combine the data from 3 batches into one before DGE analysis? I am aware that there's a method to merge bam files using samtools but I also know of those who convert individual bam files into counts and combine the counts after. Both sounds equally reasonable to me. Is there a different in each method? if so, which is better?
- At present, I have decided to try out the second option - convert bam files to counts and combine them. For DGE - should I add all the counts from 3 batches together or use the average? (The sum seems to be the more logical method to appreciate the DGE at the required depth.)
Thank you. Appreciate all advice as I am new to this.
Are these technical replicates? Meaning you took one library and put third in each run?
These are technical replicated all from the same library prep. They were sequenced 3 times to achieve the required depth of 30-60M reads/sample