Entering edit mode
4.7 years ago
wiscoyogi
▴
40
I have two datasets that I want to compare.
My problem is that in one of the datasets, there were a lot of PCR duplicates, so the number of unique molecules are particularly low and there are fewer overall counts. The count values that I’m then getting are making conclusions from the biology hard with my other dataset that did not have PCR duplicates.
Do you have any suggestions for what transformations are available so that I can make a fair comparison between the two datasets?
How many replicates are in these datasets?
I had 16 biological replicates and there were no technical replicates.
As said you typically do not care about duplicates in RNA-seq. I would run it through the DGE pipeline and see if results are reasonable. Also check by PCA if things look good.
How did you conclude that? One can't be absolutely certain about those unless you had UMI's.
I know there were a lot of PCR duplicates because I did have UMIs present.