I am running an RNA-sequencing experiment where I am analyzing the differential gene expression of oysters collected from different locations. I plan on using the Cyverse DNA Subway Greenline platform which utilizes Kallisto and Sleuth. Since I will be conducting multiple comparisons (ie oysters from Site 1 vs oysters from Site 2 vs oysters Site 3 etc.), I understand that this could run into significant statistical issues involving inferential and individual variation of each sample. Will the Kallisto and Sleuth algorithms correct for this? I imagine I will need to run all of my samples simultaneously through Kallisto so that normalization is done across all samples. Will this be sufficient to mitigate the noise from individual sample variation and make biological variation more significant? Or would I need to employ normalization methods such as TMM via edgeR? I am pretty new to this and learning along the way so any feedback is much appreciated!
Thanks in advance.