I have raw count data and TPM data for 12 samples that were measured using RNA-seq and quantified using STAR-RSEM pipeline. This data was measured in units of 6 samples, so the data for these 12 samples contains batch effects. I would like to apply cluster analysis to the log2 TPM data for the 12 samples. In this case, I should obtain TPM data that has been corrected for batch effects, then perform log2 conversion and apply cluster analysis. But, what kind of processing should I perform on the raw count or TPM data output from RSEM? I would be grateful for your guidance.