Entering edit mode
2.5 years ago
Athena
•
0
I'm wondering if using data from there (link below) i could find/calculate TPM https://support.10xgenomics.com/single-cell-gene-expression/datasets/2.0.1/pbmc4k or is this data not sufficient enough to do that?
Why do you want TPM? TPM divides counts by transcript length but with UMI-tagged data, it isn't necessarily true that longer length -> more counts. I'd recommend not dividing by transcript length for 10X data.
Im trying to run a correlation test using my bulk data (either using RPKM/FPKM/TSM) and do some further downstream analysis.
What would be a better method then, if you do not recommend dividing transcript length?
Just don't divide by transcript lengths. Just take a gene's UMI count and divide it by the total number of UMIs in a cell (this is essentially what TPM is except we're not dividing by transcript length).
Just use the raw counts. All of the above methods introduce a single linear scaling factor so correlation does not change regardless of the method -- unless you introduce something like a per-gene factor such as length -- which as pointed out makes no sense for 10X data.