I am performing a meta-analysis and I am looking for processed RNA-seq data. I would prefer counts data, but if that is not available, then TPM normalized data would be acceptable.
I found a dataset that only has logCPM normalized RNA-seq data published.
My question is, is there a way to convert this data to regular TPM data? Or even better, to counts data?
Thank you.
I have the transcript length of each gene. I can extract them directly from biomart. Given I have that, and also lets say the number they devided by was 1000, what is the process to convert CPM to TPM?
Genes have multiple transcripts, and therefore multiple transcript lengths. Do you have the abundances of individual transcripts?
hmm I'm not sure what you mean, but this is my code:
This code gives back a single value (a transcript length) for each genes found in the counts matrix.
Yes, it will give you one length, the length of the canonical transcript. But is it right to use only that when you have isoforms present of other lengths?
I just don't see the point of doing a math manipulation when you know the number isn't real. It just distorts the true experimental values.