Hi Biostars community,
Let me explain my situation: I have to determine the expression of different genes in different tissues. Each tissue has a biological replicate coming from another individual (the only difference is 3 months in term of age). These two individual are healthy so my expectation is that there is no a significant difference in gene expression between them. So it is not useful in this case to perform DEA with DESeq and so on.
I calculated TPM values from raw reads with Salmon. I know that there are a lot of posts about this but in each post there are different information but none of them have a definitive answer.
My question is: in my situation is it fine to utilize TPM discarding all summarized gene counts other than those with raw counts > 10 and consider expressend genes with row counts >10? Or it is better to move toward another normalization method like zFPKM to identify the threshold between active and background gene expression?
I'm sorry if this post sounds repetitive with others. I really appreciate any suggestions.
Thank you for your time!
If you are working with different tissues, why can't you do a differential expression analysis? Despite having 2 healthy individuals, you have 2 different tissues, which are 2 different conditions. This is what
DESeq2
is actually doing, comparing different conditions.The issue of replicates coming from 2 different individuals can be easily addressed with a batch effect normalization (you have how to do it in this post).
Actually I'm not working with 2 different tissues but with a lot. For each tissue of an individual I have the biological replicate of the same tissue in the other individual. I don't know if I have a consistent batch effect because the procedures (e.g. poly-A selection, library preparation etc.) were performed the same day. Beside this I could have, of course, a strong batch effect and I'm working to construct a PCA plot to establish it and eventually correct it.
Anyway with DESeq is possible to set a tissue as reference and perform the DEA versus the other tissues, right? So your advice is to set every time a tissue as reference and then perform DEA against the other tissues?
Exactly! In a 2 tissues example, being A the reference tissue:
You will find that the vast majority of genes won't change, some of them will have a positive log2FoldChange (B specific) and some will have negative log2FoldChange (A specific).
Probably it is a stupid question so sorry. But is it possible to compare A vs B,C,D and so one in the same time or is it necessary to compare A vs B, then A vs C and so on?
You can do all the comparisons at the same time, read in DESeq2 vignette about the design if you have any doubts. As you are only comparing 1 variable (tissue) it will (should) be easy. Good luck!