Hello everyone!
I am a student that recently started working with transcriptomics data.
I am trying to conduct my first single cell data analysis of an organoid model using mainly Seurat. I tried to conduct a differential expression analysis between different clusters using the FindMarkers()
function but I have some questions.
I would like to understand what test would be best to use if MAST or Wilcoxon Rank Sum. I looked at recently published scRNA-seq analysis and they are both commonly used.
I would also like to ask you how you approach the selection of the top DEGs, namely what are the best p_value_adjust and avg_logFC thresholds to use.
I am really new to the field and I would really appreciate any kind of insight.
My personal pessimistic opinion: Just do whatever you want. Differential gene expression in scRNA-seq is broken anyway.
Edited (because people felt the need to insult+flame the original person I cited):
EDIT: I was skeptical because I am a little biased against Lior. The original claim of discrepancy in results does look concerning and warrants further research.
He (or rather, the person whom he cited -- Nikolai Slavov) has identified a major problem with differential gene expression mathematics -- there's nothing to believe or not believe or to feel; the programs+libraries give starkly discordant results. There's no good solution to these problems -- just as there is no perfect solution to single-cell RNAseq normalization. Single-cell RNAseq has many major problems without solutions. Please don't disregard an important result (substantiated both before+after by other scientists) just because it's from someone whom you have a negative opinion of. There's no reason to cast unnecessary shade.
I try contributing to biostars (I'm on here every day trying to assist members of the community), helping others, citing important results, and then people see it fit to attack+flame my Ph.D. advisor and get upvoted.
If you'd like, I'd be happy to edit my post to cite Nikolai Slavov or Mark Sanborn instead who made the same observations (I cited Lior because he provided mathematical details).
I was trying to help the original poster, who is a student like myself, by pointing out discrepancies in the field and how it probably doesn't matter what method you use for extracting the biological signal most people are interested in -- single-cell analysis methods (and even "bulk" methods) are still developing and we really don't know an "optimal" solution. My colleagues are wonderful people who have put a lot of work into investigating such observations.
Anyway, please try keeping biostars a positive community :)
Thank you for your contributions dsull much appreciated.
I would also say that when it comes to science we should be steering clear of personal biases and treat information based on its actual content.
I agree that no important result should be disregarded owing to bias but I'd by lying if I said what I have heard and seen about Lior has not affected how his words impact me. I don't mean to and I am nowhere near qualified to attack him, but his negativity kinda leaves a bad taste after viewing his interactions. He is an excellent scientist, no doubt. I just wish he did not come off as abrasive as he does.
I appreciate this statement and it largely mirrors my own. Lior sometimes makes extremely salient and important points. Other times, he's thoughtlessly overly aggressive and personal - as many people can be. That does not detract from his (or his group's) contributions or observations...but it does introduce a bias.
Thank you so much for all the replies! I am new to the field and all of the insights are really helpful and much appreciated