The following graph shows an example cell trajectory analysis with monocle.
I have a rough understanding of how monocle constructs the pseudotemporal ordering of cells, using independent component analysis and minimum spanning trees (MST). However, there is one bit of the larger picture, which I'm still missing. How can monocle tell apart the most progenial cell (the source node; the beginning of the pseudotime) from the most differentiated cell (the sink node; the end of pseudotime)?
How does monocle know the direction of progress?
I can think of two possible type of answers to this question:
- Pseudotime can in fact be reversed with respect to the real, biological time (i.e. beginning of pseudotime actually corresponds to the most differentiated cell, and the end of pseudotime corresponds to the most progenial cell).
- There is something I don't understand about the biological significance of how monocle computes the minimum spanning tree, the associated diameter path, and the associated PQ tree -- and the process can, in fact, detect biologically correct root node, or it can detect it most of the time.
It is perhaps worth noting that another technique to establish cell trajectory, i.e. RNA velocity, has a very well understood method of establishing precedence, based on differential equations and mRNA biology.
Thank you Kristoffer -- this makes sense. I think it's a bit misleading of the authors to be speaking of the "beginning of pseudotime", which could just as well be the end of pseudotime. Yes, RNA velocity can establish precedence, as I've written at the bottom of my question. The way this is done is well understood both mathematically and biologically, and involves a system of differential equations linking unspliced, spliced mRNA and its breakdown.
Well you have to label it something :-)