Hi there,
I would like to determine whether my gene age distribution plot contains potential whole-genome duplications. I would like to do with a K-S goodness-of-fit test by comparing the paralog ages in my plot to a simulated null distribution (without ancient WGDs).
My question is as follows: Is there a way for an R novice like myself to model this null distribution in order to compare it with my gene age distribution plot? I found the following paper describing how to model the age distribution of paralogs under a constant birth–death model but I'm finding it very difficult to understand and implement: Widespread genome duplications throughout the history of flowering plants
Some advice or useful resources would be extremely appreciated.
Thank you for your time