Hi all,
When we develop a model from gene classifiers (using machine learning algorithms), is it better to narrow the gene list so that a clearer pattern can be applied to the validation data set?
If so, why? And if some of the excluded genes are important for the phenotype of interest, which should take priority: a cleaner pattern in the model after removing those genes, or keeping those genes in the model?
Thanks
Rob
Thank you, Shred