I have a big dataset (7000) which has categorical variables(gene names) and numerical features, I want to find important genes among others based on their features Based on age(numeric) and time of exposure to harmful substances(numeric), for example the higher the age and time of the exposure, the higher the rank of the genes. In addition, I have also the data of up&down regulation of each gene.
Could anyone suggest me some methods (rather than RRA) than can be suitable for this purpose?
Important how? That separate samples? It's not clear how you're defining "important" here or what sort of data you actually have.
Based on age(numeric) and time of exposure to harmful substances(numeric), for example the higher the age and time of the exposure, the higher the rank of the genes. In addition, I have also the data of up&down regulation of each gene.
I do not believe this question can be meaningfully answered given the description provided. You mention expression in the final sentence, otherwise we wouldn't even know that.
Be clear. You have (RNA micro arrays; bulk RNA seq; scRNA seq) on 7000 (people, mice) for a (complex, mendelian, somatic) disease. I have 4 (numerical) covariates and ....
otherwise I can assure you that you are unlikely to get the best answer you can.