I have a dataset with 50k genes as rows in a dataframe and there are 500 samples as columns with TPM expression values. I want to classify these tumor samples samples into two groups i.e. Gene_High and Gene_low based on TPM expression values.
Before that I want to filter out low expressed genes, which are of no use. There are some genes with showing TPM value for only 50 samples and the rest 450 samples are 0.
So, if I have a dataframe df
with 50k genes as rows and 500 samples as columns how to fillter out low expressed genes? How to give the command to filter out low expressed genes in R?