How to impute missing values in my protein expression data
5.1 years ago

I am working with breast cancer protein expression data in which rows are proteins (RefSeq) and columns are samples. I have 77 cancer-affected samples, 3 replicates, and 3 normal samples, and there are many missing values. The data are normalized and contain log2 iTRAQ ratios for each sample. I am working in R and want to impute the missing values, but I am unsure which package to use or how to approach the data, given that I plan to perform gene set analysis with the GSA package in R. Also, can I use a PCA plot to see how the cancer subtypes are distributed across the samples?

Thanks in advance.

Regards

rna-seq R next-gen alignment gene • 1.4k views

There are many ways to impute, see:
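One common option for expression matrices is k-nearest-neighbour imputation, where a missing value is filled with the average of the same sample's values in the most similar proteins. The sketch below is only an illustration in plain Python, not a recommendation for this dataset: the function name `knn_impute`, the toy matrix `ratios`, and the choice of `k` are all assumptions made for the example. In R, the Bioconductor `impute` package provides `impute.knn` for the same general idea.

```python
import math

def knn_impute(matrix, k=3):
    """Fill None entries with the mean of the k nearest rows (proteins).

    Hypothetical helper for illustration: rows are proteins, columns are
    samples, and None marks a missing log2 ratio.
    """
    def distance(a, b):
        # Root mean squared difference over columns observed in both rows,
        # so rows with different amounts of overlap stay comparable.
        shared = [(x - y) ** 2 for x, y in zip(a, b)
                  if x is not None and y is not None]
        return math.sqrt(sum(shared) / len(shared)) if shared else math.inf

    imputed = [row[:] for row in matrix]
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate neighbours: other rows with an observed value in column j.
            candidates = sorted(
                (distance(row, other), other[j])
                for m, other in enumerate(matrix)
                if m != i and other[j] is not None
            )
            nearest = [v for d, v in candidates[:k] if d < math.inf]
            if nearest:
                imputed[i][j] = sum(nearest) / len(nearest)
    return imputed

# Toy matrix (made-up numbers): 4 proteins x 4 samples.
ratios = [
    [0.5, 0.6, None, 0.4],
    [0.4, 0.7, 0.5, 0.5],
    [0.6, 0.5, 0.7, 0.3],
    [-1.2, -1.0, -1.1, None],
]
filled = knn_impute(ratios, k=2)
```

Proteins with very few observed values have poorly defined neighbours, so filtering out proteins that are missing in most samples before imputing is usually worth considering.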


Thank you, I will look into it.


Do you need imputation to start with? For example, there are ways of doing PCA with missing values (e.g. this paper).
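To illustrate how PCA can work without imputation, here is a minimal sketch of the NIPALS algorithm, which estimates each component by regressions that simply skip the missing entries. It is not necessarily the method used in the paper mentioned above. The function name `nipals_pca`, its parameters, and the toy data are assumptions for the example, and the matrix is arranged with samples as rows, so a proteins-by-samples table would need transposing first. In R, the Bioconductor `pcaMethods` package offers NIPALS among its PCA methods.

```python
import math

def nipals_pca(matrix, n_components=2, n_iter=200, tol=1e-9):
    """NIPALS PCA on a samples x proteins matrix; None marks a missing value.

    Hypothetical helper for illustration. Returns (scores, loadings),
    each a list with one vector per component.
    """
    n, p = len(matrix), len(matrix[0])
    # Center each column on the mean of its observed values.
    x = [row[:] for row in matrix]
    for j in range(p):
        observed = [row[j] for row in matrix if row[j] is not None]
        mean = sum(observed) / len(observed)
        for i in range(n):
            if x[i][j] is not None:
                x[i][j] -= mean

    scores, loadings = [], []
    for _component in range(n_components):
        # Start from the column with the largest observed sum of squares.
        start = max(range(p), key=lambda j: sum(
            v * v for v in (r[j] for r in x) if v is not None))
        t = [r[start] if r[start] is not None else 0.0 for r in x]
        load = [0.0] * p
        for _step in range(n_iter):
            # Loadings: regress each column on the scores, observed entries only.
            load = []
            for j in range(p):
                num = sum(x[i][j] * t[i] for i in range(n) if x[i][j] is not None)
                den = sum(t[i] ** 2 for i in range(n) if x[i][j] is not None)
                load.append(num / den if den else 0.0)
            norm = math.sqrt(sum(v * v for v in load)) or 1.0
            load = [v / norm for v in load]
            # Scores: regress each row on the loadings, observed entries only.
            new_t = []
            for i in range(n):
                num = sum(x[i][j] * load[j] for j in range(p) if x[i][j] is not None)
                den = sum(load[j] ** 2 for j in range(p) if x[i][j] is not None)
                new_t.append(num / den if den else 0.0)
            shift = sum((a - b) ** 2 for a, b in zip(new_t, t))
            t = new_t
            if shift < tol:
                break
        # Deflate: remove this component from the observed entries.
        for i in range(n):
            for j in range(p):
                if x[i][j] is not None:
                    x[i][j] -= t[i] * load[j]
        scores.append(t)
        loadings.append(load)
    return scores, loadings

# Toy data (made-up numbers): 6 samples x 4 proteins, two obvious groups.
samples = [
    [1.0, 1.2, None, 0.9],
    [1.1, None, 1.0, 1.0],
    [0.9, 1.0, 1.1, None],
    [-1.0, -1.1, -0.9, -1.0],
    [None, -0.9, -1.0, -1.1],
    [-1.1, -1.0, None, -0.9],
]
scores, loadings = nipals_pca(samples, n_components=2)
pc1 = scores[0]  # one PC1 score per sample, usable as the x-axis of a PCA plot
```

Plotting the first two score vectors against each other and colouring points by subtype gives the usual PCA plot; how well subtypes separate depends on the data.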
