Entering edit mode
5.0 years ago
Joe Kherery
▴
140
I have a list of genes from an Affymetrix microarray analysis.
However I have duplicate genes, I wonder how do I collapsed to unique genes by calculating the mean expression of transcripts from the same gene locus?
Any information is valid.
Thank you!
Hello kevin
I tried
avereps()
function, after compute statistics for the data, however I don't thinkavereps()
did the media, it seems that he just removed the gene with the lowest value.or should I do this before generating my list of DEGs?
Yes, this 'summarisation' step is typically done prior to differential expression analysis. I show here how
avereps()
calculates the mean: A: How to combine expression values of multiple probes for one gene?Thank you so much Kevin!
Hello kevin
you know how to access
avereps
using the functioneset <- readExpressionSet("data.txt", header=T)
I always get the error:
Ah, your input object is an ExpressionSet, so, that changes the behaviour of the function. The bahaviour of many functions varies depending on the class of the input object (SingleCellExperiment, ExpressionSet, EList, data matrix, data frame, etc).
What is stored in
eset$geneID
?Actually I have a data.txt file which is normalized microarray data and I also have Annotation data. I can easily analyze normalized data.txt, but in the end when i merge with my Annotation list, I end up with duplicate genes.
So I decided to merge data.txt + annotations and use the
avereps
function to avoid duplicate genes.But that's not doing well.
the list of genes
Could I not do it that way?
1.Find my list of differentially expressed probes after merge with the annotation and then do the
avereps
? and remove the NA?Oh, but you can just read data.txt into a standard data-frame or data-matrix. Do you need it to be an ExpressionSet? limma can work with any data-matrix / -frame
Here is a quick example:
Hello Kevin,
I try it, but dont work.
I don't think I need to read it as
ExpressionSet
, my data.txt contains probe ID names and samples ... but I don't think i can analyze it any other way ...or is it simply read use
eset_fil<-avereps(eset,ID=eset$geneID)
and do as I usually do inlimma
?or there is a point I have to make after I do the
avereps
? para usar olimma
?I will try as you exemplified. Thanks
Fala português? In the example above, you should have:
..where eset_fil is the output of
avereps()
Sí, pero no mucho.
I did as you said, but I still get this error.
I tried transform
eset_fil
into data frame but same error. I triend as.matrix.... not resultsCan you show all of your steps from when you read in the file's contents? Also, paste here a few rows of your data after you have read it in. Remember that you do not have to use
readExpressionSet()
- just useread.csv()
orread.table()