Hello everyone,
I have a question today which doesn't necessarily has anything to do directly with R, but I was hoping to get some answers here.
I was approached by one of our biologists which a somehow unusual problem.
They are doing a microarray analysis of miRNA with Drosophila and have several body parts as well as whole fly arrays. Now they don't just want to see what are the differences between the different body parts, but also in general would like to know what happens in each of the body parts on themselves. What she asked me was, whether is it possible just to see in each array group (without comparing it to the other groups) what miRNA are expressed.
The way I was thinking about it was, to order the miRNA according to their expression values and just to try t find a cut-off, where I can divide the "real" expression from the background noise.
The way of finding this allegeable threshold will be like doing a BLAST search with the rest of the miRNA (which are not from Drosophila on this array) and look for the highest non-Drosophila hit in the BLAST which is also in the List of this expression values. Than look for the first one with at least 4 mismatches ( this is an arbitrarily choice and can be refined if needed). All the hits bellow this one will be considered as noise, The rest can be look at as "really" expressed.
So what i really want to know is, whether this kind of approach is in any sense reasonable at all and if there is any other way of doing the same analysis
thanks for any suggestions and help
Assa