Question

How to define the gene value if a gene is represented by multiple probe sets

1

10.2 years ago

zero_hsy ▴ 110

Hello:

I was working with .cel files, and when I mapped the probe IDs to gene symbols, I found that a gene is sometimes represented by multiple probe sets. How should I define the value for such a gene?

gene-value cel probe-id • 2.5k views

updated 2.5 years ago by Ram 45k • written 10.2 years ago by zero_hsy ▴ 110

1

+1. In my opinion this question has received surprisingly little attention, especially considering that different probes for the same gene might show very different patterns of expression. I vaguely remember a paper suggesting that the best way to represent the overall expression of a gene is to use the probe with the highest intensity (can't find the ref just now...).

10.2 years ago by dariober 15k

0

9 out of 10 times the probe with the highest signal is the 3'-end probe, for the reasons I explained in my answer below.

10.2 years ago by Irsan ★ 7.8k

score 2 · Answer 1 · 2015-05-05

I suppose you are referring to gene expression arrays that can have multiple probes per gene. If you want a single value per gene, you should take the probe closest to the 3'-end of the gene. This is because RNA molecules get degraded from the start of the molecule (the 5'-end), so the signal from the probe at the end of the molecule is the most reliable. The reason multiple probes were designed per gene was to quantify the extent of RNA degradation and to allow for differential isoform expression analysis.
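To make the two selection rules discussed in this thread concrete, here is a minimal Python sketch. Everything in it is an assumption for illustration: the probe set IDs, the gene symbols, the expression values, and the "distance to the 3'-end" field are made up, and in practice the distance would have to come from your array's annotation. It only shows how one might collapse multiple probe sets to one per gene, either by picking the 3'-most probe set or by picking the one with the highest mean intensity (the alternative mentioned in the comment above).

```python
from statistics import mean

# Hypothetical toy data: probe set ID -> (gene symbol, distance in bases from
# the probe set to the 3'-end of the transcript, expression values per sample).
PROBES = {
    "ps_001": ("GENE_A", 1200, [5.1, 5.3, 4.9]),
    "ps_002": ("GENE_A", 150, [8.2, 8.0, 8.4]),
    "ps_003": ("GENE_B", 300, [6.5, 6.7, 6.6]),
}


def collapse_by_3prime(probes):
    """Keep, for each gene, the probe set closest to the 3'-end."""
    best = {}
    for probe_id, (gene, dist, values) in probes.items():
        # Replace the current choice only if this probe set is nearer the 3'-end.
        if gene not in best or dist < best[gene][1]:
            best[gene] = (probe_id, dist, values)
    return {gene: (pid, values) for gene, (pid, _, values) in best.items()}


def collapse_by_max_mean(probes):
    """Keep, for each gene, the probe set with the highest mean intensity."""
    best = {}
    for probe_id, (gene, _, values) in probes.items():
        avg = mean(values)
        # Replace the current choice only if this probe set is brighter on average.
        if gene not in best or avg > best[gene][1]:
            best[gene] = (probe_id, avg, values)
    return {gene: (pid, values) for gene, (pid, _, values) in best.items()}


if __name__ == "__main__":
    print(collapse_by_3prime(PROBES))
    print(collapse_by_max_mean(PROBES))
```

In this toy data both rules pick the same probe set for GENE_A, which matches the observation in the comments that the brightest probe is often the 3'-end one. On real data the two rules can disagree, so it is worth checking how often they do before settling on one.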