Question

When Can A Markov Model Be Described As "Hidden"?

6

Entering edit mode

14.9 years ago

Melissa ▴ 60

We have used a Markov model with several window sizes to generate random genomes. I am wondering if it would be accurate to describe this model as "hidden" or solely as a Markov model.

random genome hmm • 3.8k views

ADD COMMENT • link updated 14.3 years ago by Jarretinha 3.5k • written 14.9 years ago by Melissa ▴ 60

0

Entering edit mode

Interesting to learn about this subject. I was unaware of the distinction. :)

ADD REPLY • link 14.9 years ago by Eric Normandeau 11k

0

Entering edit mode

Usually in Population Genetics we use a "regular" Markov Model, not hidden, everything is open. In other parts HMM are the mostly used.

ADD REPLY • link 14.9 years ago by Paulo Nuin ★ 3.7k

Ram · Answer 1 · 2010-05-26

From Wikipedia

In a regular Markov model, the state is directly visible to the observer, and therefore the state transition probabilities are the only parameters. In a hidden Markov model, the state is not directly visible, but output, dependent on the state, is visible. Each state has a probability distribution over the possible output tokens. Therefore the sequence of tokens generated by an HMM gives some information about the sequence of states. Note that the adjective 'hidden' refers to the state sequence through which the model passes, not to the parameters of the model; even if the model parameters are known exactly, the model is still 'hidden'.

It will depend on how you are generating the sequences, we would need more information in order to identify the type used. You can complement/edit your question in order to do so.

score 4 · Answer 2 · 2010-05-26

4

Entering edit mode

14.9 years ago

Jarretinha 3.5k

Generating sequences according to a given alphabet with known probability per symbol and with specified correlation (1st degree, 2nd order, window size, etc.) is always a simple Markov Model. Most random genome generators use such models. In this case you know the state of the chain every time step and transition probabilties too. You just don't know the output, i. e., the random genome.

In the case of the HMM, you know the random genome and want to generate the model or determine in what state of it your random genome belongs.

That's why we use Markov models for bootstraping purposes and HMM for family/relationship inference/clustering.

ADD COMMENT • link 14.9 years ago by Jarretinha 3.5k

0

Entering edit mode

If you are generating sequences with indels, sometimes you don't know the state of the chain most of the time, as indels depend on evolutionary processes that might occur or not.

ADD REPLY • link 14.9 years ago by Paulo Nuin ★ 3.7k

0

Entering edit mode

Indels are just another state in your Markov chain. You can even treat them separately and with variable size. To be hidden means state not known. If you know that somewhere there's a indel, then it's not hidden at all.

ADD REPLY • link 14.9 years ago by Jarretinha 3.5k

score 2 · Answer 3 · 2010-05-26

While this isn't a 'cut-and-dry' answer the rule of thumb I've used is that:

In a normal Markov model you are using a single set of transition probabilities to generate a random sequence which follows a specific pattern. In a hidden Markov model you are using a separate "hidden state" which determines what the transition probabilities.

The classic example is scanning a set of sequences for gene coding and non-gene coding regions. Since they have different transition probabilities one can distinguish between them by looking at a window of sequences.

If your generative process has two (or more) sets of NT transition probabilities (ie. one for gene coding and one for intergenic regions) then its a Hidden Markov Model. If you simply define a single set of transition probabilities then its a normal Markov Model.

Hope that clears up the difference,

Will