The standard HMM

Following the notation of Rabiner [65], there are observation times. At each time $1\leq t \leq T$ , there is a discrete state variable which takes one of values $q_t\in\{S_1,S_2,\cdots, S_N\}$ . According to the Markovian assumption, the probability distribution of $q_{t+1}$ depends only on the value of . This is described compactly as a state transition probability matrix whose elements $a_{ij}$ represent the probability that $q_{t+1}$ equals given that $q_{t}$ equals . The initial state probabilities are denoted $\pi_i$ , the probability that equals .

It is a hidden Markov model because the states are hidden from view; we cannot observe them. But, we can observe the random data which is generated according to a PDF dependent on the state at time . We denote the PDF of under state as $b_{j}(O_t)$ .

The complete set of model parameters that define the HMM are

$\displaystyle \Lambda = \{\pi_j, a_{ij}, b_j \}$

The Baum-Welch algorithm calculates new estimates $\Lambda$ given an observation sequence ${\bf O}=O_1 O_2\cdots O_T$ and a previous estimate of $\Lambda$ . The algorithm is composed of two parts: the forward/backward procedure, and the reestimation of parameters.

Subsections