5 questions to test your understanding
In an HMM, the Viterbi algorithm is used to find which of the following?
A speech recognizer observes a sequence of acoustic feature vectors. It uses an HMM where hidden states represent phonemes. What are the 'emissions' in this model?
The forward algorithm computes the probability of an observation sequence by dynamic programming, avoiding the need to enumerate all possible hidden state sequences.
Baum-Welch is very likely to find the globally optimal HMM parameters if run to convergence.
Explain why finding 'the most likely state sequence' (Viterbi) is a different problem from finding 'the most likely state at each time step,' and describe a case where the two answers could differ.