摘要 |
A speech recognition apparatus generates a feature vector series corresponding to a speech signal, and recognizes a phoneme series corresponding to the feature vector series using sounds corresponding to phonemes and a phoneme language model. In addition, the speech recognition apparatus recognizes vocabulary that corresponds to the recognized phoneme series. At this time, the phoneme language model represents connection relationships between the phonemes, and is modeled according to time-variant characteristics of the phonemes.
|