摘要 |
A voice recognition apparatus determines whether an input sound is a voice segment or a non-voice segment in time series, generates a word model for the voice segment, allocates a predetermined non-voice model for the non-voice segment, connects the word model and the non-voice model in sequence according to the time series of the segments of the input sound corresponding to the respective models and generates a vocalization model, and coordinates the vocalization model with a vocalization ID in one-to-one correspondence, and stores the same.
|