摘要 |
PURPOSE:To improve the voice recognition rate of a speaker at the time of voice interaction and to reduce the size by recognizing the voice of spoken contents that an operator speaks after the vocalization of a vocalizing means by referring to a selected statistical language model. CONSTITUTION:This system is provided with model memories 12-1-12-6 storing previously plural mutually different statistical language models according to scenes in the interaction, and one of them is selected at any time according to a scene of the interaction. In the concrete, a state of 'interaction between a user and the interaction system' is assumed, and a syllable trigram is selected by predicting the vocalization of the user from the system side. Namely, a voice recognizing process is performed by referring to a language model in one of the language memories 12-1-12-6 switched selectively according to the scene of the last spoken contents from the system side. |