Abstract |
A true/false judgment on a speech recognition result is made with high accuracy and a small amount of processing. When the recognition result is judged to be true, speaker adaptation is applied to the acoustic models; when it is judged to be false, speaker adaptation is not applied. This improves the accuracy of speaker adaptation. Robust speaker adaptation that is not susceptible to the influence of background noise is also achieved. Initial acoustic models Mc are stored in advance in a speaker adapted model storing section, and a noise adapting section generates noise adapted models Mc' by applying noise adaptation to the initial acoustic models Mc stored there. A speaker adaptation parameter calculating section generates speaker adaptation parameters P based on the noise adapted models Mc' and a feature vector sequence V(n) of utterances from the speaker, and an acoustic model updating section (15) generates speaker adapted models Mc" by applying speaker adaptation processing to the initial acoustic models Mc using the speaker adaptation parameters P.
|
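
The data flow described in the abstract (Mc to Mc', then P, then Mc") can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the source: each acoustic model is reduced to a single mean vector, noise adaptation is a simple additive shift, the speaker adaptation parameters P are a single global bias, and the function names are hypothetical. The abstract does not specify the actual adaptation methods.

```python
from typing import List

Vector = List[float]


def noise_adapt(initial_means: List[Vector], noise_mean: Vector) -> List[Vector]:
    """Mc -> Mc': shift each clean mean by a noise estimate (toy additive model)."""
    return [[m + n for m, n in zip(mean, noise_mean)] for mean in initial_means]


def nearest(means: List[Vector], v: Vector) -> int:
    """Index of the mean closest to v in squared Euclidean distance."""
    return min(
        range(len(means)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(means[i], v)),
    )


def speaker_params(noise_adapted: List[Vector], features: List[Vector]) -> Vector:
    """P: average offset between each feature vector and its nearest Mc' mean."""
    dim = len(features[0])
    total = [0.0] * dim
    for v in features:
        mean = noise_adapted[nearest(noise_adapted, v)]
        for d in range(dim):
            total[d] += v[d] - mean[d]
    return [t / len(features) for t in total]


def update_models(initial_means: List[Vector], bias: Vector,
                  judged_true: bool) -> List[Vector]:
    """Mc -> Mc": apply P to the initial models only if the result was judged true."""
    if not judged_true:
        # Recognition result judged false: leave the models unchanged.
        return [list(mean) for mean in initial_means]
    return [[m + b for m, b in zip(mean, bias)] for mean in initial_means]


# Hypothetical toy data: two 2-dimensional models and two feature vectors.
Mc = [[0.0, 0.0], [4.0, 4.0]]
noise = [1.0, 1.0]
V = [[1.5, 1.25], [5.5, 5.25]]

Mc_noise = noise_adapt(Mc, noise)        # noise adapted models Mc'
P = speaker_params(Mc_noise, V)          # speaker adaptation parameters P
Mc_spk = update_models(Mc, P, True)      # speaker adapted models Mc"
Mc_rejected = update_models(Mc, P, False)
```

The point of the sketch is the ordering: P is estimated against the noise adapted models Mc', so the noise offset is not absorbed into the speaker parameters, and P is then applied to the initial models Mc rather than to Mc'. The `judged_true` flag mirrors the gating described in the first part of the abstract.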