发明名称 Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
摘要 Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.
申请公布号 US9396721(B2) 申请公布日期 2016.07.19
申请号 US201113289233 申请日期 2011.11.04
申请人 Nuance Communications, Inc. 发明人 Agapi Ciprian;Bodin William K.;Cross, Jr. Charles W.;Mirt Michael H.
分类号 G10L15/20;G10L15/01 主分类号 G10L15/20
代理机构 Wolf, Greenfield & Sacks, P.C. 代理人 Wolf, Greenfield & Sacks, P.C.
主权项 1. A system comprising at least one processor configured to: analyze digital data representing sounds captured by at least one microphone from an operating environment to compute background noise information associated with the operating environment, wherein: the at least one processor is configured to match the sounds captured from the operating environment to a background noise from a plurality of background noises, andthe background noise information comprises an identification of the background noise matching the sounds captured from the operating environment; select, based at least in part on the background noise information associated with the operating environment, a voice dialog from a plurality of voice dialogs, wherein: the at least one processor is configured to select, based at least in part on the background noise matching the sounds captured from the operating environment, one or more grammars for use in carrying out the voice dialog with a user; and perform automatic speech recognition, using the one or more grammars, on user speech captured from the operating environment.
地址 Burlington MA US