发明名称 Post processing of natural language automatic speech recognition
摘要 A post-processing speech system includes a natural language-based speech recognition system that compares a spoken utterance to a natural language vocabulary that includes words used to generate a natural language speech recognition result. A master conversation module engine compares the natural language speech recognition result to domain specific words and phrases. A voting engine selects a word or a phrase from the domain specific words and phrases that is transmitted to an application control system. The application control system transmits one or more control signals that are used to control an internal or an external device or an internal or an external process.
申请公布号 US9431012(B2) 申请公布日期 2016.08.30
申请号 US201213460462 申请日期 2012.04.30
申请人 2236008 Ontario Inc. 发明人 Fry Darrin Kenneth
分类号 G10L15/22;G10L15/19;G10L15/32;G10L15/20;G10L15/30 主分类号 G10L15/22
代理机构 Brinks Gilson & Lione 代理人 Brinks Gilson & Lione
主权项 1. A post-processing automated speech recognition system comprising: a natural language-based speech recognition system that compares a spoken utterance to a natural language vocabulary comprising words in one or more active grammars to generate a natural language speech recognition result and a natural language confidence score; a master conversation module engine that post processes the natural language speech recognition result by comparing the natural language speech recognition result generated by the natural language-based speech processing system to lists of words and phrases from a plurality of active sub-grammars that are mapped to a plurality of domain specific words and phrases, and generates a post processed natural language speech recognition result confidence score for each of the listed words and phrases from the plurality of active sub-grammars based on a likelihood that the respective listed words and phrases from the plurality of active sub-grammars match the natural language speech recognition result; a voting module engine that is interfaced to the master conversation module engine and that is operable to select a word or a phrase from the list of words and phrases from the plurality of active sub-grammars, based on the post processed natural language speech recognition result confidence scores; wherein: when the voting module engine selects a word or phrase from the plurality of active sub-grammars based on the post processed natural language speech recognition result confidence scores, the selected word or phrase is transmitted to an application control system that transmits control signals used to control an internal or an external device or an internal or an external process; orwhen both of: the voting module does not select a word or phrase from the plurality of active sub-grammars based on the post processed natural language speech recognition result confidence scores, andthe natural language confidence score exceeds a specified threshold,then the master conversation module engine selects the natural language speech recognition result to be transmitted to the application control system; where the natural language-based speech recognition system, the master conversation module engine, and the voting module engine are executed by one or more processors.
地址 Waterloo, Ontario CA