发明名称 System and method for supplemental speech recognition by identified idle resources
摘要 Disclosed herein are systems, methods, and computer-readable storage media for improving automatic speech recognition performance. A system practicing the method identifies idle speech recognition resources and establishes a supplemental speech recognizer on the idle resources based on overall speech recognition demand. The supplemental speech recognizer can differ from a main speech recognizer, and, along with the main speech recognizer, can be associated with a particular speaker. The system performs speech recognition on speech received from the particular speaker in parallel with the main speech recognizer and the supplemental speech recognizer and combines results from the main and supplemental speech recognizer. The system recognizes the received speech based on the combined results. The system can use beam adjustment in place of or in combination with a supplemental speech recognizer. A scheduling algorithm can tailor a particular combination of speech recognition resources and release the supplemental speech recognizer based on increased demand.
申请公布号 US9431005(B2) 申请公布日期 2016.08.30
申请号 US201213690671 申请日期 2012.11.30
申请人 AT&T Intellectual Property I, L.P. 发明人 Ljolje Andrej;Gilbert Mazin
分类号 G10L15/32;G10L15/34;G10L15/00;G10L15/28 主分类号 G10L15/32
代理机构 代理人
主权项 1. A method comprising: projecting, via a processor, an expected demand for speech recognition resources; identifying, via the processor and based on the expected demand for speech recognition, a main speech recognizer and a supplemental speech recognizer; assigning the main speech recognizer to recognize a plurality of speech from a plurality of speakers; beginning to process the plurality of speech using the main speech recognizer, to yield main results; and upon determining that first speech recognition results of a first user in the plurality of speakers have a lower accuracy than alternative speech recognition results of a second user: assigning, via the processor, the supplemental speech recognizer to recognize speech from the first user, wherein the assigning of the supplemental speech recognizer is a reallocation of the supplemental speech recognizer away from the second user;continuing to process the speech from the first user using the supplemental speech recognizer and the main speech recognizer, to yield supplemental results; andcombining additional speech recognition results, produced by the main speech recognizer, and the supplemental results.
地址 Atlanta GA US