发明名称 Methods and systems for adapting grammars in hybrid speech recognition engines for enhancing local SR performance
摘要 A speech recognition method includes providing a processor communicatively coupled to each of a local speech recognition engine and a server-based speech recognition engine. A first speech input is inputted into the server-based speech recognition engine. A first recognition result from the server-based speech recognition engine is received at the processor. The first recognition result is based on the first speech input. The first recognition result is stored in a memory device in association with the first speech input. A second speech input is inputted into the local speech recognition engine. The first recognition result is retrieved from the memory device. A second recognition result is produced by the local speech recognition engine. The second recognition result is based on the second speech input and is dependent upon the retrieved first recognition result.
申请公布号 US9153229(B2) 申请公布日期 2015.10.06
申请号 US201213683312 申请日期 2012.11.21
申请人 Robert Bosch GmbH 发明人 Xu Kui;Weng Fuliang;Feng Zhe
分类号 G10L15/00;G10L15/01;G10L15/065;G10L15/30 主分类号 G10L15/00
代理机构 Maginot Moore & Beck LLP 代理人 Maginot Moore & Beck LLP
主权项 1. A speech recognition method, comprising the steps of: providing a processor communicatively coupled to each of a local speech recognition engine and a server-based speech recognition engine; inputting a first speech input into the server-based speech recognition engine; receiving at the processor a first recognition result from the server-based speech recognition engine, the first recognition result being based on the first speech input; storing the first recognition result in a memory device, the first recognition result being stored in association with the first speech input; inputting a second speech input into the local speech recognition engine; retrieving the first recognition result from the memory device; producing a second recognition result by the local speech recognition engine, the second recognition result being based on the second speech input and being dependent upon the retrieved first recognition result; counting a number of times that the user confirms the correctness of the first recognition result, the second recognition result being dependent upon the retrieved first recognition result only if the number of times that the user has confirmed the correctness of the first recognition result is greater than a predetermined number.
地址 Stuttgart DE