发明名称 Speech Recognition Using Loosely Coupled Components
摘要 An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.
申请公布号 US2016336011(A1) 申请公布日期 2016.11.17
申请号 US201615218492 申请日期 2016.07.25
申请人 MModal IP LLC 发明人 Koll Detlef;Finke Michael
分类号 G10L15/22;G10L15/30 主分类号 G10L15/22
代理机构 代理人
主权项 1. A system comprising: an audio capture component, the audio capture component comprising means for capturing a first audio signal representing first speech of a user to produce a first captured audio signal; a speech recognition processing component comprising means for performing automatic speech recognition on the first captured audio signal to produce first speech recognition results; a first result processing component, the first result processing component comprising first means for processing the first speech recognition results to produce first result output; a second result processing component, the second result processing component comprising second means for processing the first speech recognition results to produce second result output; a context sharing component comprising means for identifying a first one of the first and second result processing components as being associated with a first context of the user at a first time, the context sharing component further comprising: means for receiving credentials from the user;means for identifying, based on the credentials, a list of at least one result processing component authorized for use on behalf of the user at the first time; andmeans for determining that the at least one result processing component in the list is associated with the context of the user at the first time; and speech recognition result provision means for providing the first speech recognition results to the identified first one of the first and second result processing components.
地址 Franklin TN US