发明名称 Speech recognition using loosely coupled components
摘要 An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.
申请公布号 US9208786(B2) 申请公布日期 2015.12.08
申请号 US201514636774 申请日期 2015.03.03
申请人 MModal IP LLC 发明人 Koll Detlef;Finke Michael
分类号 G10L15/00;G10L15/30;G10L15/22 主分类号 G10L15/00
代理机构 Robert Plotkin, P.C. 代理人 Robert Plotkin, P.C. ;Plotkin Robert
主权项 1. A system comprising: an audio capture component, the audio capture component comprising means for capturing a first audio signal representing first speech of a user to produce a first captured audio signal; a speech recognition processing component comprising means for performing automatic speech recognition on the first captured audio signal to produce first speech recognition results; a first result processing component, the first result processing component comprising first means for processing the first speech recognition results to produce first result output; a second result processing component, the second result processing component comprising second means for processing the first speech recognition results to produce second result output; a context sharing component comprising means for identifying a first one of the first and second result processing components as being associated with a first context of the user at a first time, the context sharing component further comprising: means for identifying a list of at least one result processing component authorized for use on behalf of the user at the first time; andmeans for determining that the at least one result processing component in the list is associated with the context of the user at the first time; and speech recognition result provision means for providing the first speech recognition results to the identified first one of the first and second result processing components.
地址 Franklin TN US