发明名称 DYNAMIC SPEECH RECOGNITION AND TRANSCRIPTION AMONG USERS HAVING HETEROGENEOUS PROTOCOLS
摘要 A system is disclosed for facilitating free form dictation, including directed dictation and constrained recognition and/or structured transcription among users having heterogeneous native (legacy) protocols for generating, transcribing, and exchanging recognized and transcribed speech. The system includes at least one system transaction manager having a “system protocol,” to receive a verified, streamed speech information request from at least one authorized user employing a first legacy user protocol. The speech information request which includes spoken text and system commands is generated using a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications, including prompts to direct user dictation in response to user system protocol commands and systems transaction manager commands. A speech recognition and/or transcription engine (ASR), in communication with the systems transaction manager, receives the speech information request from the system transaction manager, generates a transcribed response, which can include a formatted transcription, and transmits the response to the system transaction manager. The system transaction manager routes the response to one or more of the users employing a second protocol, which may be the same as or different than the first protocol. In another embodiment, the system employs a virtual sound driver for streaming free form dictation to any ASR, regardless of the ASR's ability to recognize and/or transcribe spoken text from any input source such as, for example, a live microphone or line input. In another embodiment, the system employs a buffer to facilitate the system's use of ASRs requiring input data to be in batches, while providing the user with an uninterrupted, seamless dictating experience.
申请公布号 US2015348552(A1) 申请公布日期 2015.12.03
申请号 US201514821786 申请日期 2015.08.10
申请人 Advanced Voice Recognition Systems, Inc. 发明人 Miglietta Joseph H.;Davis Michael K.
分类号 G10L15/26;G10L15/20 主分类号 G10L15/26
代理机构 代理人
主权项 1. A system for facilitating free form dictation and constrained speech recognition and/or structured transcription among users having heterogeneous system protocols the system comprising: at least one system transaction manager using a uniform system protocol, adapted to receive a verified streamed speech information request from at least one user employing a first user legacy protocol, and configured to route a response to one or more users employing a second user legacy protocol, the speech information request comprised of free form dictation of spoken text and commands and the response comprised of a transcription of spoken text; a user interface capable of bi-directional communication with the system transaction manager and supporting dictation applications, including prompts to direct user dictation in response to user system protocol commands and system transaction manager commands the user interface being in bi-directional communication with the systems transaction manager; and, at least one speech recognition and/or transcription engine communicating with the system systems transaction manager wherein the speech recognition and/or transcription engine is configured to receive the speech information request containing spoken text and commands for constrained speech recognition transmitted by the systems transaction manager, to generate structured transcription in response to the speech information request, and to transmit the response comprised of structured transcription to the system transaction manager.
地址 Scottsdale AZ US