发明名称 RECOGNITION OF SPEECH IN EDITABLE AUDIO STREAMS
摘要 A speech processing system divides a spoken audio stream into partial audio streams ("snippets"). The system may divide a portion of the audio stream into two snippets at a position at which the speaker performed an editing operation, such as pausing and then resuming recording, or rewinding and then resuming recording. The snippets may be transmitted sequentially to a consumer, such as an automatic speech recognizer or a playback device, as the snippets are generated. The consumer may process (e.g., recognize or play back) the snippets as they are received. The consumer may modify its output in response to editing operations reflected in the snippets. The consumer may process the audio stream while it is being created and transmitted even if the audio stream includes editing operations that invalidate previously-transmitted partial audio streams, thereby enabling shorter turnaround time between dictation and consumption of the complete audio stream.
申请公布号 WO2008064358(A2) 申请公布日期 2008.05.29
申请号 WO2007US85472 申请日期 2007.11.23
申请人 MULTIMODAL TECHNOLOGIES, INC.;CARRAUX, ERIC;KOLL, DETLEF 发明人 CARRAUX, ERIC;KOLL, DETLEF
分类号 G10L15/00;G06F17/30;G11B20/10 主分类号 G10L15/00
代理机构 代理人
主权项
地址