Title of invention: Noise playback enhancement of prerecorded audio for speech recognition operations
Abstract: A speech processing method includes the step of identifying prerecorded audio comprising an original speech segment and a corresponding original noise segment. An audio stream can be generated from the prerecorded audio. The audio stream can comprise a stream speech segment and a stream noise segment. The stream speech segment can have approximately the same duration as the original speech segment. The stream noise segment can have a longer duration than the original noise segment. The audio stream can be conveyed to a speech recognition engine. The speech recognition engine can automatically determine an end of utterance condition based upon the stream noise segment. The original noise segment can be of insufficient duration for the speech recognition engine to determine the end of utterance condition. Responsive to determining the end of utterance condition, the stream speech segment can be speech recognized.
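The method in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names (`extend_noise`, `build_stream`, `end_of_utterance_index`), the amplitude threshold, and the choice to lengthen the noise by looping the original noise samples are all assumptions made for illustration, and the end-of-utterance detector is a simple stand-in for whatever a real speech recognition engine does.

```python
from typing import List, Optional, Sequence


def extend_noise(noise: Sequence[int], target_len: int) -> List[int]:
    """Lengthen a noise segment to target_len samples by looping it.

    Looping is an assumed technique; the abstract only requires that the
    stream noise segment be longer than the original noise segment.
    """
    if not noise:
        raise ValueError("noise segment must contain at least one sample")
    if target_len <= len(noise):
        return list(noise)
    repeats, remainder = divmod(target_len, len(noise))
    return list(noise) * repeats + list(noise[:remainder])


def build_stream(speech: Sequence[int], noise: Sequence[int],
                 min_noise_len: int) -> List[int]:
    """Keep the speech segment unchanged and append the lengthened noise."""
    return list(speech) + extend_noise(noise, min_noise_len)


def end_of_utterance_index(stream: Sequence[int], threshold: int,
                           min_quiet_len: int) -> Optional[int]:
    """Hypothetical detector: return the index where the first run of
    min_quiet_len consecutive low-amplitude samples starts, else None."""
    run = 0
    for i, sample in enumerate(stream):
        if abs(sample) < threshold:
            run += 1
            if run >= min_quiet_len:
                return i - run + 1
        else:
            run = 0
    return None


# Toy samples: four loud "speech" samples and three quiet "noise" samples.
speech = [900, -850, 700, -920]
noise = [3, -2, 1]

# The original noise segment is too short for the detector to fire.
original = speech + noise
undetected = end_of_utterance_index(original, threshold=100, min_quiet_len=6)

# The generated stream carries a longer noise segment, so it fires.
stream = build_stream(speech, noise, min_noise_len=8)
detected = end_of_utterance_index(stream, threshold=100, min_quiet_len=6)
```

In this sketch the speech portion of the stream is identical to the original, while the noise portion grows from three samples to eight, which is enough for the assumed detector to report an end of utterance at the sample where the speech stops.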
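Publication number: US2007106507(A1); Publication date: 2007.05.10
Application number: US20050269921; Filing date: 2005.11.09
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION; Inventors: CHAROENRUENGKIT WERAYUTH T.; HANSON GARY R.; PALGON JON
Classification: G10L15/20; Main classification: G10L15/20
Agency: ; Agent:
Principal claim:
Address: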