发明名称 Using a loudness-level-reference segment of audio to normalize relative audio levels among different audio files when combining content of the audio files
摘要 The present invention records a loudness-level-reference segment of audio when creating speech audio files and audio files including background sounds. The speech audio files can then be combined with the background sound containing audio files in any desirable combination. When combining the files, the relative audio level of the files is matched, by matching the loudness-level-reference segments with each other. Any of a variety of known digital signal processing techniques can be used to normalize the component audio files. The combined audio files containing speech and background sounds (e.g. ambient noise) having matching relative audio levels can be used to test and/or train a speech recognition engine or a speech processing system.
申请公布号 US7822498(B2) 申请公布日期 2010.10.26
申请号 US20060463683 申请日期 2006.08.10
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 CHAROENRUENGKIT WERAYUTH T.;FADO FRANCIS;NGUYEN KHA DINH
分类号 G06F17/00;H03G3/00;H04B1/20 主分类号 G06F17/00
代理机构 代理人
主权项
地址