发明名称 Detection of voice inactivity within a sound stream
摘要 A method for identifying end of voiced speech within an audio stream of a noisy environment employs a speech discriminator. The discriminator analyzes each window of the audio stream, producing an output corresponding to the window. The output is used to classify the window in one of several classes, for example, (1) speech, (2) silence, or (3) noise. A state machine processes the window classifications, incrementing counters as each window is classified: speech counter for speech windows, silence counter for silence, and noise counter for noise. If the speech counter indicates a predefined number of windows, the state machine clears all counters. Otherwise, the state machine appropriately weights the values in the silence and noise counters, adds the weighted values, and compares the sum to a limit imposed on the number of non-voice windows. When the non-voice limit is reached, the state machine terminates processing of the audio stream.
申请公布号 US8370144(B2) 申请公布日期 2013.02.05
申请号 US20100793663 申请日期 2010.06.03
申请人 APPLIED VOICE & SPEECH TECHNOLOGIES, INC.;GIERACH KARL D. 发明人 GIERACH KARL D.
分类号 G10L17/00;G10L11/02 主分类号 G10L17/00
代理机构 代理人
主权项
地址