Title of Invention |
PURE SPEECH DETECTION USING VALLEY PERCENTAGE |
Abstract |
A speech detection method detects pure-speech signals in an audio signal containing a mixture of pure-speech and non-speech or mixed-speech signals. The method detects the pure-speech signals by computing a novel Valley Percentage feature, a measurement of the low-energy parts of the signal, and performing a threshold decision on this feature. Before detection, the method applies a morphological closing filter to eliminate unwanted noise; after detection, it applies a combination of morphological closing and opening filters to remove aberrant pure-speech or non-speech classifications caused by impulsive audio signals, so that the boundaries between the pure-speech and non-speech portions of the signal are detected more accurately. |
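The pipeline outlined in the abstract (closing filter, Valley Percentage, threshold decision, closing and opening filters) might be sketched as follows. This is only an illustration under stated assumptions, not the claimed method: the abstract does not define the Valley Percentage, so the sketch assumes it is the fraction of frames in a sliding window whose energy falls below a fixed fraction of that window's maximum energy. It also assumes that a higher Valley Percentage indicates pure speech and that closing is applied before opening in the post-filter. All function names, parameter names, and default values are hypothetical.

```python
def frame_energy(samples, frame_len):
    """Mean squared amplitude of consecutive, non-overlapping frames."""
    n_frames = len(samples) // frame_len
    return [
        sum(x * x for x in samples[k * frame_len:(k + 1) * frame_len]) / frame_len
        for k in range(n_frames)
    ]


def _slide(values, width, reducer):
    """Apply reducer (min or max) over a centered window, truncated at the edges."""
    r = width // 2
    return [reducer(values[max(0, i - r):i + r + 1]) for i in range(len(values))]


def closing(values, width):
    """Dilation followed by erosion: fills dips narrower than the window."""
    return _slide(_slide(values, width, max), width, min)


def opening(values, width):
    """Erosion followed by dilation: removes peaks narrower than the window."""
    return _slide(_slide(values, width, min), width, max)


def valley_percentage(energy, window, valley_ratio):
    """Assumed definition: share of frames in each centered window whose
    energy lies below valley_ratio times the window's maximum energy."""
    r = window // 2
    vp = []
    for i in range(len(energy)):
        local = energy[max(0, i - r):i + r + 1]
        floor = valley_ratio * max(local)
        vp.append(sum(1 for e in local if e < floor) / len(local))
    return vp


def detect_pure_speech(samples, frame_len=160, pre_width=3, vp_window=51,
                       valley_ratio=0.1, vp_threshold=0.3, post_width=11):
    """Return one 0/1 label per frame (1 = pure speech).

    All default values are illustrative placeholders, not taken from the source.
    """
    # Step 1: closing filter on the energy contour to suppress noise.
    energy = closing(frame_energy(samples, frame_len), pre_width)
    # Step 2: Valley Percentage feature and threshold decision
    # (assumption: speech has more low-energy frames than music or noise).
    vp = valley_percentage(energy, vp_window, valley_ratio)
    mask = [1 if v > vp_threshold else 0 for v in vp]
    # Step 3: closing then opening on the labels to remove short, aberrant
    # runs (the order of the two filters is an assumption).
    return opening(closing(mask, post_width), post_width)
```

With a signal that alternates loud and silent frames followed by a steady tone, the alternating part should be labelled 1 and the steady part 0, because only the former contains energy valleys. The window widths control how short a run of frames must be before the morphological filters treat it as aberrant.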
Publication Number |
WO0033294(A9) |
Publication Date |
2001.07.05 |
Application Number |
WO1999US28401 |
Filing Date |
1999.11.30 |
Applicant |
MICROSOFT CORPORATION |
Inventors |
GU, CHUANG;LEE, MING-CHIEH;CHEN, WEI-GE |
Classification |
G10L11/02;G10L15/04;(IPC1-7):G10L11/02 |
Main Classification |
G10L11/02 |
Patent Agency |
|
Agent |
|
Principal Claim |
|
Address |
|