发明名称 |
SPEECH DETECTION USING STOCHASTIC CONFIDENCE MEASURES ON THE FREQUENCY SPECTRUM |
摘要 |
A probabilistic approach is used to classify each frame of the speech signal as speech or non-speech. The speech detection method is based on a frequency spectrum (24) extracted from each frame, such that the value for each frequency band is considered to be a random variable and each frame is considered to be an occurrence of these random variables. Using the frequency spectrums from a non-speech part of the speech signal, a known set of random variables is constructed (26). Next, each unknown frame is evaluated as to whether or not it belongs to this set of random variables by forming a unique random variable (preferably a chi-square value) (28) from a set of random variables associated with the unknown frame. The unique variable is normalized (30) with respect the known set, and then classified (32) as either speech or non-speech using the "Test of Hypothesis". Thus, each frame that belongs to the known set of random variables is classified as non-speech, and each frame that does not belong to the known set of random variables is classified as speech.
|
申请公布号 |
WO0052683(A1) |
申请公布日期 |
2000.09.08 |
申请号 |
WO2000US01798 |
申请日期 |
2000.01.25 |
申请人 |
PANASONIC TECHNOLOGIES, INC. |
发明人 |
GELIN, PHILIPPE;JUNQUA, JEAN-CLAUDE |
分类号 |
G10L11/02;(IPC1-7):G10L15/20 |
主分类号 |
G10L11/02 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|