Title of invention Speech recognition system including speech section detecting section
Abstract A trained vector generation section 16 generates in advance a trained vector V of unvoiced sounds. An LPC cepstrum analysis section 18 generates a feature vector A of the sound within the non-voice period, an inner product operation section 19 calculates the inner product value VᵀA between the feature vector A and the trained vector V, and a threshold generation section 20 generates a threshold θv on the basis of the inner product value VᵀA. The LPC cepstrum analysis section 18 also generates a prediction residual power ε of the signal within the non-voice period, and a threshold generation section 22 generates a threshold THD on the basis of the prediction residual power ε. When a voice is actually uttered, the LPC cepstrum analysis section 18 generates the feature vector A and the prediction residual power ε of the input signal Saf, the inner product operation section 19 calculates the inner product value VᵀA between the feature vector A and the trained vector V, and a threshold determination section 21 compares the inner product value VᵀA with the threshold θv and determines a voice section if θv ≤ VᵀA. Likewise, a threshold determination section 23 compares the prediction residual power ε of the input signal Saf with the threshold THD and determines a voice section if THD ≤ ε. The voice section is finally defined where θv ≤ VᵀA or THD ≤ ε, and the input signal Svc for voice recognition is extracted.
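The decision rule in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names are hypothetical, the feature vector and prediction residual power are assumed to be supplied by some external LPC cepstrum analysis, and the rule used in `threshold_from_noise` (largest value seen in the non-voice period plus a margin) is an assumption, since the abstract only says that each threshold is generated "on the basis of" the non-voice values.

```python
from typing import Sequence


def inner_product(v: Sequence[float], a: Sequence[float]) -> float:
    """Inner product of the trained vector V and a feature vector A."""
    if len(v) != len(a):
        raise ValueError("V and A must have the same dimension")
    return sum(vi * ai for vi, ai in zip(v, a))


def threshold_from_noise(values: Sequence[float], margin: float) -> float:
    """Hypothetical threshold rule (an assumption, not from the abstract):
    the largest value observed in the non-voice period plus a margin."""
    if not values:
        raise ValueError("need at least one non-voice observation")
    return max(values) + margin


def is_voice_frame(
    v: Sequence[float],
    a: Sequence[float],
    residual_power: float,
    theta_v: float,
    thd: float,
) -> bool:
    """A frame counts as voice if either test from the abstract passes:
    the inner-product threshold is met (theta_v <= V.A), or the
    residual-power threshold is met (THD <= epsilon)."""
    return theta_v <= inner_product(v, a) or thd <= residual_power
```

Combining the two tests with a logical OR means that a frame is kept when either test fires, so each can catch frames that the other misses.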
Publication number US7035798(B2) Publication date 2006.04.25
Application number US20010949980 Filing date 2001.09.12
Applicant PIONEER CORPORATION Inventor KOBAYASHI HAJIME
Classification G10L15/20;G10L11/02;G10L15/02;G10L15/04;G10L15/06;G10L15/08 Main classification G10L15/20
Agency Agent
Main claim
Address