Title of invention Speaker independent speech recognition method utilizing multiple training iterations
Abstract A method for recognizing spoken utterances of a speaker is disclosed. The method comprises the steps of providing a database of labeled speech data; providing a prototype Hidden Markov Model (HMM) definition that defines the characteristics of the HMM; and parameterizing speech utterances according to either linear prediction parameters or Mel-scale filter bank parameters. The method further includes selecting a frame period for accommodating the parameters, and generating HMMs and decoding to specified speech utterances by causing the user to utter predefined training speech utterances for each HMM. The method then statistically computes the generated HMMs with the prototype HMM to provide a set of fully trained HMMs for each utterance indicative of the speaker. The trained HMMs are used for recognizing a speaker by computing Laplacian distances via distance table lookup for utterances of the speaker during the selected frame period, and by iteratively decoding node transitions corresponding to the spoken utterances during the selected frame period to determine which predefined utterance is present.
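The recognition step described in the abstract (Laplacian distances obtained by table lookup, followed by iterative decoding of node transitions) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the features are assumed to be quantized to small integers, the table is assumed to store the negative log-likelihood of a Laplacian density per state and dimension, the decoder is a plain minimum-cost Viterbi pass, and all function and parameter names (`build_distance_table`, `frame_distance`, `viterbi_cost`, `recognize`) are hypothetical.

```python
import math


def build_distance_table(means, scales, levels):
    """Precompute Laplacian distances for every quantized feature value.

    table[state][dim][q] = |q - mean| / scale + log(2 * scale),
    i.e. the negative log of a Laplacian density (assumed form).
    """
    return [
        [
            [abs(q - m) / b + math.log(2.0 * b) for q in range(levels)]
            for m, b in zip(state_means, state_scales)
        ]
        for state_means, state_scales in zip(means, scales)
    ]


def frame_distance(table, state, frame):
    """Distance of one quantized frame to one state, using lookups only."""
    return sum(table[state][d][q] for d, q in enumerate(frame))


def viterbi_cost(table, frames, trans_cost):
    """Minimum-cost path through the model, starting in state 0 and
    ending in the last state. trans_cost[i][j] is a negative log
    transition probability; math.inf marks a forbidden transition."""
    n = len(table)
    cost = [math.inf] * n
    cost[0] = frame_distance(table, 0, frames[0])
    for frame in frames[1:]:
        cost = [
            min(cost[i] + trans_cost[i][j] for i in range(n))
            + frame_distance(table, j, frame)
            for j in range(n)
        ]
    return cost[-1]


def recognize(models, frames):
    """Return the name of the model with the lowest decoding cost."""
    return min(
        models,
        key=lambda name: viterbi_cost(
            models[name]["table"], frames, models[name]["trans"]
        ),
    )
```

In this sketch each entry of `models` maps an utterance name to a dictionary holding a precomputed `table` and a `trans` cost matrix. Precomputing the table trades memory for speed: the per-frame cost reduces to additions and lookups, which is one plausible reading of "distance table lookup".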
Publication number US5806034(A) Publication date 1998.09.08
Application number US19950510321 Filing date 1995.08.02
Applicant ITT CORPORATION Inventors NAYLOR, JOE A.;HUANG, WILLIAM Y.;BAHLER, LAWRENCE G.
Classification G10L15/14;(IPC1-7):G10L9/06 Main classification G10L15/14
Agency Agent
Main claim
Address