Title of invention SPEECH RECOGNITION SYSTEM AND METHOD OF SPEAKER ADAPTATION
Abstract PROBLEM TO BE SOLVED: To improve recognition performance with only a small amount of training data, to allow further improvement as more data are collected, and to minimize the burden that adaptation places on the user. SOLUTION: In speaker adaptation mode, a phoneme label sequence decision part 13 obtains correct phoneme sequence information for the input speech of a specific speaker by matching the speech against the HMMs (hidden Markov models) in a dictionary storage part 15 that correspond to the correct phoneme sequence, and also obtains best phoneme sequence information, i.e. the sequence that maximizes the score, by matching the speech against all HMMs in the dictionary storage part 15. An adaptation part 14 re-estimates the mean vectors and variances of the phoneme HMMs in the dictionary storage part 15 by maximum a posteriori (MAP) estimation based on the correct phoneme sequence information. It then compares the phoneme label sequence in the correct phoneme sequence information with that in the best phoneme sequence information, extracts each speech pattern that was assigned a phoneme label different from the correct phoneme label, and subtracts that speech pattern from the mean vector of the phoneme HMM corresponding to that phoneme label.
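The two adaptation steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything beyond the abstract is an assumption made for illustration: each phoneme HMM is reduced to a single mean vector, frames are already aligned to labels, the prior weight `tau` and the step size `eps` are hypothetical parameters, the subtraction is applied to the mean of the wrongly assigned (best-scoring) label, and variance adaptation is omitted.

```python
def map_update_mean(prior_mean, frames, tau=10.0):
    """MAP estimate of a Gaussian mean: (tau * prior + sum(frames)) / (tau + n).

    tau is a hypothetical prior weight; larger values keep the mean
    closer to the speaker-independent prior.
    """
    n = len(frames)
    dim = len(prior_mean)
    sums = [sum(f[d] for f in frames) for d in range(dim)]
    return [(tau * prior_mean[d] + sums[d]) / (tau + n) for d in range(dim)]


def adapt(means, frames, correct_labels, best_labels, tau=10.0, eps=0.01):
    """Adapt per-phoneme mean vectors from one labelled utterance.

    means          : dict mapping phoneme label -> mean vector (list of floats)
    frames         : list of feature vectors, one per frame
    correct_labels : label per frame from matching against the correct sequence
    best_labels    : label per frame from matching against all models
    """
    # Step 1: MAP update of each mean from the frames aligned to the correct label.
    by_label = {}
    for x, c in zip(frames, correct_labels):
        by_label.setdefault(c, []).append(x)
    new_means = {p: list(m) for p, m in means.items()}
    for p, xs in by_label.items():
        new_means[p] = map_update_mean(means[p], xs, tau)

    # Step 2: where the best label disagrees with the correct one, subtract the
    # frame (scaled by eps, an assumption) from the mean of the wrong label.
    for x, c, b in zip(frames, correct_labels, best_labels):
        if b != c:
            new_means[b] = [m - eps * v for m, v in zip(new_means[b], x)]
    return new_means


if __name__ == "__main__":
    means = {"a": [0.0, 0.0], "i": [1.0, 1.0]}
    frames = [[1.0, 1.0], [1.0, 1.0], [2.0, 2.0]]
    print(adapt(means, frames, ["a", "a", "i"], ["a", "i", "i"], tau=2.0, eps=0.5))
```

In this toy call the second frame belongs to "a" but scores best under "i", so it pulls the mean of "a" toward itself in step 1 and is subtracted from the mean of "i" in step 2.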
Publication number JPH10207485(A) Publication date 1998.08.07
Application number JP19970009777 Filing date 1997.01.22
Applicant TOSHIBA CORP Inventor KANAZAWA HIROSHI
Classification G10L15/06;G10L15/10;G10L15/14;G10L15/18;(IPC1-7):G10L3/00;G10L3/00 Main classification G10L15/06
Agency Agent
Principal claim
Address