发明名称 METHOD AND DEVICE FOR RECOGNIZING VOICE
摘要 PROBLEM TO BE SOLVED: To reduce a load applied to a computer by calculating convergent likelihood based on mouth shape information during a vocalizing section obtaining a candidate word from a photographing image of the mouth of a speaker. SOLUTION: A mouth shape recognition part 102 recognizes the shape and the movement of the mouth at a vocalizing time from a face image signal S101 (photographed image) read out from an image frame buffer 101. A word dictionary 104 stores syllable information and a phoneme model beforehand obtained related to the word candidate to be recognized. Further, a mouth shape syllable matching part 103 investigates a matching extent between the syllable information inputted from the word dictionary 104 and a syllable obtained from the operation of the mouth shape to output the result (mouth shape syllable matching score). Further, a word candidate convergent part 105 converges the word candidate according to the mouth shape syllable matching score. Then, a voice recognition part 108 compares a line of a voice frame S108 of an inputted sound section with the phoneme model S111 of the word converged by the word candidate convergent part 105, and outputs the word with the highest likelihood as the recognition result.
申请公布号 JPH09325793(A) 申请公布日期 1997.12.16
申请号 JP19960142551 申请日期 1996.06.05
申请人 OKI ELECTRIC IND CO LTD 发明人 FUJII AKIHIRO;MIYAZAKI TOSHIHIKO
分类号 G10L15/24;G10L15/28;(IPC1-7):G10L3/00;G10L5/06 主分类号 G10L15/24
代理机构 代理人
主权项
地址