Title of invention APPARATUS AND METHOD FOR EXTRACTING SPEAKER'S FEATURE, SPEECH RECOGNITION DEVICE, SPEECH SYNTHESIZER, AND PROGRAM RECORDING MEDIUM
Abstract PROBLEM TO BE SOLVED: To stably extract a speaker's features from a small amount of utterance data. SOLUTION: A frequency warping function estimation part 5 comprises a phoneme boundary estimation part and a maximum likelihood estimation part. During learning, a language model given by the phoneme sequence of the utterance content, or a weak-grammar language model, is applied to estimate phoneme boundary information with the Viterbi algorithm. A frequency warping function f() is then estimated over the phoneme sections selected on the basis of the phoneme boundary information. During recognition, the phoneme boundary information is estimated by applying the weak-grammar language model, and a frequency warping part 4 warps the frequency axis of the input acoustic parameter sequence over the phoneme sections selected on the basis of that information. Phonemes and unvoiced parts that are little affected by differences in vocal tract length are thus excluded from learning and normalization, which prevents them from being distorted. As a result, the speaker's features are stably extracted from a small amount of utterance data.
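The selective warping described in the abstract can be illustrated with a minimal Python sketch. Several things here are assumptions made only for illustration and are not stated in the record: the warping function is taken to be linear, f(k) = alpha * k; the set of phonemes treated as sensitive to vocal tract length (`WARPED_PHONEMES`) is hypothetical; the phoneme segments are assumed to be already available (for example from a Viterbi alignment); and `log_likelihood` stands in for whatever acoustic model scores a frame. The function names `warp_frame`, `warp_selected` and `estimate_alpha` are likewise invented for this sketch.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical set of phonemes assumed to be affected by vocal tract length.
# The record does not list them; vowels are used here purely as an example.
WARPED_PHONEMES = {"a", "i", "u", "e", "o"}

# (phoneme label, start frame, end frame exclusive), e.g. from a Viterbi alignment.
Segment = Tuple[str, int, int]


def warp_frame(frame: Sequence[float], alpha: float) -> List[float]:
    """Resample one spectral frame with an assumed linear warp f(k) = alpha * k."""
    n = len(frame)
    out = []
    for k in range(n):
        src = min(alpha * k, n - 1)      # clamp the source position to the last bin
        lo = int(src)
        hi = min(lo + 1, n - 1)
        frac = src - lo
        out.append((1.0 - frac) * frame[lo] + frac * frame[hi])  # linear interpolation
    return out


def selected_frames(segments: Sequence[Segment]) -> List[int]:
    """Indices of frames that belong to phonemes chosen for warping."""
    return [t for label, start, end in segments
            if label in WARPED_PHONEMES
            for t in range(start, end)]


def warp_selected(frames: Sequence[Sequence[float]],
                  segments: Sequence[Segment],
                  alpha: float) -> List[List[float]]:
    """Warp only the selected phoneme sections; leave all other frames untouched."""
    out = [list(f) for f in frames]
    for t in selected_frames(segments):
        out[t] = warp_frame(frames[t], alpha)
    return out


def estimate_alpha(frames: Sequence[Sequence[float]],
                   segments: Sequence[Segment],
                   log_likelihood: Callable[[List[float]], float],
                   candidates: Sequence[float]) -> float:
    """Grid search: pick the alpha whose warped selected frames score highest."""
    chosen = selected_frames(segments)

    def total(alpha: float) -> float:
        return sum(log_likelihood(warp_frame(frames[t], alpha)) for t in chosen)

    return max(candidates, key=total)
```

In this sketch the frames outside the selected sections (for example an unvoiced "s") pass through unchanged, which mirrors the abstract's point that such sections are excluded from both estimation and normalization. The grid search is a stand-in for the maximum likelihood estimation mentioned in the abstract, not a reconstruction of it.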
Publication number JP2002189491(A) Publication date 2002.07.05
Application number JP20000385201 Filing date 2000.12.19
Applicant SHARP CORP Inventor YAMAGUCHI KOICHI;HACHIMAN YOICHIRO
Classification G10L15/06;G10L15/04;G10L15/12;G10L15/14;(IPC1-7):G10L15/06 Main classification G10L15/06
Agency Agent
Main claim
Address