Title of invention
Abstract PROBLEM TO BE SOLVED: To stably extract a speaker's features without depending on the contents of the utterance data. SOLUTION: A frequency function estimation part 5 comprises a phonemic limit estimation part, a frequency measuring part, and a mode extracting part. During learning, the frequency function estimation part 5 estimates phonemic limit information and performs maximum likelihood estimation of the coefficient α of the frequency warping function f for each sample in a phonemic section selected on the basis of the phonemic limit information. The frequency function estimation part 5 then determines a distribution function H(α), which represents the frequency distribution of the per-sample coefficients α, and takes the coefficient α that gives the mode of this distribution as the optimal coefficient of the frequency warping function f. Consequently, a correct frequency warping function f can be estimated even when the frequency distribution has multiple peaks, and the speaker's features can be extracted stably without depending on the contents of the utterance data.
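The mode-extraction step described in the abstract can be illustrated with a minimal sketch. It assumes the per-sample coefficients α have already been estimated by some maximum likelihood procedure (not shown, since the abstract does not give the form of f), and it approximates H(α) with a fixed-width histogram. The function name `mode_coefficient`, the `bin_width` parameter, and the sample values are hypothetical and are not taken from the patent.

```python
from collections import Counter


def mode_coefficient(alphas, bin_width=0.02):
    """Return the centre of the most populated histogram bin of alphas.

    alphas    : per-sample warping coefficients (assumed already estimated)
    bin_width : histogram resolution; an illustrative assumption
    """
    if not alphas:
        raise ValueError("need at least one per-sample coefficient")
    # Histogram H(alpha): count how many samples fall into each bin.
    counts = Counter(round(a / bin_width) for a in alphas)
    # Pick the bin with the highest count; ties go to the smaller bin index.
    best_bin = max(counts, key=lambda b: (counts[b], -b))
    return best_bin * bin_width


# Hypothetical coefficients with two peaks, near 0.90 and near 1.10.
samples = [0.90, 0.903, 0.897, 1.10, 1.104, 0.88]
alpha_mode = mode_coefficient(samples)
alpha_mean = sum(samples) / len(samples)
```

With these illustrative values, the mode falls in the bin around 0.90, while the arithmetic mean is pulled toward the second peak. This is the situation the abstract refers to when it says the mode gives a correct estimate even if the distribution has several peaks.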
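Publication number JP3754614(B2) Publication date 2006.03.15
Application number JP20000385212 Filing date 2000.12.19
Applicant Inventor
Classification G10L15/06;G10L15/10;G10L15/14;G10L15/20;G10L21/02 Main classification G10L15/06
Agency Agent
Principal claim
Address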