Abstract |
<p><P>PROBLEM TO BE SOLVED: To provide a technical means for improving the precision of speech similarity evaluation by strongly reflecting the individual differences of a speaker in a feature quantity of the speech. <P>SOLUTION: A frequency analysis part 51 and an envelope-by-band generation part 52 extract, from an input speech, the components belonging to a plurality of bands spaced apart on the frequency axis, and output their envelopes E-i (i=1 to N). A correlation value calculation part 53 calculates the correlation values ajk between E-j and E-k for all combinations (j, k) in the range j=1 to N, k=1 to N, and outputs an inter-band correlation matrix having these values as its elements. The inter-band correlation matrix is used as a feature quantity of the speech to evaluate the similarity of the speech. <P>COPYRIGHT: (C)2008,JPO&INPIT</p> |
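The processing chain in the SOLUTION paragraph can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the implementation of parts 51 to 53: the abstract does not say how the bands are separated, how the envelopes are obtained, or which correlation measure is used. Here the bands are cut out with an FFT mask, each envelope is a moving average of the rectified band signal, and ajk is taken to be the Pearson correlation coefficient. The function names and the `band_edges` and `smooth_len` parameters are hypothetical.

```python
import numpy as np


def band_envelopes(signal, sample_rate, band_edges, smooth_len=256):
    """Return an (N, T) array holding one amplitude envelope E-i per band.

    Assumptions (not from the abstract): FFT masking for band separation,
    rectification plus moving average for the envelope.
    """
    spectrum = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    kernel = np.ones(smooth_len) / smooth_len
    envelopes = []
    for low, high in band_edges:
        # Keep only the spectral components that fall inside this band.
        mask = (freqs >= low) & (freqs < high)
        band = np.fft.irfft(spectrum * mask, n=len(signal))
        # Rectify, then smooth, to approximate the band's envelope.
        envelopes.append(np.convolve(np.abs(band), kernel, mode="same"))
    return np.array(envelopes)


def inter_band_correlation_matrix(envelopes):
    """Return the N x N matrix whose element (j, k) correlates E-j with E-k.

    Assumption: Pearson correlation is used as the correlation value.
    """
    return np.corrcoef(envelopes)
```

With N bands the result is an N x N symmetric matrix with ones on the diagonal, so only the N(N-1)/2 off-diagonal pairs carry information about the speaker. How two such matrices are compared to score similarity is not specified in the abstract and is left out of the sketch.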