Abstract |
PROBLEM TO BE SOLVED: To reduce the amount of computation while preventing both a loss of precision in the dynamic feature and a resulting drop in the recognition rate, by calculating the dynamic feature from calculated and interpolated static features and forming the feature vector from that dynamic feature. SOLUTION: A static feature calculation means 101 calculates the LPC mel-cepstrum, which serves as the static feature, for each frame of the input speech data. A static feature interpolation means 102 interpolates the LPC mel-cepstrum of a pseudo frame placed between two adjacent frames calculated by the static feature calculation means 101. A dynamic feature calculation means 103 calculates the dynamic feature of the LPC mel-cepstrum of each frame using both the calculated and the interpolated LPC mel-cepstra. A feature vector forming means 104 forms the feature vector of the frame from the static feature and the dynamic feature. A collation means 105 collates the time series of feature vectors against standard patterns and outputs the recognition result.
|
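The flow through means 102 to 104 can be illustrated with a minimal sketch. The abstract does not state the interpolation rule or the dynamic feature formula, so linear midpoint interpolation and a central difference over the neighbouring pseudo frames are assumptions made here purely for illustration. The function names are hypothetical, and the static features are taken as given rather than calculated from speech.

```python
from typing import List

Vector = List[float]


def interpolate_pseudo_frames(static: List[Vector]) -> List[Vector]:
    """Insert one pseudo frame between each pair of adjacent frames.

    Assumption: the pseudo frame is the linear midpoint of its neighbours.
    Real frame i ends up at index 2 * i of the returned sequence.
    """
    dense: List[Vector] = []
    for cur, nxt in zip(static, static[1:]):
        dense.append(cur)
        dense.append([(a + b) / 2.0 for a, b in zip(cur, nxt)])
    if static:
        dense.append(static[-1])
    return dense


def dynamic_features(static: List[Vector]) -> List[Vector]:
    """Dynamic feature of each real frame from calculated + interpolated frames.

    Assumption: a central difference over the nearest neighbours in the
    dense sequence, clamped at the sequence edges.
    """
    dense = interpolate_pseudo_frames(static)
    deltas: List[Vector] = []
    for i in range(len(static)):
        lo = max(2 * i - 1, 0)
        hi = min(2 * i + 1, len(dense) - 1)
        span = hi - lo
        if span == 0:
            # Single-frame input: no neighbours, so the slope is zero.
            deltas.append([0.0] * len(static[i]))
        else:
            deltas.append([(b - a) / span for a, b in zip(dense[lo], dense[hi])])
    return deltas


def feature_vectors(static: List[Vector]) -> List[Vector]:
    """Concatenate the static and dynamic features of each real frame."""
    return [s + d for s, d in zip(static, dynamic_features(static))]


if __name__ == "__main__":
    # Toy two-dimensional static features for three frames.
    frames = [[0.0, 1.0], [2.0, 3.0], [4.0, 1.0]]
    for vec in feature_vectors(frames):
        print(vec)
```

Only the real frames receive a feature vector; the pseudo frames exist solely to refine the dynamic feature, which is how the static feature calculation can run at a coarser frame rate.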