摘要 |
PROBLEM TO BE SOLVED: To obtain a result of voice recognition, substantially simultaneously with the completion of the voice. SOLUTION: After voice data from a voice input device 21 is taken into a voice data processing part 22, and is fed to a successive recording and processing part 23 so as to successively recorded on a recording buffer memory which is sectioned into a plurality of parts. Thus, when recording into one of the parts of the memory is completed, recording to the next memory is started while voice data with which recording is completed, are inputted in a feature extracting part 24a constituting a recognizing part 24, and thereafter the voice data obtains is subjected to frequency analysis so as to obtain a spectrum row which is then inputted a sound element recognizing part 24b constituted by a neural network so as to obtain a sound element candidate row. This candidate row is inputted to a word spot 24c and is verified with a dictionary template 24d and DTW so as to deliver a most resembling word as a result. The output results are calculated by an integrated distance calculating part 25, and thus calculated value is delivered to a back trance part 26 in order to take out a series of recognized words. |