摘要 |
PROBLEM TO BE SOLVED: To evade erroneous recognition by outputting a recognition result candidate the highest in similarity as the first rank of the recognition result when there is not difference in overlapped frame length axists. SOLUTION: A dictionary collation part 4 outputs the recognition result candidate when the similarity between a characteristic vector for every frame converted by a characteristic extraction part 3 and a voice standard pattern registered in a word dictionary 1 exceeds a threshold valueα. Then, a frame superposition judging part 9 in a recognition result selection part 7 sends the recognition result candidate that the frame of a certain recognition candidates don't overlap the frame of another recognition result candidate in the same voice section detected by a voice section detection part 6 to a recognition result output part 8, and sends the recognition result candidate overlapping each other to a frame length comparison part 10. The frame length comparison part 10 selects the recognition result candidate with the longest frame length to send it to the recognition result output part 8, and sends the recognition result candidate with no difference of the superposition frame length to a similarity judging part 11. The similarity judging part 11 sends the recognition result candidate with the largest similarity to the recognition result output part 8.
|