摘要 |
To provide a speech recognition apparatus and the method thereof for correctly recognizing an English word from a non-native English pronunciation, for example. A vector data generating part (180) and a label generating part (182) processes speech data of 1 sentence English speech pronounced by a Japanese to convert it to a label string. A candidate word generating part (184) correlates the label string of 1 sentence to a first candidate word comprising one or more English words. A analogous word adding part (186) uses a word database (160) to search an English word analogous the pronunciation of the first candidate word, such as a analogous word "lead" for a first candidate word "read", for example, (it is difficult for a Japanese to discriminate between "l" and "r" in pronunciation), and adds the obtained analogous word to the first candidate word to make it be a second candidate word. A selection part (188) selects one of the second candidate words as a final result of recognition in response to users operation and connects the selected words into English text data for output. <IMAGE> |