摘要 |
PURPOSE:To improve the recognition rate for a voiceless plosive sound and the voice of a device by recognizing a voiceless signal by a high-order level recognition part by coupling a signal of a phoneme sequence other than the voiceless plosive sound and an identification signal for the phoneme of the voiceless plosive sound in time series, and outputting the recognition result. CONSTITUTION:A voice start/end detection part 1 identifies a voice signal S1 in an input signal S0, extracts a voiceless plosive sound signal S2 and a nonvoiceless plosive sound signal S3 in time series, and inputs them to a waveform envelope detection part 4 and a phoneme analysis part 2. The envelope waveform of the signal S2 inputted to the waveform envelope detection part 4 is detected and a voiceless plosive sound identification part 5 identifies which voiceless plosive phoneme the detected envelope corresponds to and outputs an identification signal S7 to a high-order level recognition part 6. A word containing no voiceless plosive sound or continuous words are converted by the analysis part 2 and phoneme recognition part 3 into a feature vector signal S4 and a phoneme signal S5, which are inputted to the high-order level recognition part 6. The signal which is inputted to the high-order level recognition part 6 is converted into word candidates or syllable candidate sequence, which is outputted as a recognition signal S8. Consequently, the recognition rate for the voiceless plosive sound and the sound of the whole device is improved. |