Abstract |
In a speech recognition apparatus, a feature extracting portion extracts feature parameters from an input speech signal by sliding over it a plurality of frames, each corresponding to a time window of a prescribed length of time, with successively increasing time widths. A word lexicon database stores standard pattern data in correspondence with phoneme patterns of the input speech. A recognition processing portion collates the feature parameters extracted by the feature extracting portion with the standard pattern data to recognize the corresponding phoneme, and outputs a recognition result.
|
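The flow described in the abstract (multi-width frames slid over the signal, feature extraction, collation against stored patterns) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the features (log energy and zero-crossing rate), the Euclidean nearest-pattern rule, the reading of "successively increasing time width" as a set of frames of growing length sharing one start point, and all function and parameter names (`frame_features`, `extract_features`, `recognize`, `recognize_signal`, `widths`, `shift`, `templates`).

```python
import math


def frame_features(samples):
    """Log energy and zero-crossing rate of one frame (illustrative features only)."""
    energy = sum(s * s for s in samples) / len(samples)
    crossings = sum(
        1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0)
    )
    return [math.log(energy + 1e-10), crossings / (len(samples) - 1)]


def extract_features(signal, start, widths):
    """Concatenate features of frames with increasing widths, all starting at `start`."""
    feats = []
    for width in widths:
        feats.extend(frame_features(signal[start:start + width]))
    return feats


def recognize(feats, templates):
    """Collate a feature vector with stored patterns; return the nearest label."""
    return min(templates, key=lambda label: math.dist(feats, templates[label]))


def recognize_signal(signal, widths, shift, templates):
    """Slide the frame set over the signal by `shift` samples and label each position."""
    last_start = len(signal) - max(widths)
    return [
        recognize(extract_features(signal, start, widths), templates)
        for start in range(0, last_start + 1, shift)
    ]
```

In this sketch, `templates` plays the role of the standard pattern data: a hypothetical dictionary mapping each phoneme label to a reference feature vector built with the same `widths`. A practical recognizer would use richer features and a statistical or time-aligned matching method; the nearest-pattern rule is used here only to keep the example short.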