Abstract |
A speech recognition feature extractor for extracting speech features from a speech signal, comprising: a time-to-frequency domain transformer (FFT) for generating spectral magnitude values in the frequency domain from the speech signal; a frequency domain filtering block (Mel) for generating sub-band values, each relating to the spectral magnitude values of a certain frequency sub-band; a compression block (LOG) for compressing said sub-band values; a transformation block (DCT) for obtaining a set of de-correlated features from the compressed sub-band values; and a normalising block (CN) for normalising the de-correlated features. |
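The chain of blocks named in the abstract (FFT, Mel, LOG, DCT, CN) can be illustrated with a minimal sketch. It is not the claimed implementation: the function names, the frame length, the number of filters and coefficients, the triangular filter shape, the common 2595·log10(1 + f/700) mel formula, and the choice of per-coefficient mean and variance normalisation over the whole utterance are all assumptions made for illustration. A naive DFT stands in for the FFT so that only the standard library is needed.

```python
import cmath
import math

def magnitude_spectrum(frame):
    """FFT block (naive DFT here): magnitudes for bins 0..N/2."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame)))
            for k in range(n // 2 + 1)]

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(num_filters, num_bins, sample_rate):
    """Mel block weights: triangular filters evenly spaced on the mel scale (assumed shape)."""
    nyquist = sample_rate / 2.0
    top = hz_to_mel(nyquist)
    # Filter edges and centres expressed as fractional bin positions.
    points = [mel_to_hz(top * i / (num_filters + 1)) / nyquist * (num_bins - 1)
              for i in range(num_filters + 2)]
    bank = []
    for j in range(1, num_filters + 1):
        lo, mid, hi = points[j - 1], points[j], points[j + 1]
        row = []
        for b in range(num_bins):
            if lo < b <= mid:
                row.append((b - lo) / (mid - lo))
            elif mid < b < hi:
                row.append((hi - b) / (hi - mid))
            else:
                row.append(0.0)
        bank.append(row)
    return bank

def dct(values, num_coeffs):
    """DCT block: unscaled DCT-II, keeping the first num_coeffs coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n)
                for i, v in enumerate(values))
            for k in range(num_coeffs)]

def normalise(features):
    """CN block (assumed form): zero mean, unit variance per coefficient over all frames."""
    n = len(features)
    out = [list(f) for f in features]
    for k in range(len(features[0])):
        col = [f[k] for f in features]
        mean = sum(col) / n
        std = math.sqrt(sum((v - mean) ** 2 for v in col) / n)
        scale = std if std > 1e-12 else 1.0  # leave constant coefficients unscaled
        for i in range(n):
            out[i][k] = (col[i] - mean) / scale
    return out

def extract_features(signal, sample_rate=8000, frame_len=64,
                     num_filters=8, num_ceps=5):
    """Run FFT -> Mel -> LOG -> DCT per frame, then CN across frames."""
    num_bins = frame_len // 2 + 1
    bank = mel_filterbank(num_filters, num_bins, sample_rate)
    ceps = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        spectrum = magnitude_spectrum(frame)                      # FFT
        sub_bands = [sum(w * s for w, s in zip(row, spectrum))    # Mel
                     for row in bank]
        compressed = [math.log(max(v, 1e-10)) for v in sub_bands]  # LOG (floored)
        ceps.append(dct(compressed, num_ceps))                    # DCT
    return normalise(ceps)                                        # CN
```

Calling `extract_features(signal)` on a list of samples returns one normalised feature vector per non-overlapping frame. A practical front end would typically add windowing, frame overlap and a real FFT, all of which are omitted here to keep the block order visible.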