Abstract |
A method for speech recognition, comprising: inputting frames comprising samples of an audio signal; forming, for each frame, a feature vector comprising a first number of vector components; projecting the feature vector onto at least two subspaces so that each projected feature vector has fewer components than the first number and the total number of components of the projected feature vectors equals the first number; defining a set of mixture models for each projected vector which provides the highest observation probability; and analyzing the set of mixture models to determine the recognition result. When the recognition result is found, the method further comprises determining a confidence measure for the recognition result, the determining comprising: determining a probability that the recognition result is correct; determining a normalizing term; and dividing the probability by the normalizing term.
|
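The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation, and several details are assumptions made only for illustration: the projection is taken to be a simple split of the feature vector into consecutive sub-vectors, each mixture component is a diagonal-covariance Gaussian, and the normalizing term is passed in as a plain number because the abstract does not say how it is computed. All function names (`split_into_subspaces`, `best_component`, `confidence_measure`) are hypothetical.

```python
import math


def split_into_subspaces(feature_vector, sizes):
    """Split a feature vector into consecutive sub-vectors.

    Assumption: an axis-aligned split stands in for the projection.
    Each sub-vector is shorter than the original, and the sizes sum
    to the original length, as the abstract requires.
    """
    n = len(feature_vector)
    if len(sizes) < 2:
        raise ValueError("at least two subspaces are required")
    if any(s <= 0 or s >= n for s in sizes) or sum(sizes) != n:
        raise ValueError("sizes must be positive, smaller than n, and sum to n")
    parts, start = [], 0
    for size in sizes:
        parts.append(feature_vector[start:start + size])
        start += size
    return parts


def log_diag_gaussian(x, mean, var):
    """Log-density of x under a diagonal-covariance Gaussian."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def best_component(sub_vector, components):
    """Return (index, log-likelihood) of the highest-scoring component.

    `components` is a list of (mean, variance) pairs; this layout is an
    assumption, not something the abstract specifies.
    """
    scores = [log_diag_gaussian(sub_vector, m, v) for m, v in components]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores[best]


def confidence_measure(probability, normalizing_term):
    """Divide the probability of a correct result by the normalizing term."""
    if normalizing_term <= 0:
        raise ValueError("normalizing term must be positive")
    return probability / normalizing_term
```

As a usage example, `split_into_subspaces([1, 2, 3, 4, 5, 6], [2, 4])` produces two sub-vectors of 2 and 4 components, `best_component` can then be called once per sub-vector against that subspace's components, and `confidence_measure(p, z)` returns the ratio described in the last sentence of the abstract for whatever probability `p` and normalizing term `z` the recognizer supplies.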