Abstract
A voice pattern in the form of a matrix, comprising a plurality of frames each including time-spectral information and temporal information, is formed from an unknown input voice signal. The voice pattern is partially compared with each of the voice patterns in a library of known voices to select a plurality of candidate voices. Each library voice has a predetermined eigenvector and a predetermined frequency distribution of the inner products between that eigenvector and the frames of its voice pattern. Inner products between the voice pattern of the input voice signal and the eigenvector of each candidate library voice are then calculated, giving one inner product frequency distribution per candidate. The candidate library voice whose predetermined inner product frequency distribution is most similar to the corresponding calculated distribution is selected to identify the input voice signal.
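The final matching step described above can be illustrated with a minimal Python sketch. It rests on assumptions the abstract does not state: frames are plain lists of floats, the frequency distribution is a fixed-width histogram over a hypothetical range [-1, 1], and similarity is measured with an L1 distance. The names `inner_products`, `frequency_distribution`, `enroll`, and `identify` are illustrative only, and both the derivation of each eigenvector and the preliminary candidate selection are taken as already done.

```python
from typing import Dict, List, Sequence, Tuple

Frame = Sequence[float]


def inner_products(frames: Sequence[Frame], eigvec: Sequence[float]) -> List[float]:
    """Inner product of every frame of a voice pattern with one eigenvector."""
    return [sum(f * e for f, e in zip(frame, eigvec)) for frame in frames]


def frequency_distribution(values: Sequence[float], bins: int = 4,
                           lo: float = -1.0, hi: float = 1.0) -> List[float]:
    """Normalized histogram of inner products (bin layout is an assumption)."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        idx = int((v - lo) / width)
        idx = min(max(idx, 0), bins - 1)  # clamp out-of-range values to the edge bins
        counts[idx] += 1
    total = len(values) or 1
    return [c / total for c in counts]


def enroll(frames: Sequence[Frame], eigvec: Sequence[float],
           bins: int = 4) -> Tuple[List[float], List[float]]:
    """Store a library voice as (eigenvector, predetermined distribution)."""
    hist = frequency_distribution(inner_products(frames, eigvec), bins=bins)
    return list(eigvec), hist


def l1_distance(p: Sequence[float], q: Sequence[float]) -> float:
    """Assumed dissimilarity measure between two distributions."""
    return sum(abs(a - b) for a, b in zip(p, q))


def identify(input_frames: Sequence[Frame],
             candidates: Dict[str, Tuple[List[float], List[float]]]) -> str:
    """Return the candidate whose stored distribution best matches the input's."""
    best_name, best_dist = "", float("inf")
    for name, (eigvec, stored_hist) in candidates.items():
        hist = frequency_distribution(
            inner_products(input_frames, eigvec), bins=len(stored_hist)
        )
        dist = l1_distance(hist, stored_hist)
        if dist < best_dist:
            best_name, best_dist = name, dist
    return best_name


if __name__ == "__main__":
    # Toy two-dimensional frames; real voice patterns would be far larger.
    library = {
        "A": enroll([[1.0, 0.0], [1.0, 0.0], [0.9, 0.1]], [1.0, 0.0]),
        "B": enroll([[0.0, 1.0], [0.0, 1.0], [0.1, 0.9]], [0.0, 1.0]),
    }
    print(identify([[1.0, 0.0], [0.9, 0.1]], library))
```

Note that the input pattern is projected onto each candidate's own eigenvector, so a separate distribution is computed per candidate before comparison; the histogram range and distance measure would need to be chosen to suit real frame data.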