摘要 |
A speaker recognition system including a voiced sound detector for detecting a voiced sound sample from an input utterance of a speaker. A prefiltering circuit derives from the voiced sound sample a compensation parameter indicating a decaying characteristics of a high frequency component of the voiced sound sample and compensates for the voiced sound sample in accordance with the compensation parameter. An estimation circuit is provided for estimating a glottal excitation source pulse of the vocal tract system of the speaker from the compensated voiced sound sample. A glottal pulse-shape is simulated from the estimated glottal excitation source pulse using the compensation parameter detected by the prefiltering circuit. The simulated glottal pulse-shape is analyzed to determine vocal features of the speaker, and a decision is made whether the determined features coincide with reference features stored in a pattern memory.
|