Abstract |
<p>Spoken-name speed dialing is performed by a media server that compares a digitized audio stream against speaker-dependent and speaker-independent models. At least some of the speaker-dependent models are generated dynamically for each voice recognition session. Each speaker-dependent model is penalized according to its length, which improves out-of-vocabulary rejection while minimizing any adverse effect on in-vocabulary performance.</p> |
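<p>The length-based penalty can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the abstract: the names <code>Candidate</code>, <code>recognize</code>, and <code>example_penalty</code> are hypothetical, scores are treated as log-likelihood-style values where higher is better, model length is an arbitrary unit count, rejection is modeled as a speaker-independent filler model winning the comparison, and the penalty shape (larger for shorter models) is only one possible choice.</p>

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Candidate:
    """One model scored against the utterance (hypothetical structure)."""
    label: str
    score: float              # raw match score; higher is better (assumed)
    speaker_dependent: bool   # True for speaker-dependent models
    length: int               # model length in arbitrary units (assumed)
    is_reject: bool = False   # True for an assumed out-of-vocabulary filler model


def example_penalty(length: int) -> float:
    """Assumed penalty shape: shorter models are penalized more."""
    return 10.0 / length


def recognize(candidates: List[Candidate],
              penalty: Callable[[int], float]) -> Optional[str]:
    """Return the winning label, or None if the utterance is rejected.

    Only speaker-dependent models are penalized, as a function of length.
    """
    def adjusted(c: Candidate) -> float:
        if c.speaker_dependent:
            return c.score - penalty(c.length)
        return c.score

    best = max(candidates, key=adjusted)
    return None if best.is_reject else best.label


candidates = [
    Candidate("mom", -50.0, True, 2),
    Candidate("call office", -52.0, True, 10),
    Candidate("<oov>", -54.0, False, 0, is_reject=True),
]
result = recognize(candidates, example_penalty)
```

<p>In this toy data the short model "mom" has the best raw score, but after the assumed penalty the longer "call office" model wins. If only "mom" and the filler model compete, the filler wins and the utterance is rejected.</p>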