摘要 |
<p>A speech recogniser in which the recognition vocabulary is generated from a user's own speech by forming phonemic transcriptions of the user's utterances and using these transcriptions for future recognition purposes. The phonemic transcriptions are generated using a loosely constrained network, preferably one constrained only by noise. The resulting transcriptions therefore bear close resemblance to the user's input speech but require significantly reduced storage requirements compared to known speaker dependent word representations.</p> |