摘要 |
<p>A method of user speech performance evaluation with respect to a reference performance for which a phoneme mark-up is available comprises the steps of capturing input speech from the user and formatting it as frames, and for a respective frame of the input speech, generating probability values for a plurality of phonemes, and generating a probability value for a phoneme class based upon the generated probability values for a plurality of phonemes belonging to that phoneme class, and for a plurality of frames of the input speech, averaging the phoneme class probability values corresponding to the plurality of frames of the input speech, and calculating a user speech performance score based upon the average.</p> |