发明名称 |
SPEECH RECOGNITION ASSISTED EVALUATION ON TEXT-TO-SPEECH PRONUNCIATION ISSUE DETECTION |
摘要 |
Pronunciation issues for synthesized speech are automatically detected using human recordings as a reference within a Speech Recognition Assisted Evaluation (SRAE) framework including a Text-To-Speech flow and a Speech Recognition (SR) flow. A pronunciation issue detector evaluates results obtained at multiple levels of the TTS flow and the SR flow (e.g. phone, word, and signal level) by using the corresponding human recordings as the reference for the synthesized speech, and outputs possible pronunciation issues. A signal level may be used to determine similarities/differences between the recordings and the TTS output. A model level checker may provide results to the pronunciation issue detector to check the similarities of the TTS and the SR phone set including mapping relations. Results from a comparison of the SR output and the recordings may also be evaluation by the pronunciation issue detector. The pronunciation issue detector outputs a list that lists potential pronunciation issue candidates. |
申请公布号 |
US2014257815(A1) |
申请公布日期 |
2014.09.11 |
申请号 |
US201313785573 |
申请日期 |
2013.03.05 |
申请人 |
MICROSOFT CORPORATION |
发明人 |
Zhao Pei;Yan Bo;He Lei;Geng Zhe;Leung Yiu-Ming |
分类号 |
G10L13/08 |
主分类号 |
G10L13/08 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method for determining pronunciation issues, comprising:
receiving text comprising sentences for a Text-To-Speech (TTS) component and a recording of the text that is used as a reference for the text; receiving synthesized speech generated by the TTS component using the text as input to the TTS component; evaluating results received by an evaluation performed at a text level by determining a similarity of the synthesized speech to the recording; evaluating results obtained from a Speech Recognition (SR) component related to different inputs to the SR component comprising the synthesized speech and the recording; and generating a list that includes a ranking of pronunciation issue candidates based on the evaluations. |
地址 |
Redmond WA US |