发明名称 SPEECH RECOGNITION ASSISTED EVALUATION ON TEXT-TO-SPEECH PRONUNCIATION ISSUE DETECTION
摘要 Pronunciation issues for synthesized speech are automatically detected using human recordings as a reference within a Speech Recognition Assisted Evaluation (SRAE) framework including a Text-To-Speech flow and a Speech Recognition (SR) flow. A pronunciation issue detector evaluates results obtained at multiple levels of the TTS flow and the SR flow (e.g. phone, word, and signal level) by using the corresponding human recordings as the reference for the synthesized speech, and outputs possible pronunciation issues. A signal level may be used to determine similarities/differences between the recordings and the TTS output. A model level checker may provide results to the pronunciation issue detector to check the similarities of the TTS and the SR phone set including mapping relations. Results from a comparison of the SR output and the recordings may also be evaluation by the pronunciation issue detector. The pronunciation issue detector outputs a list that lists potential pronunciation issue candidates.
申请公布号 US2014257815(A1) 申请公布日期 2014.09.11
申请号 US201313785573 申请日期 2013.03.05
申请人 MICROSOFT CORPORATION 发明人 Zhao Pei;Yan Bo;He Lei;Geng Zhe;Leung Yiu-Ming
分类号 G10L13/08 主分类号 G10L13/08
代理机构 代理人
主权项 1. A method for determining pronunciation issues, comprising: receiving text comprising sentences for a Text-To-Speech (TTS) component and a recording of the text that is used as a reference for the text; receiving synthesized speech generated by the TTS component using the text as input to the TTS component; evaluating results received by an evaluation performed at a text level by determining a similarity of the synthesized speech to the recording; evaluating results obtained from a Speech Recognition (SR) component related to different inputs to the SR component comprising the synthesized speech and the recording; and generating a list that includes a ranking of pronunciation issue candidates based on the evaluations.
地址 Redmond WA US