Title of invention EVALUATING SPEECH INTELLIGIBILITY OF TEXT-TO-SPEECH SYNTHESIS USING TEMPLATE CONSTRAINED GENERALIZED POSTERIOR PROBABILITY
Abstract Instead of relying on humans to subjectively evaluate the speech intelligibility of a subject, a system evaluates it objectively. The system receives speech input and calculates confidence scores at multiple levels using a Template Constrained Generalized Posterior Probability algorithm. One or more intelligibility classifiers classify the desired entities on an intelligibility scale, using features such as the various confidence scores. The scale of the intelligibility classification can be adjusted to suit the application scenario. Based on the confidence score distributions and the intelligibility classification results at multiple levels, an overall objective intelligibility score is calculated. The objective intelligibility scores can be used to rank the subjects or systems being assessed according to their intelligibility levels. Speech that falls below a predetermined intelligibility level (e.g., utterances with low confidence scores and the most severe intelligibility issues) can be automatically selected for further analysis.
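The scoring flow described in the abstract can be illustrated with a minimal Python sketch. It assumes word-level confidence scores in [0, 1] are already available (the Template Constrained Generalized Posterior Probability computation itself is not shown), and every name, threshold, and aggregation rule below (`LEVELS`, `classify`, `flag_for_review`, the use of a plain mean) is a hypothetical choice for illustration, not the method claimed in the application.

```python
from statistics import mean

# Hypothetical thresholds mapping a confidence score in [0, 1] to a 3-level scale.
# The abstract says the scale is adjustable; these values are assumptions.
LEVELS = [(0.8, "high"), (0.5, "medium"), (0.0, "low")]


def classify(score):
    """Map a confidence score to an intelligibility level (assumed thresholds)."""
    for threshold, label in LEVELS:
        if score >= threshold:
            return label
    return "low"


def utterance_score(word_scores):
    """Utterance-level score: here simply the mean of word-level confidences."""
    return mean(word_scores)


def system_score(utterances):
    """Overall score for one system: mean of its utterance-level scores."""
    return mean(utterance_score(words) for words in utterances.values())


def rank_systems(systems):
    """Return system names ordered from most to least intelligible."""
    return sorted(systems, key=lambda name: system_score(systems[name]), reverse=True)


def flag_for_review(utterances, cutoff=0.5):
    """Select utterance ids whose score falls below the (assumed) cutoff."""
    return sorted(uid for uid, words in utterances.items()
                  if utterance_score(words) < cutoff)


# Made-up confidence scores for two hypothetical TTS systems.
systems = {
    "tts_a": {"u1": [0.9, 0.85, 0.95], "u2": [0.4, 0.3, 0.5]},
    "tts_b": {"u1": [0.6, 0.7, 0.65], "u2": [0.2, 0.3, 0.1]},
}

if __name__ == "__main__":
    print(rank_systems(systems))
    print(flag_for_review(systems["tts_a"]))
```

A plain mean is used here only to keep the example short; the abstract refers to confidence score distributions and classifier outputs at multiple levels, which a real system would presumably combine in a more elaborate way.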
Publication number WO2014015087(A1) Publication date 2014.01.23
Application number WO2013US50969 Filing date 2013.07.18
Applicant MICROSOFT CORPORATION Inventors WANG, LINFANG;TENG, YAN;WANG, LIJUAN;SOONG, FRANK KAO-PING;GENG, ZHE;WALLER, WILLIAM BRAD;HANSON, MARK TILLMAN
Classification G10L25/69;G10L13/00 Main classification G10L25/69
Agency Agent
Principal claim
Address