Title Methods and apparatus for rapid acoustic unit selection from a large speech corpus
Abstract A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a given sentence. However, because concatenation costs, which measure the mismatch between sequential pairs of acoustic units, are expensive to compute, processing can be greatly reduced by pre-computing and caching them. Unfortunately, the number of possible sequential pairs of acoustic units makes caching all of them prohibitive. A method for constructing an efficient concatenation cost database is provided: a large body of speech is synthesized, and the sequential acoustic unit pairs that are generated are identified together with their respective concatenation costs. By constructing a concatenation cost database in this fashion, the processing power required at run-time is greatly reduced with negligible effect on speech quality.
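The caching idea in the abstract can be illustrated with a minimal Python sketch. It assumes the unit sequences chosen while synthesizing a training text are already available as lists of integer unit IDs. The names `build_concat_cost_cache`, `lookup_cost`, and `default_cost`, as well as the fallback to a default cost for uncached pairs, are hypothetical and are not taken from the record above.

```python
from typing import Callable, Dict, Iterable, Sequence, Tuple

UnitId = int
Pair = Tuple[UnitId, UnitId]


def build_concat_cost_cache(
    selected_sequences: Iterable[Sequence[UnitId]],
    concat_cost: Callable[[UnitId, UnitId], float],
) -> Dict[Pair, float]:
    """Cache costs only for unit pairs that actually occurred during synthesis.

    `selected_sequences` stands in for the output of running unit selection
    over a large body of text (an assumption of this sketch).
    """
    cache: Dict[Pair, float] = {}
    for seq in selected_sequences:
        # Walk each pair of adjacent units in the selected sequence.
        for left, right in zip(seq, seq[1:]):
            if (left, right) not in cache:
                # The expensive computation runs once per distinct pair.
                cache[(left, right)] = concat_cost(left, right)
    return cache


def lookup_cost(
    cache: Dict[Pair, float],
    left: UnitId,
    right: UnitId,
    default_cost: float,
) -> float:
    """Run-time lookup; uncached pairs get a default cost (assumed policy)."""
    return cache.get((left, right), default_cost)


def toy_cost(left: UnitId, right: UnitId) -> float:
    """Placeholder for a real acoustic mismatch measure."""
    return float(abs(left - right))
```

For example, `build_concat_cost_cache([[1, 2, 3], [2, 3, 4]], toy_cost)` stores three distinct pairs rather than all sixteen possible ordered pairs over four units, which is the saving the abstract describes at a much larger scale.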
Publication number US8086456(B2) Publication date 2011.12.27
Application number US20100839937 Filing date 2010.07.20
Applicant BEUTNAGEL MARK CHARLES;MOHRI MEHRYAR;RILEY MICHAEL DENNIS;AT&T INTELLECTUAL PROPERTY II, L.P. Inventor BEUTNAGEL MARK CHARLES;MOHRI MEHRYAR;RILEY MICHAEL DENNIS
Classification G10L13/04;G10L13/06 Main classification G10L13/04