Title of invention: Method and system for enhancing a speech database
Abstract: A system, method, and computer-readable medium that enhance a speech database for speech synthesis are disclosed. The method may include labeling audio files in a primary speech database, identifying segments in the labeled audio files that have varying pronunciations based on language differences, identifying replacement segments in a secondary speech database, enhancing the primary speech database by substituting the identified secondary speech database segments for the corresponding identified segments in the primary speech database, and storing the enhanced primary speech database for use in speech synthesis.
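The substitution step summarized in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `Segment` record, the `enhance` function, and the idea of matching primary and secondary segments by an identical label. The patent text does not specify how segments are represented or matched.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """Hypothetical labeled segment: a phonetic label plus an audio reference."""
    label: str
    audio_id: str


def enhance(primary, secondary, varying_labels):
    """Return a copy of `primary` in which every segment whose label is in
    `varying_labels` is replaced by the first secondary segment carrying the
    same label. Segments without a replacement are kept unchanged.
    Matching by equal label is an assumption of this sketch."""
    replacements = {}
    for seg in secondary:
        # Keep only the first candidate seen for each label.
        replacements.setdefault(seg.label, seg)

    enhanced = []
    for seg in primary:
        if seg.label in varying_labels and seg.label in replacements:
            enhanced.append(replacements[seg.label])
        else:
            enhanced.append(seg)
    return enhanced


primary = [Segment("r", "primary_001"), Segment("a", "primary_002")]
secondary = [Segment("r", "secondary_001")]
enhanced = enhance(primary, secondary, {"r"})
```

In this sketch the enhanced list would then be stored in place of the primary database; a real system would presumably also check acoustic compatibility between the substituted segment and its neighbors, which is not modeled here.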
Publication number: US8977552(B2)
Publication date: 2015.03.10
Application number: US201414288815
Filing date: 2014.05.28
Applicant: AT&T Intellectual Property II, L.P.
Inventors: Conkie Alistair D.; Syrdal Ann K.
Classification: G10L13/00; G10L13/06; G10L25/00; G10L21/00; G10L13/08
Main classification: G10L13/00
Agency:
Agent:
Principal claim: 1. A method comprising: selecting, via a processor, a speech segment associated with text, wherein the speech segment is selected from a primary speech database which has been modified by: identifying primary speech segments in the primary speech database which do not meet a need of a text-to-speech process; identifying replacement speech segments which satisfy the need in a secondary speech database; and enhancing the primary speech database by substituting, in the primary database, the primary speech segments with the replacement speech segments; and generating, via the processor, speech corresponding to the text using the speech segment.
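The selection and generation steps of the claim can be sketched as follows, assuming the database has already been enhanced. The `synthesize` function, the word-to-unit `lexicon`, and the representation of a segment as a list of sample values are all hypothetical and chosen only for illustration. Plain concatenation stands in for whatever unit-selection and signal-processing steps an actual text-to-speech system would apply.

```python
def synthesize(text, lexicon, database):
    """Look up the unit labels for each word of `text`, select the matching
    segment from the (already enhanced) database, and concatenate the
    segments' samples into one output list.
    `lexicon` maps a lowercase word to a list of unit labels;
    `database` maps a unit label to a list of audio samples.
    Both structures are assumptions of this sketch."""
    samples = []
    for word in text.lower().split():
        for unit in lexicon[word]:
            # Select the stored segment for this unit and append its samples.
            samples.extend(database[unit])
    return samples


lexicon = {"hi": ["h", "ai"]}
database = {"h": [0.1], "ai": [0.2, 0.3]}
speech = synthesize("Hi", lexicon, database)
```

A word missing from the lexicon raises `KeyError` in this sketch; how a real system handles out-of-vocabulary text is outside what the claim describes.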
Address: Atlanta GA US