发明名称 |
Synthesizing speech from text |
摘要 |
Speech is synthesized for a given text by determining a sequence of phonetic components based on the text, determining a sequence of target phonetic elements associated phonetic components, determining a sequence of target event types associated with the phonetic components and determining a sequence of speech units from a plurality of stored speech unit candidates by use of a cost function. The cost function comprises a unit cost, a concatenation cost, and an event type cost for each speech unit in the sequence of speech units. The unit cost of a speech unit is determined with respect to the corresponding target phonetic element, while the concatenation cost of a speech unit is determined with respect to adjacent speech units and the event type cost of each speech unit is determined with respect to the corresponding target event type.
|
申请公布号 |
US8249874(B2) |
申请公布日期 |
2012.08.21 |
申请号 |
US20080036971 |
申请日期 |
2008.02.25 |
申请人 |
MOEHLER GREGOR;ZEHNPFENNING ANDREAS;NUANCE COMMUNICATIONS, INC. |
发明人 |
MOEHLER GREGOR;ZEHNPFENNING ANDREAS |
分类号 |
G10L13/06;G10L13/08;G10L13/10 |
主分类号 |
G10L13/06 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|