发明名称 |
SPEECH SYNTHESIS SYSTEM, SPEECH SYNTHESIS PROGRAM PRODUCT, AND SPEECH SYNTHESIS METHOD |
摘要 |
Waveform concatenation speech synthesis with high sound quality. Prosody with both high accuracy and high sound quality is achieved by performing a two-path search including a speech segment search and a prosody modification value search. An accurate accent is secured by evaluating the consistency of the prosody by using a statistical model of prosody variations (the slope of fundamental frequency) for both of two paths of the speech segment selection and the modification value search. In the prosody modification value search, a prosody modification value sequence that minimizes a modified prosody cost is searched for. This allows a search for a modification value sequence that can increase the likelihood of absolute values or variations of the prosody to the statistical model as high as possible with minimum modification values.
|
申请公布号 |
US2013268275(A1) |
申请公布日期 |
2013.10.10 |
申请号 |
US201213731268 |
申请日期 |
2012.12.31 |
申请人 |
NUANCE COMMUNICATIONS, INC. |
发明人 |
TACHIBANA RYUKI;NISHIMURA MASAFUMI |
分类号 |
G10L13/00 |
主分类号 |
G10L13/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|