Title of invention Text-to-speech technology with early emission
Abstract The method creates a speech output from a succession of input linguistic target elements that include target characteristics. The speech output is formed by concatenating a sequence of selected waveform units, each selected waveform unit corresponding to an input linguistic target element. The method repeats iterative sequences of forward steps, backward steps and the creation of speech output until the forward steps have reached the final target element. For a succession of input linguistic target elements starting with an initial target element and ending with a final target element, the method emits the same optimal sequence of selected waveform units as the standard Viterbi search, but the optimal units become available in a pipelined manner, without requiring the calculation of path costs for the final target element and without complete backtracking from the final to the initial target element. The latency, i.e. the amount of computation time before selected waveform units are output for a beginning part of the target sequence, is much shorter than in a Viterbi search.
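The interplay of forward steps, backward steps and early output described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed method: it uses the generic path-convergence idea (a unit is final once every surviving partial path passes through it), and the names `forward_step`, `trace_back`, `viterbi`, `early_emission`, the `target_costs` layout and the `join_cost(t, i, j)` signature are all assumptions made for illustration.

```python
from typing import Callable, Iterator, List, Tuple

# Hypothetical signature: cost of joining unit i (target t-1) to unit j (target t).
JoinCost = Callable[[int, int, int], float]


def forward_step(cost: List[float], tcosts: List[float], t: int, join_cost: JoinCost):
    """Extend the best partial paths by one target element."""
    new_cost, ptrs = [], []
    for j, tc in enumerate(tcosts):
        i_best = min(range(len(cost)), key=lambda i: cost[i] + join_cost(t, i, j))
        new_cost.append(cost[i_best] + join_cost(t, i_best, j) + tc)
        ptrs.append(i_best)
    return new_cost, ptrs


def trace_back(back, j: int, start: int, stop: int) -> List[int]:
    """Follow back-pointers from unit j at target `start` down to target `stop`."""
    segment = [j]
    for u in range(start, stop, -1):
        j = back[u][j]
        segment.append(j)
    return segment[::-1]


def viterbi(target_costs: List[List[float]], join_cost: JoinCost) -> List[int]:
    """Reference search: forward pass to the end, then one full backtrack."""
    cost, back = list(target_costs[0]), [None]
    for t in range(1, len(target_costs)):
        cost, ptrs = forward_step(cost, target_costs[t], t, join_cost)
        back.append(ptrs)
    best = min(range(len(cost)), key=cost.__getitem__)
    return trace_back(back, best, len(target_costs) - 1, 0)


def early_emission(
    target_costs: List[List[float]], join_cost: JoinCost
) -> Iterator[Tuple[int, int, int]]:
    """Yield (target index, unit index, forward position at emission time)."""
    n = len(target_costs)
    cost, back, emitted = list(target_costs[0]), [None], 0
    for t in range(1, n):
        cost, ptrs = forward_step(cost, target_costs[t], t, join_cost)
        back.append(ptrs)
        # Backward step: shrink the set of surviving ancestors until it
        # converges to one unit or reaches the first target not yet emitted.
        alive, s = set(range(len(cost))), t
        while s > emitted and len(alive) > 1:
            alive = {back[s][j] for j in alive}
            s -= 1
        if len(alive) == 1:
            # Every surviving path shares this prefix, so it is already final.
            for k, unit in enumerate(trace_back(back, alive.pop(), s, emitted)):
                yield emitted + k, unit, t
            emitted = s + 1
    if emitted < n:
        # Flush the tail with an ordinary backtrack from the best final unit.
        best = min(range(len(cost)), key=cost.__getitem__)
        for k, unit in enumerate(trace_back(back, best, n - 1, emitted)):
            yield emitted + k, unit, n - 1


if __name__ == "__main__":
    costs = [[1.0, 2.0], [0.5], [1.0, 1.0, 3.0], [2.0, 0.1]]
    for target, unit, position in early_emission(costs, lambda t, i, j: 0.0):
        print(f"target {target}: unit {unit} (emitted at forward position {position})")
```

Because both functions share the same forward step and back-pointers, the sketch emits the same unit sequence as the full `viterbi` backtrack; the only difference is that prefixes are yielded as soon as they are unambiguous, which is the latency benefit the abstract refers to.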
Publication number EP2474972(B1) Publication date 2013.12.04
Application number EP20110150490 Filing date 2011.01.10
Applicant SVOX AG Inventors WOUTERS, JOHAN;SAFRA, SCHAMAI;HOLM, BLEICKE
Classification G10L13/06 Main classification G10L13/06
Agency Agent
Principal claim
Address