摘要 |
PURPOSE:To provide a synthetic voice which is more understandable and which is natural, by determining a pose length, and a number and positions of poses in relation to the structure of an entire sentence. CONSTITUTION:When a sentence 'This is a voice synthesizing device' is inputted, a text analyzing part 2 divides a word so as to obtain data for an accent and reading. Then, in a rhythm processing part 4, a time length setting part 4a and an FO setting part 4d set time lengths of sound elements and basic frequencies, respectively. Further, a time cumulating part 4c obtains a total time length, excepting a pose part, and with the use of this total time length, a pose cycle number and a pose length are determined from predetermined formulae. In the previous example, a uniform pose length which is relatively long is applied to each of 'This is' and ',', and accordingly, unnatural voice is corrected. Then, a parameter forming part 5 obtains synthesizing parameters such as a formant value and an amplitude, and a parameter interpolating part 7 interpolates the parameters. A voice synthesizing part 7 synthesizes a voice with the use of formant type synthesizer. |