摘要 |
PROBLEM TO BE SOLVED: To generate proper rhythm information for synthesizing a natural voice of an arbitrary text by supplying the rhythm state and rhythm parameter of the rhythm structure of a word according to the word, syllables, and clock synchronism. SOLUTION: The neural network is divided functionally into two parts. The 1st part consists of the 1st part of an input layer and a 1st hidden layer, and its output is all fed back to its input. This is judged to be a rhythm model for searching for the rhythm structure of high word level of a voice of a human language by using only linguistic features of the voice of the human language. This operates in synchronism with the word and clock and generates output representing the rhythm state of the rhythm structure of the current word. The 2nd part consists of the 2nd part of the input layer, a 2nd hidden layer, and an output layer. This is an actual rhythm parameter generating device. |