Abstract |
PROBLEM TO BE SOLVED: To synchronize a moving picture with synthesized sound by defining an interface between a TTS (text-to-speech) system and information in which successive lip movements are formalized and normalized per event. SOLUTION: The multimedia information received by a multimedia information input part 10 comprises text, a moving picture, and synchronizing information. A language processing part 12 converts the transmitted text into a phoneme string and estimates the prosodic information (the duration, pitch value, and energy value of each phoneme) according to prosody control rules derived from syntactic structure information. A prosody processing part 13 calculates prosodic parameters such as phoneme duration, pitch contour, energy contour, and pause position and length. To synchronize the synthesized sound with the moving picture, a synchronization adjuster 14 adjusts the duration of each phoneme using the synchronizing information sent from a multimedia distributor 11. The adjusted information is passed to a signal processing part 15, which selects the data needed for synthesis from a synthesis unit database 1, corrects that data against the phoneme information, and then generates and outputs the synthesized sound. |
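
The duration adjustment performed by the synchronization adjuster 14 can be illustrated with a minimal sketch. The abstract does not specify the format of the synchronizing information or the adjustment rule, so everything here is an assumption made for illustration: the hypothetical `SyncEvent` record maps a span of phonemes to the time the moving picture allots to it, and the hypothetical `adjust_durations` function rescales the predicted durations in that span proportionally.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Phoneme:
    symbol: str
    duration_ms: float  # duration predicted by the prosody stage


@dataclass
class SyncEvent:
    # Hypothetical layout of one unit of synchronizing information.
    first: int          # index of the first phoneme covered by the event
    last: int           # index of the last phoneme covered (inclusive)
    target_ms: float    # time the moving picture allots to this span


def adjust_durations(phonemes: List[Phoneme], events: List[SyncEvent]) -> List[Phoneme]:
    """Return a copy of `phonemes` with each event span rescaled to its target.

    Assumption: proportional scaling. The source only says that durations
    are adjusted using the synchronizing information.
    """
    adjusted = [Phoneme(p.symbol, p.duration_ms) for p in phonemes]
    for ev in events:
        span = adjusted[ev.first:ev.last + 1]
        predicted = sum(p.duration_ms for p in span)
        if predicted <= 0:
            continue  # nothing to scale in an empty or zero-length span
        scale = ev.target_ms / predicted
        for p in span:
            p.duration_ms *= scale
    return adjusted
```

For example, if the first two phonemes are predicted to last 100 ms and 50 ms but the corresponding lip-movement event lasts 300 ms, both are stretched by a factor of two and phonemes outside the event are left unchanged. Proportional scaling is chosen only because it is the simplest rule that preserves the relative durations within a span.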