摘要 |
PURPOSE: To constitute a speech file which is highly comprehensive for both context and rhythm in the speech file constitution system which is applied to waveform synthesis for obtaining a synthesized sound by connecting phoneme waveforms. CONSTITUTION: First, waveform data having the same phoneme labels are segmented as an initial cluster 110 form speech waveform data stored in a speech data base. Next, clustering based on context is performed in a specific characteristic parameter space for the initial cluster 110. Then, clustering in a specific rhythm pattern space is performed for respective clusters 130, 140, and 150 obtained by the context clustering. Lastly, waveform data which are closest to the centroids of fine clusters 131-133, 141-143, and 151-153 obtained by the rhythm clustering are extracted from the fine clusters 131-133, 141-143, and 151-153 and registered in the speech file 160 for synthesis. |