Abstract |
PROBLEM TO BE SOLVED: To perform natural speech conversion even when no target speech exists in the corresponding portion of the performance, in cases where the sound source is entirely different, such as when the tones of a musical instrument are converted to resemble the predetermined singing of a singer. SOLUTION: The sinusoidal components and residual components of the input speech are deformed in accordance with the sinusoidal and residual components extracted from the input speech and the target sinusoidal components and target residual components extracted from the target speech, and the converted speech is generated by synthesizing the deformed components. When the portion of the target speech that must correspond to the input speech is silent, the sinusoidal and residual components of the input speech are instead deformed in accordance with preset silent-part sinusoidal components and silent-part residual components, in place of the target sinusoidal components and target residual components. Natural speech conversion can therefore be performed even when the sound source is entirely different and no target speech exists in the corresponding portion of the performance. |
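
The silent-portion fallback described in the solution can be sketched as follows. This is only an illustration under stated assumptions, not the method claimed in the patent: the abstract does not specify how the components are deformed, so linear interpolation stands in for that step, and the names `Frame`, `blend`, `convert_frame`, and `ratio` are hypothetical. A silent target portion is represented here by `None`.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Frame:
    """One analysis frame (hypothetical layout, not taken from the abstract)."""
    sine_amps: List[float]  # amplitudes of the sinusoidal components
    residual: List[float]   # magnitudes of the residual components


def blend(source: List[float], reference: List[float], ratio: float) -> List[float]:
    """Move `source` toward `reference` by `ratio` (0 = source, 1 = reference).

    Linear interpolation is an assumption standing in for the unspecified
    deformation step.
    """
    return [(1.0 - ratio) * s + ratio * r for s, r in zip(source, reference)]


def convert_frame(inp: Frame, target: Optional[Frame],
                  silent_preset: Frame, ratio: float = 0.5) -> Frame:
    """Deform one input frame toward the target frame.

    When the corresponding target portion is silent (`target is None`),
    the preset silent-part components are used in place of the target
    components.
    """
    reference = target if target is not None else silent_preset
    return Frame(
        sine_amps=blend(inp.sine_amps, reference.sine_amps, ratio),
        residual=blend(inp.residual, reference.residual, ratio),
    )


# Illustrative values only.
instrument = Frame(sine_amps=[1.0, 2.0], residual=[0.5, 0.5])
singer = Frame(sine_amps=[3.0, 4.0], residual=[1.5, 0.5])
silent_preset = Frame(sine_amps=[0.0, 0.0], residual=[0.0, 0.0])

voiced = convert_frame(instrument, singer, silent_preset)
fallback = convert_frame(instrument, None, silent_preset)
```

In this sketch the fallback keeps the conversion path identical for voiced and silent target portions; only the reference components change, which is the behavior the abstract attributes to the preset silent-part components.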