摘要 |
PROBLEM TO BE SOLVED: To allow a user to optionally select various synthetic voices by preparing a voice dictionary generated from the voices of a specific person, connecting and interpolating voice element pieces based on the extracted phoneme code lines, and generating phoneme series. SOLUTION: When the voice of a person is inputted, a speech recognition section 101 recognizes the voice and detects voice information. The detected voice information is analyzed by a recognized voice analysis section 102, and vocalization information is extracted. A voice synthesis section 103 synthesizes the voice signal based on the vacalization information. The voice signal is voice output-converted and outputted. A user can synthesize a voice different from the tone quality of the original person based on the words spoken by the person according to the atmosphere and situation with a voice dictionary independently having features based on the voices of the person and various information. An image matched with the background sound and voice can be selected, and the synthesis of various voices can be enjoyed. |