摘要 |
PROBLEM TO BE SOLVED: To provide a TV phone apparatus that can send a substitute image having natural expression suitable for the contents of voice. SOLUTION: When a transmission target is set to be a substitute image, a voice recognition section 10 carries out specific voice recognition processing to the voice of a speaker that is collected by a microphone 2a in a handset 2, and specifies pronunciation contents (six kinds of Japanese vowels and a Japanese snapping sound). A substitute image generation section 14 selects the substitute image having a mouth shape corresponding to the pronunciation contents specified by the voice recognition section 10 out of a plurality of kinds of substitute images stored into a substitute image memory 12. The selected substitute image is sent to a listener with the voice of the speaker. |