摘要 |
An information transmission device (1) which analyzes a prosody of a speaker and provides an utterance in accordance with the prosody of the speaker, and which has a microphone (M) detecting a sound signal of the speaker, a feature value extraction unit (10) extracting a feature value of the prosody of the speaker based on the sound signal detected by the microphone (M), a voice synthesis unit (30) synthesizing a voice signal to be uttered so that the voice signal has the same feature value as the diction of the speaker, based on the feature value extracted by the feature extraction unit (10), and a voice output unit (40) performing an utterance based on the voice signal synthesized by the voice synthesis unit (30). Phoneme recognition is used for analyzing the input signal. Conveying of emotions by means of colors is also used. |