摘要 |
<P>PROBLEM TO BE SOLVED: To provide an apparatus for integrally controlling a video signal and a voice signal capable of creating a space with a feeling of real life wherein a receiver side can eliminate deviation between a talker at a transmission side and voice or sound uttered by the talker, so that the apparatus can reproduce a state of the transmission side talker uttering the voice or sound as it is. <P>SOLUTION: A face region detection section 108 calculates face region positional information of a person from video information. A sound receiving direction determining section 105 prescribes an existing direction of a transmission side conference participant on the basis of the face region positional information and a zoom magnification of a camera 103 and the direction of the camera. A sound receiving section 104 acquires sound information from the prescribed existence direction. A sound receiving reproducing section 109 forms an image of the sound information around a face region of the conference participant at the transmission side displayed on a display apparatus 107 of a receiver side terminal on the basis of the face region positional information. <P>COPYRIGHT: (C)2007,JPO&INPIT |