摘要 |
<p>The embodiments of the present invention relate to a method, device and system for processing video/audio information in a video conference.The method includes: receiving data code streams transmitted from at least two conference terminals, and decoding the data code streams to obtain at least two channels of decoded information; when determining that sign language information exists in the at least two channels of decoded information, converting the sign language into voice information, and performing voice synthesis on the converted voice information to generate synthetic voice information; performing audio mixing on the generated synthetic voice information with other decoded audio information; and transmitting the audio-mixed audio information to at least two conference sites.</p> |