发明名称 Audio-visual communication system having integrated perceptual speech and video coding.
摘要 Disclosed is a low bit rate audio and video communication system which employs an integrated encoding system that dynamically allocates available bits among the audio and video signals to be encoded based on the content of the audio and video information and the manner in which the audio and video information will be perceived by a viewer. A dynamic bit allocation and encoding process will evaluate the current content of the audio and video information and allocate the available bits among the audio and video signals to be encoded. In addition, an appropriate audio encoding technique is dynamically selected based on the current content of the audio signal. A face location detection subroutine will detect and model the location of faces in each video frame, in order that the facial regions may be more accurately encoded than other portions of the video frame. A lip motion detection subroutine will detect the location and movement of the lips of a person present in a video scene, in order to determine when a person is speaking and to encode the lip regions more accurately. The audio and video signals generated by a second party to a communication are monitored to determine if the second party is paying attention to the audio and video information transmitted by the first party to the communication. <IMAGE>
申请公布号 EP0676899(A3) 申请公布日期 1997.11.19
申请号 EP19950302111 申请日期 1995.03.29
申请人 AT&T CORP. 发明人 ZHOU, YONG
分类号 G06K9/00;G06T9/00;G10L19/00;H04N7/26;H04N7/50;H04N21/2368;H04N21/434 主分类号 G06K9/00
代理机构 代理人
主权项
地址