Title of Invention Acoustic-assisted image processing
Abstract Acoustic-assisted image processing is achieved, in accordance with the invention, by a novel method and apparatus in which an audio signal is sampled at an audio-domain sampling rate; a first viseme sequence is generated at a first rate in response to the sampled audio signal, the first rate corresponding to the audio-domain sampling rate; the first viseme sequence is transformed into a second viseme sequence at a second rate using a predetermined set of transformation criteria, the second rate corresponding to a video-domain frame rate; and an image is processed in response to the second viseme sequence. In an illustrative example of the invention, a video image of the face of a human speaker is animated using a three-dimensional wire-frame facial model onto which a surface texture is mapped. The three-dimensional wire-frame facial model is structurally deformed in response to a rate-transformed viseme sequence extracted from a speech signal, so that the mouth region of the video image moves in correspondence with the speech. Advantageously, the animation is accomplished in real time, works with any speaker, places no limitations on vocabulary, and requires no special action on the part of the speaker.
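The rate transformation named in the abstract can be illustrated with a minimal sketch. The abstract does not disclose the "predetermined set of transformation criteria", so the rule used here (a majority vote over the audio-rate visemes that fall within each video frame interval, repeating the previous label when an interval is empty) is an assumption made only for illustration. The function name `retime_visemes`, the viseme labels, and the rates of 100 labels per second and 30 frames per second are likewise hypothetical.

```python
from collections import Counter


def ceil_div(a, b):
    """Integer ceiling of a / b for non-negative a and positive b."""
    return -(-a // b)


def retime_visemes(visemes, src_rate, dst_rate):
    """Resample a viseme label sequence from src_rate to dst_rate (labels per second).

    Assumed criterion (not taken from the patent): each output frame takes the
    most frequent input label whose start time falls inside that frame's
    interval. Ties go to the label seen first in the interval. An interval
    with no input label repeats the previous output label.
    """
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("rates must be positive integers")

    n_out = ceil_div(len(visemes) * dst_rate, src_rate)
    out = []
    for k in range(n_out):
        # Input label i starts at time i / src_rate; output frame k covers
        # [k / dst_rate, (k + 1) / dst_rate).
        start = ceil_div(k * src_rate, dst_rate)
        end = ceil_div((k + 1) * src_rate, dst_rate)
        window = visemes[start:end]
        if window:
            out.append(Counter(window).most_common(1)[0][0])
        else:
            out.append(out[-1] if out else None)
    return out


# Hypothetical input: 10 audio-rate labels (100 per second) mapped to 30 fps.
audio_rate_visemes = ["a"] * 4 + ["o"] * 3 + ["m"] * 3
video_rate_visemes = retime_visemes(audio_rate_visemes, 100, 30)
print(video_rate_visemes)
```

Each label in the resulting video-rate sequence would then select one deformation of the wire-frame model for the corresponding video frame. Integer arithmetic is used for the interval boundaries so that they do not drift through floating-point rounding.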
Publication No. EP0710929(A2) Publication Date 1996.05.08
Application No. EP19950307884 Filing Date 1995.11.06
Applicant AT&T CORP. Inventors CHEN, HOMER H.;CHOU, WU
Classification G10L15/00;G06T11/60;G06T13/20;G06T13/40;H04N7/26;H04N7/52;(IPC1-7):G06T15/70 Main Classification G10L15/00
Agency Agent
Principal Claim
Address