发明名称 Method and apparatus for synthesizing realistic animations of a human speaking using a computer
摘要 A method and apparatus for synthesizing speech or facial movements to match selected speech sequences. A videotape of an arbitrary text sequence is obtained including a plurality of images of a user speaking various sequences. Video images corresponding to specific spoken phonemes are obtained. A video frame is digitized from that sequence which represents the extreme of mouth motion and shape. This is used to create a database of images of different facial positions relative to spoken phonemes and diphthongs. An audio speech sequence is then used as the element to which a video sequence will be matched. The audio sequence is analyzed to determine spoken phoneme sequences and relative timings. The database is used to obtain images for each of these phonemes and these times, and morphing techniques are used to create transitions between the images. Different parts of the images can be processed in different ways to make a more realistic speech pattern.
申请公布号 AU4411596(A) 申请公布日期 1996.06.19
申请号 AU19960044115 申请日期 1995.11.30
申请人 CALIFORNIA INSTITUTE OF TECHNOLOGY 发明人 KENNETH C SCOTT;MATTHEW C YEATES;DAVID S KAGELS;STEPHEN HILARY WATSON
分类号 G06T13/20;G06T13/40;G10L13/04;G10L21/06 主分类号 G06T13/20
代理机构 代理人
主权项
地址