发明名称 PHOTO-REALISTIC SYNTHESIS OF THREE DIMENSIONAL ANIMATION WITH FACIAL FEATURES SYNCHRONIZED WITH SPEECH
摘要 Dynamic texture mapping is used to create a photorealistic three dimensional animation of an individual with facial features synchronized with desired speech. Audiovisual data of an individual reading a known script is obtained and stored in an audio library and an image library. The audiovisual data is processed to extract feature vectors used to train a statistical model. An input audio feature vector corresponding to desired speech with which the animation will be synchronized is provided. The statistical model is used to generate a trajectory of visual feature vectors that corresponds to the input audio feature vector. These visual feature vectors are used to identify a matching image sequence from the image library. The resulting sequence of images, concatenated from the image library, provides a photorealistic image sequence with facial features, such as lip movements, synchronized with the desired speech. This image sequence is applied to the three-dimensional model.
申请公布号 US2012280974(A1) 申请公布日期 2012.11.08
申请号 US201113099387 申请日期 2011.05.03
申请人 WANG LIJUAN;SOONG FRANK;HUO QIANG;ZHANG ZHENGYOU;MICROSOFT CORPORATION 发明人 WANG LIJUAN;SOONG FRANK;HUO QIANG;ZHANG ZHENGYOU
分类号 G06T13/40;G06T15/00 主分类号 G06T13/40
代理机构 代理人
主权项
地址