Title of invention: Video generation based on text
Abstract: Techniques for generating a video sequence of a person based on a text sequence are disclosed herein. Based on the received text sequence, a processing device generates the video sequence of the person to simulate visual and audible emotional expressions of the person, including using an audio model of the person's voice to generate an audio portion of the video sequence. The emotional expressions in the visual portion of the video sequence are simulated based on a priori knowledge about the person. For instance, the a priori knowledge can include photos or videos of the person captured in real life.
Publication number: US9082400(B2) Publication date: 2015.07.14
Application number: US201213464915 Filing date: 2012.05.04
Applicant: SEYYER, INC. Inventors: Rezvani Behrooz; Rouhi Ali
Classification: G10L13/00; G10L13/08; G06T13/40; H04M1/725; G10L13/10 Main classification: G10L13/00
Agency: Perkins Coie LLP Agent: Perkins Coie LLP
Main claim: 1. A method comprising: inputting a text sequence at a processing device; and generating, by the processing device, a video sequence of a person based on the text sequence to simulate visual and audible emotional expressions of the person, including using an audio model of the person's voice to generate an audio portion of the video sequence, said generating being based on a machine learning analysis of a real life video sequence of the person.
Address: San Ramon CA US