发明名称 Coarticulation method for audio-visual text-to-speech synthesis
摘要 A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an input stimulus. The processor reads, based on the first data. second data comprising images of a noise-producing entity. The processor generates an animated sequence of the noise-producing entity.
申请公布号 US7630897(B2) 申请公布日期 2009.12.08
申请号 US20080123154 申请日期 2008.05.19
申请人 AT&T INTELLECTUAL PROPERTY II, L.P. 发明人 COSATTO ERIC;GRAF HANS PETER;SCHROETER JUERGEN
分类号 G10L13/00;G10L13/04;G10L21/06 主分类号 G10L13/00
代理机构 代理人
主权项
地址