<p>An audio synthesis device capable of embedding additional information that cannot be altered into synthesized audio, without degrading audio quality or restricting bandwidth, includes: a language processing unit (201) that generates the synthesized-audio generation information required to create synthesized audio from a character string; a prosody generation unit (202) that generates prosody information for the audio from the synthesized-audio generation information; and a waveform generation unit (203) that synthesizes audio from the prosody information. The prosody generation unit (202) embeds code information as watermark information into the prosody information, within a region of predetermined time width that contains a phoneme boundary and does not exceed the phoneme length.</p>
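<p>The embedding step can be illustrated with a minimal sketch. It is not the method claimed in the application: it assumes the prosody information is a per-frame F0 contour in Hz, that phonemes are contiguous frame ranges, and that one bit is written per internal phoneme boundary as a small upward or downward F0 offset. The names <code>Phoneme</code>, <code>embed_bits</code>, <code>extract_bits</code>, <code>half_width</code> and <code>delta_hz</code> are hypothetical, and extraction here compares against the unmarked contour, which is a simplification.</p>

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Phoneme:
    symbol: str
    start: int  # first frame index (inclusive)
    end: int    # end frame index (exclusive); assumed equal to the next phoneme's start


def embed_bits(f0: List[float], phonemes: List[Phoneme], bits: List[int],
               half_width: int = 2, delta_hz: float = 1.0) -> List[float]:
    """Return a copy of f0 with one bit embedded around each internal phoneme boundary.

    Assumption: bit 1 raises F0 by delta_hz, bit 0 lowers it, inside a window
    of 2 * half_width frames centred on the boundary.
    """
    shortest = min(p.end - p.start for p in phonemes)
    if 2 * half_width > shortest:
        # Keep the window no wider than a phoneme, as the abstract requires.
        raise ValueError("window wider than the shortest phoneme")

    out = list(f0)
    boundaries = [p.end for p in phonemes[:-1]]
    for boundary, bit in zip(boundaries, bits):
        sign = 1.0 if bit else -1.0
        for i in range(boundary - half_width, boundary + half_width):
            out[i] += sign * delta_hz
    return out


def extract_bits(marked: List[float], reference: List[float],
                 phonemes: List[Phoneme], half_width: int = 2) -> List[int]:
    """Recover one bit per internal boundary by comparing with the unmarked contour."""
    bits = []
    for p in phonemes[:-1]:
        window = range(p.end - half_width, p.end + half_width)
        diff = sum(marked[i] - reference[i] for i in window)
        bits.append(1 if diff > 0 else 0)
    return bits


# Hypothetical three-phoneme utterance with a flat 120 Hz contour.
phonemes = [Phoneme("k", 0, 6), Phoneme("a", 6, 14), Phoneme("t", 14, 20)]
f0 = [120.0] * 20
marked = embed_bits(f0, phonemes, bits=[1, 0])
recovered = extract_bits(marked, f0, phonemes)
```

<p>In this sketch the contour is changed only in the frames next to each boundary, so the rest of the prosody is left as generated. A practical detector would presumably have to work without the unmarked reference.</p>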
Application publication number
WO2005119650(A1)
Application publication date
2005.12.15
Application number
WO2005JP06681
Filing date
2005.04.05
Applicant
MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD.;KATO, YUMIKO;KAMAI, TAKAHIRO