发明名称 Personalized text-to-speech synthesis and personalized speech feature extraction
摘要 A personalized text-to-speech synthesizing device includes: a personalized speech feature library creator, configured to recognize personalized speech features of a specific speaker by comparing a random speech fragment of the specific speaker with preset keywords, thereby to create a personalized speech feature library associated with the specific speaker, and store the personalized speech feature library in association with the specific speaker; and a text-to-speech synthesizer, configured to perform a speech synthesis of a text message from the specific speaker, based on the personalized speech feature library associated with the specific speaker and created by the personalized speech feature library creator, thereby to generate and output a speech fragment having pronunciation characteristics of the specific speaker. A personalized speech feature library of a specific speaker is established without a deliberate training process, and a text is synthesized into personalized speech with the speech characteristics of the speaker.
申请公布号 US8655659(B2) 申请公布日期 2014.02.18
申请号 US20100855119 申请日期 2010.08.12
申请人 WANG QINGFANG;HE SHOUCHUN;SONY CORPORATION;SONY MOBILE COMMUNICATIONS AB 发明人 WANG QINGFANG;HE SHOUCHUN
分类号 G10L15/00;G10L13/00;G10L13/02;G10L13/033;G10L21/00 主分类号 G10L15/00
代理机构 代理人
主权项
地址