System and method of synthetic voice generation and modification,申请号US201615049592-传众专利搜索

发明名称	System and method of synthetic voice generation and modification
摘要	Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating a synthetic voice. A system configured to practice the method combines a first database of a first text-to-speech voice and a second database of a second text-to-speech voice to generate a combined database, selects from the combined database, based on a policy, voice units of a phonetic category for the synthetic voice to yield selected voice units, and synthesizes speech based on the selected voice units. The system can synthesize speech without parameterizing the first text-to-speech voice and the second text-to-speech voice. A policy can define, for a particular phonetic category, from which text-to-speech voice to select voice units. The combined database can include multiple text-to-speech voices from different speakers. The combined database can include voices of a single speaker speaking in different styles. The combined database can include voices of different languages.
申请公布号	US9495954(B2)	申请公布日期	2016.11.15
申请号	US201615049592	申请日期	2016.02.22
申请人	AT&T Intellectual Property I, L.P.	发明人	Conkie Alistair D.;Syrdal Ann K.
分类号	G10L13/027;G10L13/047;G10L13/06;G10L13/04;H04B7/04;H04B7/06;H04W72/04;G10L25/63	主分类号	G10L13/027
代理机构		代理人
主权项	1. A method comprising: storing, in a database, voice data according to user emotions; identifying, from user speech, a user emotion; identifying, via at least one processor and according to the user emotion, a first portion of the voice data, wherein the first portion of the voice data comprises a first emotional content for a first speaker; identifying, via the at least one processor and according to the user emotion, a second portion of the voice data, wherein the second portion of the voice data comprises a second emotional content for a second speaker; and synthesizing synthesized speech using the first portion of the voice data and the second portion of the voice data.
地址	Atlanta GA US