发明名称 |
System and method of synthetic voice generation and modification |
摘要 |
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating a synthetic voice. A system configured to practice the method combines a first database of a first text-to-speech voice and a second database of a second text-to-speech voice to generate a combined database, selects from the combined database, based on a policy, voice units of a phonetic category for the synthetic voice to yield selected voice units, and synthesizes speech based on the selected voice units. The system can synthesize speech without parameterizing the first text-to-speech voice and the second text-to-speech voice. A policy can define, for a particular phonetic category, from which text-to-speech voice to select voice units. The combined database can include multiple text-to-speech voices from different speakers. The combined database can include voices of a single speaker speaking in different styles. The combined database can include voices of different languages. |
申请公布号 |
US9495954(B2) |
申请公布日期 |
2016.11.15 |
申请号 |
US201615049592 |
申请日期 |
2016.02.22 |
申请人 |
AT&T Intellectual Property I, L.P. |
发明人 |
Conkie Alistair D.;Syrdal Ann K. |
分类号 |
G10L13/027;G10L13/047;G10L13/06;G10L13/04;H04B7/04;H04B7/06;H04W72/04;G10L25/63 |
主分类号 |
G10L13/027 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method comprising:
storing, in a database, voice data according to user emotions; identifying, from user speech, a user emotion; identifying, via at least one processor and according to the user emotion, a first portion of the voice data, wherein the first portion of the voice data comprises a first emotional content for a first speaker; identifying, via the at least one processor and according to the user emotion, a second portion of the voice data, wherein the second portion of the voice data comprises a second emotional content for a second speaker; and synthesizing synthesized speech using the first portion of the voice data and the second portion of the voice data. |
地址 |
Atlanta GA US |