Title of Invention |
SYSTEM AND METHOD FOR GENERATING CUSTOMIZED TEXT-TO-SPEECH VOICES |
Abstract |
A system and method are disclosed for generating customized text-to-speech (TTS) voices for a particular application. The method comprises selecting a voice on which to base a custom TTS voice associated with a domain; collecting text data associated with the domain from a pre-existing text data source; and, using the collected text data, generating an in-domain inventory of synthesis speech units, either by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units or by recording the minimal inventory for a selected level of synthesis quality. The custom TTS voice for the domain is then generated from the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases, so that only a few minutes of recorded data are necessary to deliver a high-quality custom TTS voice. |
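The abstract does not say how the "minimal inventory for a selected level of synthesis quality" is chosen. One way to picture it is a greedy coverage selection over candidate sentences. The sketch below is an illustration under stated assumptions, not the patented procedure: `pick_recording_script` is a hypothetical name, whitespace-separated words stand in for real synthesis units such as diphones, and the fraction of distinct units covered stands in for the quality level.

```python
def pick_recording_script(candidates, target_coverage):
    """Greedily choose sentences to record until the requested fraction
    of distinct units in the candidate pool is covered.

    Assumption: a 'unit' is a lowercase word; a real system would use
    phonetic units. target_coverage is a float in [0, 1].
    """
    # Map each candidate sentence to the set of units it contains.
    unit_sets = {s: set(s.lower().split()) for s in candidates}
    all_units = set().union(*unit_sets.values())

    covered, script = set(), []
    while all_units and len(covered) / len(all_units) < target_coverage:
        # Pick the sentence that adds the most not-yet-covered units.
        best = max(unit_sets, key=lambda s: len(unit_sets[s] - covered))
        gain = unit_sets[best] - covered
        if not gain:  # nothing left to gain; stop early
            break
        script.append(best)
        covered |= gain

    coverage = len(covered) / len(all_units) if all_units else 1.0
    return script, coverage


if __name__ == "__main__":
    pool = ["check my balance", "pay my bill", "check my bill"]
    print(pick_recording_script(pool, 1.0))
```

Lowering `target_coverage` shortens the script to record, which mirrors the trade-off between recording effort and synthesis quality that the abstract describes.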
Application Publication Number |
US2016093287(A1) |
Application Publication Date |
2016.03.31 |
Application Number |
US201514965251 |
Filing Date |
2015.12.10 |
Applicant |
AT&T Intellectual Property II, L.P. |
Inventors |
BANGALORE Srinivas;FENG Junlan;GILBERT Mazin;SCHROETER Juergen;SYRDAL Ann K.;SCHULZ David |
Classification Numbers |
G10L13/033;G10L15/197 |
Main Classification Number |
G10L13/033 |
Agency |
|
Agent |
|
Main Claim |
1. A method comprising:
receiving a selection of an animated character to guide a user on a website;
collecting text data from a pre-existing text data source, to yield collected text data, wherein the text data is associated with a domain of the website;
selecting synthesis speech units specific to the domain from a pre-existing inventory of synthesis speech units using the collected text data;
caching the synthesis speech units specific to the domain as an in-domain inventory of synthesis speech units; and
generating, via a processor, a custom text-to-speech voice for a specific task in the domain utilizing the in-domain inventory of synthesis speech units, wherein the animated character will use the custom text-to-speech voice. |
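The selecting and caching steps of claim 1 can be pictured with a minimal sketch. Everything named here is an assumption made for illustration: `text_to_units` and `build_in_domain_inventory` are hypothetical helpers, lowercase words stand in for real synthesis speech units, and the pre-existing inventory is modeled as a plain dictionary from unit to audio bytes. The claim itself does not specify any of these details.

```python
from collections import Counter


def text_to_units(text):
    """Split text into toy units: lowercase words with punctuation stripped.
    Assumption: a real system would use phonetic units instead."""
    words = [w.strip(".,;:!?\"'()").lower() for w in text.split()]
    return [w for w in words if w]


def build_in_domain_inventory(domain_texts, full_inventory):
    """Select from the pre-existing inventory only the units that the
    collected domain text actually uses, and report units it lacks.

    Returns (in_domain_cache, missing_units).
    """
    needed = Counter()
    for text in domain_texts:
        needed.update(text_to_units(text))

    # The cached in-domain inventory: a subset of the full inventory.
    in_domain = {u: full_inventory[u] for u in needed if u in full_inventory}
    # Units the domain needs but the pre-existing inventory does not hold.
    missing = sorted(u for u in needed if u not in full_inventory)
    return in_domain, missing


if __name__ == "__main__":
    inventory = {"your": b"a", "balance": b"b", "is": b"c", "hello": b"d"}
    cache, missing = build_in_domain_inventory(["Your balance is due."], inventory)
    print(sorted(cache), missing)
```

The `missing` list is a design convenience in this sketch: it marks units that the domain text requires but the existing inventory cannot supply, which is where additional recording would be needed.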
Address |
Atlanta GA US |