Title: System and Method for Cloud-Based Text-to-Speech Web Services
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating speech. One variation of the method is from a server side, and another variation of the method is from a client side. The server side method, as implemented by a network-based automatic speech processing system, includes first receiving, from a network client independent of knowledge of internal operations of the system, a request to generate a text-to-speech voice. The request can include speech samples, transcriptions of the speech samples, and metadata describing the speech samples. The system extracts sound units from the speech samples based on the transcriptions and generates an interactive demonstration of the text-to-speech voice based on the sound units, the transcriptions, and the metadata, wherein the interactive demonstration hides a back end processing implementation from the network client. The system provides access to the interactive demonstration to the network client.
Publication number: US2015221298(A1)
Publication date: 2015.08.06
Application number: US201514684893
Filing date: 2015.04.13
Applicant: AT&T Intellectual Property I, L.P.
Inventors: BEUTNAGEL Mark Charles; CONKIE Alistair D.; KIM Yeon-Jun; SCHROETER Horst Juergen
Classification: G10L13/04
Main classification: G10L13/04
Agency:
Agent:
Main claim: 1. A method comprising: receiving, at a network-based automatic speech processing system and from a network client not having access to information of internal operations of the network-based automatic speech processing system, a request to generate a text-to-speech voice, the request comprising a transcription; extracting sound units from speech samples based on the transcription; generating a demonstration of the text-to-speech voice based only on the sound units and the transcriptions, wherein the text-to-speech voice is language agnostic; and providing access to the demonstration to the network client.
Address: Atlanta GA US
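
The server-side flow described in the abstract (receive a request with speech samples, transcriptions, and metadata; extract sound units; expose a demonstration while hiding the back end) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the names `VoiceRequest`, `SoundUnit`, `SpeechService`, `create_voice`, and `demo` do not appear in the record, and the even split of each sample across the words of its transcription is a toy stand-in, not the unit-extraction technique the patent covers.

```python
from dataclasses import dataclass, field
from typing import Dict, List
import uuid


@dataclass
class VoiceRequest:
    """Hypothetical client request: samples, transcriptions, metadata."""
    samples: List[List[float]]      # one list of audio amplitudes per sample
    transcriptions: List[str]       # one transcription per sample
    metadata: Dict[str, str] = field(default_factory=dict)


@dataclass
class SoundUnit:
    label: str
    audio: List[float]


def extract_sound_units(samples: List[List[float]],
                        transcriptions: List[str]) -> List[SoundUnit]:
    """Toy stand-in: split each sample evenly across the words of its transcription."""
    units: List[SoundUnit] = []
    for audio, text in zip(samples, transcriptions):
        words = text.split()
        if not words:
            continue
        step = len(audio) // len(words)
        for i, word in enumerate(words):
            # The last word takes whatever audio remains.
            end = (i + 1) * step if i < len(words) - 1 else len(audio)
            units.append(SoundUnit(word.lower(), audio[i * step:end]))
    return units


class SpeechService:
    """Hypothetical service; clients only ever see an opaque demo id."""

    def __init__(self) -> None:
        self._voices: Dict[str, dict] = {}  # internal state, never returned

    def create_voice(self, request: VoiceRequest) -> str:
        if len(request.samples) != len(request.transcriptions):
            raise ValueError("each speech sample needs exactly one transcription")
        units = extract_sound_units(request.samples, request.transcriptions)
        demo_id = uuid.uuid4().hex
        self._voices[demo_id] = {
            "units": {u.label: u.audio for u in units},
            "metadata": dict(request.metadata),
        }
        return demo_id

    def demo(self, demo_id: str, text: str) -> List[float]:
        """Concatenate stored units for known words; unknown words are skipped."""
        inventory = self._voices[demo_id]["units"]
        out: List[float] = []
        for word in text.lower().split():
            out.extend(inventory.get(word, []))
        return out
```

A client would call `create_voice` once, keep the returned id, and then call `demo` with arbitrary text. The id is the only thing that crosses the boundary, which mirrors the requirement that the network client has no knowledge of the system's internal operations.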