Title of invention: System and method for synthetically generated speech describing media content
Abstract: Disclosed herein are systems, methods, and computer-readable media for providing an automatic synthetically generated voice describing media content. The method comprises receiving one or more pieces of metadata for a primary media content, selecting at least one piece of metadata for output, and outputting the at least one piece of metadata as synthetically generated speech with the primary media content. Other aspects of the invention involve providing alternative output, outputting speech simultaneously with the primary media content, outputting speech during gaps in the primary media content, translating metadata into a foreign language, and tailoring voice, accent, and language to match the metadata and/or primary media content. A user may control output via a user interface, or output may be customized based on preferences in a user profile.
Publication number: US8831948(B2) Publication date: 2014.09.09
Application number: US200812134714 Filing date: 2008.06.06
Applicant: AT&T Intellectual Property I, L.P. Inventors: Roberts Linda;Nguyen Hong Thi;Schroeter Horst J.
Classification: G10L13/00;G10L13/08;G10L13/04 Main classification: G10L13/00
Agency: Agent:
Main claim: 1. A method comprising: receiving a metadata request from a user, wherein the metadata request is associated with a primary media content and comprises a gesture; selecting a piece of metadata for output to yield selected metadata, the selected metadata being responsive to the metadata request regarding the primary media content and received during presentation of the primary media content; and outputting, with the primary media content, the selected metadata as synthetically generated speech, the synthetically generated speech having an accent, wherein the accent is selected from a plurality of accents based on the selected metadata.
Address: Atlanta GA US
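
The three steps of claim 1 (receive a gesture-based metadata request during playback, select the responsive metadata, and choose an accent for the synthesized speech based on that metadata) might be sketched as follows. This is only an illustration under stated assumptions, not the patented implementation: the gesture names, the `origin` tag, the accent table, and every function name here are hypothetical, and the sketch stops at producing a speech request rather than calling any real text-to-speech engine.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical tables: the patent record does not specify gestures or accents.
GESTURE_TO_FIELD = {"swipe_up": "artist", "tap": "title"}
ACCENT_BY_ORIGIN = {"FR": "french", "GB": "british", "US": "american"}
DEFAULT_ACCENT = "american"


@dataclass
class MetadataRequest:
    gesture: str    # the gesture that forms part of the request
    media_id: str   # the primary media content the request refers to
    playing: bool   # True if received during presentation of the content


def select_metadata(request: MetadataRequest, catalog: dict) -> Optional[dict]:
    """Pick the piece of metadata that answers the request, or None."""
    if not request.playing:
        return None  # the claim covers requests made during presentation
    field = GESTURE_TO_FIELD.get(request.gesture)
    record = catalog.get(request.media_id)
    if field is None or record is None or field not in record:
        return None
    # Assumption: each selected item carries an origin tag used for the accent.
    return {"text": record[field], "origin": record.get("origin")}


def select_accent(selected: dict) -> str:
    """Choose one accent from several, based on the selected metadata."""
    return ACCENT_BY_ORIGIN.get(selected.get("origin"), DEFAULT_ACCENT)


def build_speech_request(request: MetadataRequest, catalog: dict) -> Optional[dict]:
    """Return what a speech synthesizer would be asked to say, and how."""
    selected = select_metadata(request, catalog)
    if selected is None:
        return None
    return {"text": selected["text"], "accent": select_accent(selected)}


catalog = {"song-1": {"title": "La Vie en rose", "artist": "Édith Piaf", "origin": "FR"}}
result = build_speech_request(MetadataRequest("swipe_up", "song-1", True), catalog)
```

In this sketch the accent lookup falls back to a default when the selected metadata carries no recognized origin, so a speech request is still produced for untagged content.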