发明名称 Auto-translation for multi user audio and video
摘要 The disclosed subject matter provides a system, computer readable storage medium, and a method providing an audio and textual transcript of a communication. A conferencing services may receive audio or audio visual signals from a plurality of different devices that receive voice communications from participants in a communication, such as a chat or teleconference. The audio signals representing voice (speech) communications input into respective different devices by the participants. A translation services server may receive over a separate communication channel the audio signals for translation into a second language. As managed by the translation services server, the audio signals may be converted into textual data. The textual data may be translated into text of different languages based the language preferences of the end user devices in the teleconference. The translated text may be further translated into audio signals.
申请公布号 US9110891(B2) 申请公布日期 2015.08.18
申请号 US201113316689 申请日期 2011.12.12
申请人 Google Inc. 发明人 Kristjansson Trausti;Huang John;Lin Yu-Kuan;Tyan Hung-ying;Uszkoreit Jakob David;Estelle Joshua James;Wang Chung-yi;Buryak Kirill;Konishi Yusuke
分类号 G06F17/28;G10L15/26 主分类号 G06F17/28
代理机构 Remarck Law Group PLC 代理人 Remarck Law Group PLC
主权项 1. A computer-implemented method, comprising: receiving, at a first end user device associated with a user, an audio data signal from a conferencing services server, the audio data signal representing a communication in a first spoken language received at a second end user device that is intended for the user; determining, at the first end user device, the first spoken language of the communication represented by the audio data signal by receiving a user preferences signal, the user preferences signal comprising an identifier embedded in the audio data signal received from the conferencing services server; determining, at the first end user device, language preferences of the user of the first end user device; comparing, at the first end user device, the determined first spoken language with the language preferences of the user of the at the first end user device; and when the determined first spoken language does not match the language preferences: establishing, at the first end user device, a communication channel with a translation services server,providing, from the first end user device, the audio data signal to the translation services server,providing, from the first end user device, the language preferences of the user to the translation services server, andreceiving, at the first end user device, a translated audio data signal from the translation services server, the translated audio data signal representing the communication in a second spoken language corresponding to the language preferences and different from the first spoken language.
地址 Mountain View CA US