Title of invention SYSTEM AND METHOD FOR TRANSLATING REAL-TIME SPEECH USING SEGMENTATION BASED ON CONJUNCTION LOCATIONS
Abstract A system, method and computer-readable storage device which balance latency and accuracy of machine translations by segmenting the speech upon locating a conjunction. The system, upon receiving speech, will buffer speech until a conjunction is detected. Upon detecting a conjunction, the speech received until that point is segmented. The system then continues performing speech recognition on the segment, searching for the next conjunction, while simultaneously initiating translation of the segment. Upon translating the segment, the system converts the translation to a speech output, allowing a user to hear an audible translation of the speech originally heard.
Publication number US2015134320(A1) Publication date 2015.05.14
Application number US201314080361 Filing date 2013.11.14
Applicant AT&T Intellectual Property I, L.P. Inventors RANGARAJAN SRIDHAR Vivek Kumar; BANGALORE Srinivas; CHEN John
Classification G06F17/28; G10L15/00 Main classification G06F17/28
Agency Agent
Main claim 1. A method comprising: receiving speech in a first language, the speech having no accompanying speech transcription; as the speech is being received, performing, via a processor, a speech recognition process until a conjunction is recognized by the speech recognition process; and upon identifying the conjunction: segmenting the speech by generating a speech segment, the speech segment comprising the speech from a first location in the speech to the conjunction; performing a translation of the speech segment from the first language to a second language, to yield a translated speech segment; generating translated speech using the translated speech segment; and outputting the translated speech.
Address Atlanta, GA, US
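
The segmentation step recited in claim 1 can be illustrated with a minimal Python sketch. It is not the applicant's implementation: it operates on already-recognized words rather than audio, and the conjunction list, the function names `segment_on_conjunctions` and `translate_stream`, and the `translate` callback are all hypothetical. It also assumes that a segment includes the conjunction that closes it and that any words left after the last conjunction form a final segment, neither of which the record specifies.

```python
from typing import Callable, Iterable, Iterator, List

# Assumed conjunction list; the record does not enumerate which words count.
CONJUNCTIONS = frozenset({"and", "but", "or", "so", "because"})


def segment_on_conjunctions(words: Iterable[str]) -> Iterator[List[str]]:
    """Buffer recognized words and cut a segment each time a conjunction arrives.

    Each segment runs from the previous cut up to and including the
    conjunction (an assumption about where the boundary falls).
    """
    buffer: List[str] = []
    for word in words:
        buffer.append(word)
        if word.lower() in CONJUNCTIONS:
            yield buffer
            buffer = []  # start a fresh buffer for the next segment
    if buffer:
        # Assumption: flush trailing words once the input ends.
        yield buffer


def translate_stream(
    words: Iterable[str], translate: Callable[[str], str]
) -> List[str]:
    """Translate each segment as soon as it is cut.

    `translate` is a hypothetical stand-in for a machine translation call;
    speech synthesis of the result is omitted.
    """
    return [translate(" ".join(seg)) for seg in segment_on_conjunctions(words)]
```

Because `segment_on_conjunctions` is a generator, a caller can hand each segment to translation while later words are still arriving, which is the latency benefit the abstract describes. Passing `str.upper` as `translate` is a convenient way to exercise the flow without a real translation backend.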