发明名称 Confidence-driven rewriting of source texts for improved translation
摘要 A method for rewriting source text includes receiving source text including a source text string in a first natural language. The source text string is translated with a machine translation system to generate a first target text string in a second natural language. A translation confidence for the source text string is computed, based on the first target text string. At least one alternative text string is generated, where possible, in the first natural language by automatically rewriting the source string. Each alternative string is translated to generate a second target text string in the second natural language. A translation confidence is computed for the alternative text string based on the second target string. Based on the computed translation confidences, one of the alternative text strings may be selected as a candidate replacement for the source text string and may be proposed to a user on a graphical user interface.
申请公布号 US2014358519(A1) 申请公布日期 2014.12.04
申请号 US201313908157 申请日期 2013.06.03
申请人 Xerox Corporation 发明人 Mirkin Shachar;Venkatapathy Sriram;Dymetman Marc
分类号 G06F17/28 主分类号 G06F17/28
代理机构 代理人
主权项 1. A method for rewriting source text, comprising: receiving source text comprising at least one source text string in a first natural language; with a processor, for each of the at least one source text string: translating the source text string with a machine translation system to generate a first target text string in a second natural language;computing a first translation confidence for the source text string based on at least one feature that is based on at least one of the source text string and the first target text string;providing for generating at least one alternative text string in the first natural language, the generating comprising automatically rewriting the source text string; andfor each of the at least one alternative text string: translating the alternative text string with the machine translation system to generate a second target text string in the second natural language; andcomputing a second translation confidence for the alternative text string based on at least one feature that is based on at least one of the alternative text string and the second target text string; andbased on the computed first and second translation confidences, providing for selecting one of the at least one alternative text strings as a replacement for the source text string in the source text.
地址 Norwalk CT US