发明名称 Sampling and optimization in phrase-based machine translation using an enriched language model representation
摘要 Rejection sampling is performed to acquire at least one target language translation for a source language string s in accordance with a phrase-based statistical translation model p(x) = p(t, a | s ) where t is a candidate translation, a is a candidate alignment comprising a biphrase sequence generating the candidate translation t, and x is a sequence representing the candidate alignment a. The rejection sampling uses a proposal distribution comprising a weighted finite state automaton (WFSA) q ( n ) that is refined responsive to rejection of a sample x * obtained in a current iteration of the rejection sampling to generate a refined WFSA q (n+1) for use in a next iteration of the rejection sampling. The refined WFSA q (n+1) is selected to satisfy the criteria p(x) ‰¤ q ( n +1) (x) ‰¤ q ( n ) (x) for all x ˆˆ X and q ( n +1) ( x *) < q ( n ) ( x *) where the space X is the set of sequences x corresponding to candidate alignments a that generate candidate translations t for the source language string s.
申请公布号 EP2759945(A2) 申请公布日期 2014.07.30
申请号 EP20140151745 申请日期 2014.01.20
申请人 XEROX CORPORATION 发明人 DYMETMAN, MARC;AZIZ, WILKER;VENKATAPATHY, SRIRAM
分类号 G06F17/28 主分类号 G06F17/28
代理机构 代理人
主权项
地址