Title of invention METHOD AND APPARATUS FOR BILINGUAL WORD ALIGNMENT, METHOD AND APPARATUS FOR TRAINING BILINGUAL WORD ALIGNMENT MODEL
Abstract The present invention provides a method and apparatus for bilingual word alignment, and a method and apparatus for training a bilingual word alignment model. The method for bilingual word alignment comprises: training a bilingual word alignment model using a word-aligned labeled bilingual corpus; word-aligning a plurality of bilingual sentence pairs in an unlabeled bilingual corpus using said bilingual word alignment model; determining whether the word alignment of each of said plurality of bilingual sentence pairs is correct, and if it is correct, adding the bilingual sentence pair to the labeled bilingual corpus and removing the bilingual sentence pair from the unlabeled bilingual corpus; retraining the bilingual word alignment model using the expanded labeled bilingual corpus; and re-word-aligning the remaining bilingual sentence pairs in the unlabeled bilingual corpus using the retrained bilingual word alignment model.
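The steps in the abstract can be sketched as one round of self-training. This is a minimal illustration only, not the patented implementation: the abstract does not say what the alignment model is or how correctness is determined, so `train`, `align`, `is_correct`, and `expand_and_retrain` are hypothetical names, the model is a toy link counter, and the correctness check is a placeholder rule.

```python
from collections import Counter

def train(labeled):
    """Toy model (assumption): count each (source word, target word) link seen."""
    counts = Counter()
    for (src, tgt), links in labeled:
        for i, j in links:
            counts[(src[i], tgt[j])] += 1
    return counts

def align(model, pair):
    """Link each source word to the target word it was most often linked to."""
    src, tgt = pair
    links = set()
    for i, s in enumerate(src):
        best = max(range(len(tgt)), key=lambda j: model[(s, tgt[j])], default=None)
        # Only keep links that the model has actually seen before.
        if best is not None and model[(s, tgt[best])] > 0:
            links.add((i, best))
    return links

def is_correct(pair, links):
    """Placeholder check (assumption): every source word received a link."""
    return len({i for i, _ in links}) == len(pair[0])

def expand_and_retrain(labeled, unlabeled):
    """One round: train, align, move accepted pairs, retrain, re-align the rest."""
    labeled = list(labeled)
    model = train(labeled)
    remaining = []
    for pair in unlabeled:
        links = align(model, pair)
        if is_correct(pair, links):
            labeled.append((pair, links))   # add to the labeled corpus
        else:
            remaining.append(pair)          # stays in the unlabeled corpus
    model = train(labeled)                  # retrain on the expanded corpus
    realigned = [(pair, align(model, pair)) for pair in remaining]
    return model, labeled, realigned
```

A sentence pair here is a tuple of two token lists, and an alignment is a set of `(source index, target index)` links. In practice the toy counter and the placeholder check would be replaced by a real alignment model and a real acceptance criterion.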
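Publication number US2007203689(A1) Publication date 2007.08.30
Application number US20070678364 Filing date 2007.02.23
Applicant KABUSHIKI KAISHA TOSHIBA Inventors WU HUA;WANG HAIFENG;LIU ZHANYI
Classification G06F17/28 Main classification G06F17/28
Agency Agent
Principal claim
Address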