Title of Invention |
DISCOVERY OF PARALLEL TEXT PORTIONS IN COMPARABLE COLLECTIONS OF CORPORA AND TRAINING USING COMPARABLE TEXTS |
Abstract |
A translation training device extracts a set of parallel sentences from two nonparallel corpora. The system finds parameters between different sentences or phrases in order to identify parallel sentences. The parallel sentences are then used to train a data-driven machine translation system. The process can be repeated until sufficient data is collected or until the performance of the translation system stops improving. |
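The extract-then-retrain loop described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the word-overlap score standing in for the "parameters" between sentences, the position-based lexicon update standing in for training a translation system, the stopping rule (stop when a round yields no new pairs), and all function names.

```python
from typing import Dict, List, Set, Tuple

Pair = Tuple[str, str]
Lexicon = Dict[str, Set[str]]


def overlap_score(src: str, tgt: str, lexicon: Lexicon) -> float:
    """Fraction of source words that have a lexicon translation in the target."""
    src_words = src.split()
    tgt_words = set(tgt.split())
    if not src_words:
        return 0.0
    hits = sum(1 for w in src_words if lexicon.get(w, set()) & tgt_words)
    return hits / len(src_words)


def extract_parallel(src_corpus: List[str], tgt_corpus: List[str],
                     lexicon: Lexicon, threshold: float) -> List[Pair]:
    """Pair each source sentence with its best-scoring target, if good enough."""
    pairs = []
    for s in src_corpus:
        best = max(tgt_corpus, key=lambda t: overlap_score(s, t, lexicon),
                   default=None)
        if best is not None and overlap_score(s, best, lexicon) >= threshold:
            pairs.append((s, best))
    return pairs


def update_lexicon(lexicon: Lexicon, pairs: Set[Pair]) -> Lexicon:
    """Toy stand-in for training: align equal-length pairs word by position."""
    new = {k: set(v) for k, v in lexicon.items()}
    for s, t in pairs:
        sw, tw = s.split(), t.split()
        if len(sw) == len(tw):
            for a, b in zip(sw, tw):
                new.setdefault(a, set()).add(b)
    return new


def bootstrap(src_corpus: List[str], tgt_corpus: List[str],
              seed_lexicon: Lexicon, threshold: float = 0.6,
              max_rounds: int = 5) -> Tuple[List[Pair], Lexicon]:
    """Alternate extraction and retraining until no new pairs appear."""
    lexicon = seed_lexicon
    found: Set[Pair] = set()
    for _ in range(max_rounds):
        pairs = set(extract_parallel(src_corpus, tgt_corpus, lexicon, threshold))
        if pairs <= found:  # assumed stopping rule: nothing new was extracted
            break
        found |= pairs
        lexicon = update_lexicon(lexicon, found)
    return sorted(found), lexicon
```

In this sketch a sentence pair that scores below the threshold in the first round can be picked up in a later round, once the lexicon has grown from the pairs already extracted. That is the reason for repeating the process.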
Publication Number |
WO2005094509(A3) |
Publication Date |
2007.10.11 |
Application Number |
WO2005US09770 |
Filing Date |
2005.03.23 |
Applicant |
UNIVERSITY OF SOUTHERN CALIFORNIA;MUNTEANU, DRAGOS, STEFAN;MARCU, DANIEL |
Inventor |
MUNTEANU, DRAGOS, STEFAN;MARCU, DANIEL |
Classification Number |
G06F17/28;G06F17/20;G06F17/21 |
Main Classification Number |
G06F17/28 |
Patent Agency |
|
Patent Agent |
|
Principal Claim |
|
Address |
|