发明名称 |
SYSTEMS AND METHODS FOR IDENTIFYING PARALLEL DOCUMENTS AND SENTENCE FRAGMENTS IN MULTILINGUAL DOCUMENT COLLECTIONS |
摘要 |
<p>Systems, computer programs, and methods for identifying parallel documents and/or fragments in a bilingual collection are provided. The method for identifying parallel sub-sentential fragments in a bilingual collection comprises translating a source document from a bilingual collection. The method further includes querying a target library associated with the bilingual collection using the translated source document, and identifying one or more target documents based on the query. Subsequently, a source sentence associated with the source document is aligned to one or more target sentences associated with the one or more target documents. Finally, the method includes determining whether a source fragment associated with the source sentence comprises a parallel translation of a target fragment associated with the one or more target sentences.</p> |
申请公布号 |
WO2007126447(A3) |
申请公布日期 |
2008.04.17 |
申请号 |
WO2007US00853 |
申请日期 |
2007.01.12 |
申请人 |
UNIVERSITY OF SOUTHERN CALIFORNIA;MARCU, DANIEL;MUNTEANU, DRAGOS, STEFAN |
发明人 |
MARCU, DANIEL;MUNTEANU, DRAGOS, STEFAN |
分类号 |
G06F7/00 |
主分类号 |
G06F7/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|