发明名称 |
Statistical translation system and method for fast sense disambiguation and translation of large corpora using fertility models and sense models |
摘要 |
A system and method for translating a series of source words in a first language to a series of target words in a second language is provided. The system includes an input device for inputting the series of source words. A fertility hypothesis generator operatively coupled to the input device generates at least one fertility hypotheses for a fertility of a source word, based on the source word and a context of the source word. A sense hypothesis generator operatively coupled to the input device generates sense hypotheses for a translation of the source word, based on the source word and the context of the source word. A fertility model operatively coupled to the fertility hypothesis generator determines a probability of the fertility of the source word, based on the source word and the context of the source word. A sense model operatively coupled to the sense hypothesis generator determines a probability of a target word being a correct translation of the source word, based on the source word and the context of the source word. A decoder operatively coupled to the fertility and sense models for generating a list of target words for the translation of the source word, based on the probability calculated by the fertility model and the probability calculated by the sense model.
|
申请公布号 |
US6092034(A) |
申请公布日期 |
2000.07.18 |
申请号 |
US19980123166 |
申请日期 |
1998.07.27 |
申请人 |
INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
MCCARLEY, JEFFREY SCOTT;ROUKOS, SALIM |
分类号 |
G06F17/27;G06F17/28;(IPC1-7):G06F17/28 |
主分类号 |
G06F17/27 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|