发明名称 Statistical translation system and method for fast sense disambiguation and translation of large corpora using fertility models and sense models
摘要 A system and method for translating a series of source words in a first language to a series of target words in a second language is provided. The system includes an input device for inputting the series of source words. A fertility hypothesis generator operatively coupled to the input device generates at least one fertility hypotheses for a fertility of a source word, based on the source word and a context of the source word. A sense hypothesis generator operatively coupled to the input device generates sense hypotheses for a translation of the source word, based on the source word and the context of the source word. A fertility model operatively coupled to the fertility hypothesis generator determines a probability of the fertility of the source word, based on the source word and the context of the source word. A sense model operatively coupled to the sense hypothesis generator determines a probability of a target word being a correct translation of the source word, based on the source word and the context of the source word. A decoder operatively coupled to the fertility and sense models for generating a list of target words for the translation of the source word, based on the probability calculated by the fertility model and the probability calculated by the sense model.
申请公布号 US6092034(A) 申请公布日期 2000.07.18
申请号 US19980123166 申请日期 1998.07.27
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 MCCARLEY, JEFFREY SCOTT;ROUKOS, SALIM
分类号 G06F17/27;G06F17/28;(IPC1-7):G06F17/28 主分类号 G06F17/27
代理机构 代理人
主权项
地址