Title of invention COMBINED TRANSLATION MODEL FORMING DEVICE, TEXT CLUSTERING DEVICE, AND METHODS AND PROGRAM THEREFOR
Abstract PROBLEM TO BE SOLVED: To form a high-accuracy translation model, and to enable proper clustering, even when the translated texts in a set of semantically corresponding texts are not sufficiently similar to each other. SOLUTION: A semantic correspondence replacement unit 2 forms a semantic correspondence replacement text group set 1' by mutually replacing the semantic correspondences in a semantically corresponding text group set 1. A translation probability calculation unit 3 calculates word-to-word translation probabilities in the semantically corresponding text group set 1 and in the semantic correspondence replacement text group set 1', forming a correspondence non-replacement translation model 4 and a correspondence replacement translation model 5, respectively. A translation probability composition unit 6 calculates, from the translation models 4 and 5, the probability that each word in the semantically corresponding text group set 1 is translated into another word, forming a combined translation model 7. The combined translation model 7 is used to cluster a group of input texts. COPYRIGHT: (C)2011,JPO&INPIT
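The pipeline in the abstract (swap the correspondences, estimate word-to-word translation probabilities on both sets, then compose them) can be illustrated with a minimal Python sketch. The abstract does not say how unit 3 estimates the probabilities or how unit 6 composes them, so the IBM Model 1 EM estimator and the linear interpolation used here are assumptions, and the names `train_model1`, `swap_pairs`, `combine`, and `weight` are hypothetical. The clustering step is not shown.

```python
from collections import defaultdict


def train_model1(pairs, iterations=10):
    """Estimate t(target_word | source_word) with IBM Model 1 EM.

    ASSUMPTION: the abstract does not name an estimator; Model 1 is only
    one plausible way to obtain word-to-word translation probabilities.
    Keys of the returned dict are (target_word, source_word).
    """
    src_vocab = {w for src, _ in pairs for w in src}
    tgt_vocab = {w for _, tgt in pairs for w in tgt}
    uniform = 1.0 / len(tgt_vocab)
    t = {(tw, sw): uniform for sw in src_vocab for tw in tgt_vocab}
    for _ in range(iterations):
        count = defaultdict(float)
        total = defaultdict(float)
        for src, tgt in pairs:
            for tw in tgt:
                # E-step: share each target word among the source words.
                norm = sum(t[(tw, sw)] for sw in src)
                for sw in src:
                    frac = t[(tw, sw)] / norm
                    count[(tw, sw)] += frac
                    total[sw] += frac
        # M-step: renormalise the expected counts per source word.
        t = {
            (tw, sw): (count[(tw, sw)] / total[sw] if total[sw] > 0 else 0.0)
            for (tw, sw) in t
        }
    return t


def swap_pairs(pairs):
    """Mutually replace the two sides of every pair (set 1 -> set 1')."""
    return [(tgt, src) for src, tgt in pairs]


def combine(non_replaced, replaced, weight=0.5):
    """Compose two models by linear interpolation.

    ASSUMPTION: the composition rule is not given in the abstract;
    interpolation with a hypothetical `weight` is used for illustration.
    """
    keys = set(non_replaced) | set(replaced)
    return {
        k: weight * non_replaced.get(k, 0.0) + (1.0 - weight) * replaced.get(k, 0.0)
        for k in keys
    }


# Toy semantically corresponding text pairs (invented for illustration).
pairs = [
    ("pc does not boot".split(), "pc will not start".split()),
    ("pc does not start".split(), "pc will not boot".split()),
]
model_4 = train_model1(pairs)              # correspondence non-replacement model
model_5 = train_model1(swap_pairs(pairs))  # correspondence replacement model
model_7 = combine(model_4, model_5)        # combined model
print(model_7[("start", "boot")])
```

In this sketch both models share one key convention, `(translated_word, original_word)`, so a word pair observed in either direction of the corpus receives probability mass in the combined model even if one of the two models never saw it.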
Publication number JP2010267200(A) Publication date 2010.11.25
Application number JP20090119886 Filing date 2009.05.18
Applicant NIPPON TELEGR & TELEPH CORP <NTT> Inventor NISHIKAWA HITOSHI;HASEGAWA TAKAAKI;IMAMURA KENJI
Classification G06F17/28;G06F17/30 Main classification G06F17/28
Agency Agent
Main claim
Address