发明名称 |
COMBINED TRANSLATION MODEL FORMING DEVICE, TEXT CLUSTERING DEVICE, AND METHODS AND PROGRAM THEREFOR |
摘要 |
<p><P>PROBLEM TO BE SOLVED: To form a high-accuracy translation model even when translated texts in a set of texts having a semantic correspondence are not sufficiently similar to each other, and enable proper clustering. <P>SOLUTION: A semantic correspondence replacement unit 2 forms a semantic correspondence replacement text group set 1' by mutually replacing semantic correspondences in a semantically corresponding text group set 1, a translation probability calculation unit 3 calculates word-to-word translation probabilities in the semantically corresponding text group set 1 and the semantic correspondence replacement text group set 1', to form a correspondence non-replacement translation model 4 and a correspondence replacement translation model 5. A translation probability composition unit 6 calculates a translation probability that each word in the semantically corresponding text group set 1 is translated to the other word from the translation models 4 and 5 to form a combined translation model 7. The combined translation model 7 is used to cluster a group of input texts. <P>COPYRIGHT: (C)2011,JPO&INPIT</p> |
申请公布号 |
JP2010267200(A) |
申请公布日期 |
2010.11.25 |
申请号 |
JP20090119886 |
申请日期 |
2009.05.18 |
申请人 |
NIPPON TELEGR & TELEPH CORP <NTT> |
发明人 |
NISHIKAWA HITOSHI;HASEGAWA TAKAAKI;IMAMURA KENJI |
分类号 |
G06F17/28;G06F17/30 |
主分类号 |
G06F17/28 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|