发明名称 METHOD AND APPARATUS FOR TRAINING TARGET LANGUAGE WORD INFLECTION MODEL BASED ON BILINGUAL CORPUS, TLWI METHOD AND APPARATUS, AND TRANSLATION METHOD AND SYSTEM FOR TRANSLATING SOURCE LANGUAGE TEXT INTO TARGET LANGUAGE
摘要 <p><P>PROBLEM TO BE SOLVED: To provide a method and an apparatus for constructing a target language word inflection model (TLWI model) which can improve translation precision when translating into a target language having word inflections. <P>SOLUTION: A word string obtained by adding a part of speech to the base form of each word in a source language corpus is generated for a corpus pair of a source language corpus and a target language corpus, pre-processing for adding a part of speech to the base form of each word in the target language corpus to generate a word string with the part of speech added thereto is performed, a word C of the source language associated with a word W whose word form in the target language is inflected is obtained on the basis of word association information obtained by making a word in the pre-processed source language corpus associate with a word in a corresponding pre-processed target language corpus, and a pattern containing the word inflection information (TLWI information) of a word W of the target language is generated on the basis of a combination of the word W of the target language, the word C of the source language and words existing around the word C in un-pre-processed source language corpus. <P>COPYRIGHT: (C)2009,JPO&INPIT</p>
申请公布号 JP2009140499(A) 申请公布日期 2009.06.25
申请号 JP20080308753 申请日期 2008.12.03
申请人 TOSHIBA CORP 发明人 LIU ZHANYI;WAN HAIFEN;WU HUA
分类号 G06F17/28 主分类号 G06F17/28
代理机构 代理人
主权项
地址