发明名称 THESAURUS CONSTRUCTION SYSTEM, THESAURUS CONSTRUCTION METHOD, PROGRAM FOR EXECUTING THE METHOD, AND STORAGE MEDIUM WITH THE PROGRAM STORED THEREON
摘要 <P>PROBLEM TO BE SOLVED: To provide a thesaurus construction technique that can generate precise word clusters and construct a thesaurus by ensuring an affinity between data for thesaurus construction and processed text and reflecting modification relations in the processed text. <P>SOLUTION: A thesaurus construction system for automatically clustering words in object text to construct a thesaurus of the object text comprises a linguistic analysis part 2 for executing a linguistic analysis including a modification analysis for generating clauses and identifying modification relations between the clauses, a text data structure generation part 4 for using the results of the modification analysis to generate a data structure having clause information including notation, part-of-speech and modification information about component words, a text data structure storage part 5 for storing the text data structure, a word cluster generation part 7 for generating word clusters according to linguistic elements extracted from the stored text data structure, and a thesaurus generation part 8 for identifying relations between the word clusters according to the linguistic elements and constructing a thesaurus using the relations. <P>COPYRIGHT: (C)2005,JPO&NCIPI
申请公布号 JP2005025555(A) 申请公布日期 2005.01.27
申请号 JP20030191036 申请日期 2003.07.03
申请人 RICOH CO LTD 发明人 SATO NAOKO
分类号 G06F17/30;G06F17/28 主分类号 G06F17/30
代理机构 代理人
主权项
地址
您可能感兴趣的专利