Title of Invention INFORMATION EXTRACTION DEVICE
Abstract <p>PROBLEM TO BE SOLVED: To provide an information extraction device that not only extracts subjects from plural documents but also extracts various pieces of information for using those subjects effectively. SOLUTION: A word analysis part 2 and a subject analysis part 4 detect the temporal distribution of specified words contained in the plural documents held by a document data base 1, based on the update date/time of the documents, and extract a word whose intensity of distribution is high as a subject word. A subject category analysis part 6 extracts another word contained in the same document as the subject word as a category word. A subject category storage part 7 classifies and manages the subject words by using the category words. A subject evaluation analysis part 9 detects a keyword which is contained in the same document as the subject word and which is similar to one held by an evaluation keyword storage part 8, and a subject evaluation storage part manages the subject word and the detected keyword in association with each other. Thus, plural subject words can be extracted and associated with one another through the category words, and keywords expressing an evaluation of a subject word can also be extracted.</p>
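The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the toy corpus, all function names, the `min_peak` threshold, and the choice of "largest number of documents on a single update date" as the intensity measure are assumptions, since the abstract does not define how intensity is computed. Keyword similarity is likewise simplified here to a case-insensitive exact match.

```python
from collections import Counter, defaultdict
from datetime import date

# Hypothetical toy corpus: (update date, words contained in the document).
DOCS = [
    (date(1997, 6, 1), ["printer", "jam", "bad"]),
    (date(1997, 6, 1), ["printer", "toner", "good"]),
    (date(1997, 6, 1), ["printer", "jam"]),
    (date(1997, 6, 2), ["meeting", "schedule"]),
    (date(1997, 6, 5), ["meeting", "room"]),
    (date(1997, 6, 9), ["meeting", "agenda"]),
]

# Hypothetical contents of the evaluation keyword store.
EVALUATION_KEYWORDS = {"good", "bad"}


def temporal_distribution(docs):
    """For each word, count the documents mentioning it per update date."""
    dist = defaultdict(Counter)
    for updated, words in docs:
        for word in set(words):
            dist[word][updated] += 1
    return dist


def subject_words(docs, min_peak=3):
    """Pick words whose distribution is concentrated in time.

    Assumption: intensity is the largest single-date document count.
    """
    dist = temporal_distribution(docs)
    return sorted(w for w, per_day in dist.items()
                  if max(per_day.values()) >= min_peak)


def category_words(docs, subject):
    """Other words that appear in the same documents as the subject word."""
    found = set()
    for _, words in docs:
        if subject in words:
            found.update(w for w in words if w != subject)
    return sorted(found)


def evaluation_keywords(docs, subject, known=EVALUATION_KEYWORDS):
    """Co-occurring words that match a stored evaluation keyword.

    Assumption: similarity is reduced to a case-insensitive exact match.
    """
    known_lower = {k.lower() for k in known}
    return [w for w in category_words(docs, subject)
            if w.lower() in known_lower]


if __name__ == "__main__":
    for subject in subject_words(DOCS):
        print(subject, category_words(DOCS, subject),
              evaluation_keywords(DOCS, subject))
```

In this toy corpus, "printer" is concentrated on a single update date and is selected as a subject word, while "meeting" is spread thinly over several dates and is not. A real system would need a more careful intensity measure and a genuine similarity test for evaluation keywords.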
Publication Number JPH10340275(A) Publication Date 1998.12.22
Application Number JP19970166516 Filing Date 1997.06.09
Applicant FUJI XEROX CO LTD Inventors HAYASHI NAOKI;TANAKA TAKESHI;MUNAKATA HIDEAKI
Classification G06F17/30;G06F19/00;(IPC1-7):G06F17/30 Main Classification G06F17/30
Agency Agent
Main Claim
Address