Title of Invention |
INFORMATION EXTRACTING SYSTEM, PROGRAM AND METHOD, AND DOCUMENT EXTRACTING SYSTEM, PROGRAM AND METHOD |
Abstract |
PROBLEM TO BE SOLVED: To provide an information extracting system that can extract information in line with a user's requirement to exclude overlapping content, and that reduces the cost of the work involved. SOLUTION: The information extracting system is provided with a document data registration DB 12; a similarity computing section 14, which computes the similarity among the document data in the document data registration DB 12; a document data classifying section 16, which hierarchically classifies that document data based on the similarity computed by the similarity computing section 14; and a document data extracting section 20, which extracts document data from the document data groups classified by the document data classifying section 16 based on a prescribed value and a prescribed classification rule. COPYRIGHT: (C)2005,JPO&NCIPI
|
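The pipeline named in the abstract (similarity computation, hierarchical classification, extraction) can be illustrated with a minimal Python sketch. This is not the patented implementation: the abstract does not specify the similarity measure, the clustering method, or the classification rule. The sketch assumes bag-of-words cosine similarity, single-link agglomerative merging that stops at a threshold (standing in for the "prescribed value"), and a rule that keeps the first document of each group (standing in for the "prescribed classification rule"). All function names are hypothetical.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two term-count vectors (assumed measure)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_matrix(docs):
    """Pairwise similarity among all documents (role of section 14)."""
    vecs = [Counter(d.lower().split()) for d in docs]
    n = len(vecs)
    return [[cosine(vecs[i], vecs[j]) for j in range(n)] for i in range(n)]


def classify(sim, threshold):
    """Single-link agglomerative grouping (role of section 16, assumed method).

    Repeatedly merges the two most similar groups and stops once the best
    remaining similarity falls below the threshold.
    """
    groups = [[i] for i in range(len(sim))]
    while len(groups) > 1:
        best, pair = -1.0, None
        for x in range(len(groups)):
            for y in range(x + 1, len(groups)):
                s = max(sim[i][j] for i in groups[x] for j in groups[y])
                if s > best:
                    best, pair = s, (x, y)
        if best < threshold:
            break
        x, y = pair
        groups[x] = sorted(groups[x] + groups[y])
        del groups[y]
    return groups


def extract(groups):
    """Keep one representative per group (role of section 20, assumed rule)."""
    return [g[0] for g in groups]
```

Calling `extract(classify(similarity_matrix(docs), threshold))` on a list of strings returns the indices of the documents kept after near-duplicates are grouped together. Raising the threshold keeps more documents; lowering it removes more overlap.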
Application Publication Number |
JP2004318527(A) |
Application Publication Date |
2004.11.11 |
Application Number |
JP20030111982 |
Filing Date |
2003.04.16 |
Applicant |
SEIKO EPSON CORP |
Inventors |
KAYAHARA NAOKI;OHASHI HIROTAKA |
Classification Number |
G06F17/30;(IPC1-7):G06F17/30 |
Main Classification Number |
G06F17/30 |
Agency |
|
Agent |
|
Principal Claim |
|
Address |
|