Title of invention METHOD AND DEVICE FOR EXTRACTING DOCUMENT INFORMATION AND STORAGE MEDIUM STORING DOCUMENT EXTRACTING PROCESS PROGRAM
Abstract PROBLEM TO BE SOLVED: To extract a group of document contents as one coherent context, because extracting part of a document or finding the differences between two documents becomes difficult when the document is divided too finely. SOLUTION: Paragraphs are detected in the contents of a document, the contents are divided at the paragraph boundaries, and morphological analysis is carried out on each paragraph. Feature elements are then extracted from the morphological analysis results (step s1), and a feature table is generated that shows the relationship between the feature elements and the paragraphs containing them (step s2). Based on this feature table, the document is classified by content into meaningful groups (step s3). When a content selection instruction is received from a user (step s4), the document contents of the paragraphs belonging to the selected content are output (steps s5 and s6).
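The steps s1 to s6 described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how feature elements are chosen or how the feature table is turned into groups. The sketch assumes that paragraphs are separated by blank lines, substitutes a simple lowercase word tokenizer with a stopword list for morphological analysis (Japanese text would need a real morphological analyzer), and assumes that two paragraphs belong to the same content group when they share at least one feature element. All function names and the `STOPWORDS` set are hypothetical.

```python
import re
from collections import defaultdict

# Hypothetical stopword list; the abstract does not define how features are filtered.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "are", "for", "on", "with"}


def split_paragraphs(text):
    """Divide the document at blank lines (assumed paragraph boundary)."""
    return [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]


def extract_features(paragraph):
    """Step s1 stand-in: lowercase word tokens minus stopwords.

    This replaces morphological analysis with a plain regex tokenizer.
    """
    words = re.findall(r"[a-z]+", paragraph.lower())
    return {w for w in words if w not in STOPWORDS and len(w) > 2}


def build_feature_table(paragraphs):
    """Step s2: map each feature element to the paragraph indices containing it."""
    table = defaultdict(set)
    for idx, para in enumerate(paragraphs):
        for feature in extract_features(para):
            table[feature].add(idx)
    return dict(table)


def group_paragraphs(table, n_paragraphs):
    """Step s3 (assumed rule): paragraphs sharing any feature form one group.

    Implemented with union-find; the smallest paragraph index is kept as the
    root of each group, so groups come out in document order.
    """
    parent = list(range(n_paragraphs))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for members in table.values():
        ordered = sorted(members)
        for other in ordered[1:]:
            ra, rb = find(ordered[0]), find(other)
            if ra != rb:
                parent[max(ra, rb)] = min(ra, rb)

    groups = defaultdict(list)
    for i in range(n_paragraphs):
        groups[find(i)].append(i)
    return [groups[root] for root in sorted(groups)]


def select_content(paragraphs, groups, choice):
    """Steps s4 to s6: return the paragraphs of the group the user selected."""
    return [paragraphs[i] for i in groups[choice]]
```

Given a document whose first two paragraphs both mention a printer driver and whose third discusses an unrelated topic, `group_paragraphs` would place the first two paragraphs in one group and the third in another, and `select_content` returns the text of whichever group index is chosen. Grouping on any single shared feature is deliberately coarse; a weighted or thresholded similarity would be a natural refinement.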
Publication number JPH10320409(A) Publication date 1998.12.04
Application number JP19970128986 Filing date 1997.05.19
Applicant SEIKO EPSON CORP Inventor MIWA SHINJI
Classification G06F17/21;G06F17/27;G06F17/30 Main classification G06F17/21
Agency Agent
Principal claim
Address