发明名称 DOCUMENT PROCESSING UNIT AND METHOD
摘要 PROBLEM TO BE SOLVED: To properly extract a label and keywords of a document group. SOLUTION: A phrase extracting part 1 extracts an important phrase from each sentence by morphological analysis. A phrase importance score calculating part 2 calculates a phrase importance score per each phrase. An inclusive relation analyzing part 3 creates a table indicating inclusive relations of the important phrases. A label extraction score calculating part 4 newly calculates a label extraction score from the phrase importance score of each phrase so that a label extraction score of an included phrase is higher than a label extraction score of an inclusive phrase. A keyword extraction score calculating part 5 calculates a keyword extraction score by adjusting the phrase importance score so that the included phrase is extracted as a keyword. A label selecting part 6 selects a phrase with the highest label extraction score as a label. A keyword selecting part 7 selects several phrases with high phrase importance scores from the top as keywords. COPYRIGHT: (C)2005,JPO&NCIPI
申请公布号 JP2005063298(A) 申请公布日期 2005.03.10
申请号 JP20030295182 申请日期 2003.08.19
申请人 FUJI XEROX CO LTD 发明人 SHIBATA YAYOI;UMEKI HIROSHI
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址