发明名称 |
DOCUMENT PROCESSING UNIT AND METHOD |
摘要 |
PROBLEM TO BE SOLVED: To properly extract a label and keywords of a document group. SOLUTION: A phrase extracting part 1 extracts an important phrase from each sentence by morphological analysis. A phrase importance score calculating part 2 calculates a phrase importance score per each phrase. An inclusive relation analyzing part 3 creates a table indicating inclusive relations of the important phrases. A label extraction score calculating part 4 newly calculates a label extraction score from the phrase importance score of each phrase so that a label extraction score of an included phrase is higher than a label extraction score of an inclusive phrase. A keyword extraction score calculating part 5 calculates a keyword extraction score by adjusting the phrase importance score so that the included phrase is extracted as a keyword. A label selecting part 6 selects a phrase with the highest label extraction score as a label. A keyword selecting part 7 selects several phrases with high phrase importance scores from the top as keywords. COPYRIGHT: (C)2005,JPO&NCIPI
|
申请公布号 |
JP2005063298(A) |
申请公布日期 |
2005.03.10 |
申请号 |
JP20030295182 |
申请日期 |
2003.08.19 |
申请人 |
FUJI XEROX CO LTD |
发明人 |
SHIBATA YAYOI;UMEKI HIROSHI |
分类号 |
G06F17/30;(IPC1-7):G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|