发明名称 |
SIMILARITY DETERMINATION APPARATUS, SIMILARITY DETERMINATION METHOD, AND COMPUTER-READABLE RECORDING MEDIUM |
摘要 |
A determination apparatus(100) has a feature extraction unit(150b) and a similarity determination unit(150c). The feature extraction unit(150b) counts a number of appearances of each keyword included in a piece of document information and deletes any arrangement including a keyword having the number of appearances less than a threshold under a condition where a number of types of keyword arrangements included in a certain range of the piece of document information is equal to or greater than a certain number and extracts, as features, a plurality of keyword arrangements from the piece of document information. The similarity determination unit(150c) determines a similarity between the different pieces of document information by comparing the features extracted from pieces of document information different from each other. |
申请公布号 |
EP3046037(A1) |
申请公布日期 |
2016.07.20 |
申请号 |
EP20150200078 |
申请日期 |
2015.12.15 |
申请人 |
FUJITSU LIMITED |
发明人 |
KOZAKURA, FUMIHIKO;ITOH, KOUICHI |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|