发明名称 SIMILARITY DETERMINATION APPARATUS, SIMILARITY DETERMINATION METHOD, AND COMPUTER-READABLE RECORDING MEDIUM
摘要 A determination apparatus(100) has a feature extraction unit(150b) and a similarity determination unit(150c). The feature extraction unit(150b) counts a number of appearances of each keyword included in a piece of document information and deletes any arrangement including a keyword having the number of appearances less than a threshold under a condition where a number of types of keyword arrangements included in a certain range of the piece of document information is equal to or greater than a certain number and extracts, as features, a plurality of keyword arrangements from the piece of document information. The similarity determination unit(150c) determines a similarity between the different pieces of document information by comparing the features extracted from pieces of document information different from each other.
申请公布号 EP3046037(A1) 申请公布日期 2016.07.20
申请号 EP20150200078 申请日期 2015.12.15
申请人 FUJITSU LIMITED 发明人 KOZAKURA, FUMIHIKO;ITOH, KOUICHI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址