发明名称 DOCUMENT SIMILARITY EVALUATION SYSTEM, DOCUMENT SIMILARITY EVALUATION METHOD, AND COMPUTER PROGRAM
摘要 Disclosed is a document similarity evaluation system or the like which can evaluate a degree of concentration and dispersion of parts with high similarity in at least two kinds of documents. The system includes a segment search unit which finds common segments (CS) in first and second segment strings, counts the number of CS, and identifies an appearance range (AR) within CS; and a similarity index (SI) calculation unit which calculates a first sum that is a sum of the numbers of characters of each segment (NCS) in AR and a second sum that is a sum of NCS of CS and calculates SI between the first and second segment strings by the following equation, SI=F(NTC)/G(NCC)×NS (where, NTC is the first sum, NCC is the second sum, NS is the number of the CS, functions F and G monotonically increase at larger than 0).
申请公布号 US2013191410(A1) 申请公布日期 2013.07.25
申请号 US201213672794 申请日期 2012.11.09
申请人 NEC CORPORATION;NEC CORPORATION 发明人 ZHOU WENQI
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址