Title: Electronic document equivalence determination system and equivalence determination method
Abstract: An equivalence determination system (10) according to this invention includes a specifying means (11) and a determination means (12). The specifying means (11) specifies the parts of respective electronic documents in a document database that are rarely modified manually. The determination means (12) determines whether the parts specified by the specifying means (11) match each other across a plurality of electronic documents and, when it determines that the parts match, specifies those documents as a plurality of similar electronic documents. An electronic document that cites part or all of another electronic document and has been slightly modified can thus be quickly identified in the document database.
Publication No.: US8977949(B2)  Publication date: 2015.03.10
Application No.: US200812682722  Filing date: 2008.10.10
Applicant: NEC Corporation  Inventor: Matsuda Katsushi
Classification: G06F17/00; G06F17/30; G06F17/22  Main classification: G06F17/00
Agency: Sughrue Mion, PLLC  Agent: Sughrue Mion, PLLC
Main claim: 1. An equivalence determination system comprising: a processor; an object extracting unit, executed on the processor, that extracts, from respective electronic documents in a set of electronic documents, at least one object which forms the electronic document and includes at least one of a text, a figure, and an equation; a specifying unit that specifies predetermined number of objects in the respective electronic documents based on density calculated by referring to the extracted objects; and a judging unit that judges that plural electronic documents are similar based on the specified objects, wherein said specifying unit calculates the density of a given object, from among the objects, based on an area of the given object, and an amount of character string and decoration information contained in the given object, wherein said specifying unit calculates a weighted density of the given object by selecting objects having a first distance or smaller from the given object, and adding a (i) sum value of inverse proportion to each distance from the given object to the selected objects to (ii) the density of the object.
Address: Tokyo JP
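
The density and weighted-density steps recited in claim 1 can be sketched roughly as follows. This is an illustrative reading, not the patented implementation: the claim does not give exact formulas, so the density ratio `(n_chars + n_decorations) / area`, the Euclidean distance between object positions, the choice of the highest-scoring objects, and the exact-match similarity test are all assumptions. The names `DocObject`, `density`, `weighted_density`, `specify`, and `similar` are hypothetical.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class DocObject:
    """One extracted object (text, figure, or equation). Fields are hypothetical."""
    content: str        # text, or a serialized form of a figure/equation
    x: float            # position of the object on the page (assumed)
    y: float
    area: float         # area occupied by the object
    n_chars: int        # amount of character string
    n_decorations: int  # amount of decoration information


def density(obj: DocObject) -> float:
    # Assumption: density is the amount of content per unit area.
    return (obj.n_chars + obj.n_decorations) / obj.area


def weighted_density(obj: DocObject, objects: list, first_distance: float) -> float:
    # Add, for every other object within first_distance, a value inversely
    # proportional to its distance (here simply 1/d) to the object's density.
    bonus = 0.0
    for other in objects:
        if other is obj:
            continue
        d = hypot(obj.x - other.x, obj.y - other.y)
        if 0 < d <= first_distance:  # skip coincident objects to avoid 1/0
            bonus += 1.0 / d
    return density(obj) + bonus


def specify(objects: list, n: int, first_distance: float) -> list:
    # Assumption: the "predetermined number" of objects are the n objects
    # with the highest weighted density.
    ranked = sorted(
        objects,
        key=lambda o: weighted_density(o, objects, first_distance),
        reverse=True,
    )
    return ranked[:n]


def similar(doc_a: list, doc_b: list, n: int, first_distance: float) -> bool:
    # Assumption: two documents are judged similar when the contents of
    # their specified objects match exactly.
    a = {o.content for o in specify(doc_a, n, first_distance)}
    b = {o.content for o in specify(doc_b, n, first_distance)}
    return a == b


# Usage: the second document edits only a sparse, isolated object, so the
# specified (dense) objects still match.
head = DocObject("alpha", 0, 0, area=10, n_chars=50, n_decorations=0)
body = DocObject("beta", 3, 4, area=20, n_chars=20, n_decorations=0)
note = DocObject("gamma", 100, 100, area=10, n_chars=10, n_decorations=0)
note_edited = DocObject("gamma edited", 100, 100, area=10, n_chars=11, n_decorations=0)

original = [head, body, note]
modified = [head, body, note_edited]
print(similar(original, modified, n=2, first_distance=10.0))
```

Under these assumptions, comparing only the specified objects keeps the comparison cheap, because each document is reduced to a small, fixed number of objects before any matching is done.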