发明名称 A METHOD FOR DETERMINING NEAR DUPLICATE DATA OBJECTS
摘要 A system for determining that a document B is a candidate for near duplicate to a document A with a given similarity level th. The system includes a storage for providing two different functions on the documents, each function having a numeric function value. The system further includes a processor associated with the storage and configured to determine that the document B is a candidate for near duplicate to the document A, if a condition is met. The condition includes: for any function f <SUB>i</SUB>
申请公布号 WO2006008733(A3) 申请公布日期 2007.03.01
申请号 WO2005IL00726 申请日期 2005.07.07
申请人 EQUIVIO LTD.;MILO, AMIR;RAVID, YIFTACH 发明人 MILO, AMIR;RAVID, YIFTACH
分类号 G06E1/00 主分类号 G06E1/00
代理机构 代理人
主权项
地址