Title of Invention |
A METHOD FOR DETERMINING NEAR DUPLICATE DATA OBJECTS |
Abstract |
A system for determining that a document B is a candidate for near duplicate to a document A with a given similarity level th. The system includes a storage for providing two different functions on the documents, each function having a numeric function value. The system further includes a processor associated with the storage and configured to determine that the document B is a candidate for near duplicate to the document A, if a condition is met. The condition includes: for any function f_i |
Publication Number |
WO2006008733(A3) |
Publication Date |
2007.03.01 |
Application Number |
WO2005IL00726 |
Filing Date |
2005.07.07 |
Applicant |
EQUIVIO LTD.;MILO, AMIR;RAVID, YIFTACH |
Inventor |
MILO, AMIR;RAVID, YIFTACH |
Classification |
G06E1/00 |
Main Classification |
G06E1/00 |
Patent Agency |
|
Agent |
|
Principal Claim |
|
Address |
|