Title of invention: METHOD FOR DETERMINING NEAR DUPLICATE DATA OBJECTS
Abstract: A system for determining that a document B is a candidate near duplicate of a document A at a given similarity level th. The system includes a storage that provides two different functions on the documents, each function having a numeric value. The system further includes a processor associated with the storage and configured to determine that document B is a candidate near duplicate of document A if a condition is met. The condition includes: for each function $f_i$ from among the two functions, $|f_i(A) - f_i(B)| \le \delta_i(f, A, th)$.
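The candidate test in the abstract can be illustrated with a minimal sketch. The abstract does not specify the two functions or the form of the tolerance, so everything concrete here is an assumption made for illustration: `word_count` and `distinct_word_count` stand in for the two numeric functions, `delta` uses a hypothetical tolerance of (1 - th) times the function's value on A, and the difference is compared in absolute value.

```python
from typing import Callable, Sequence

# A feature maps a document to a numeric value (the f_i of the abstract).
Feature = Callable[[str], float]


def word_count(doc: str) -> float:
    """Hypothetical feature 1: number of whitespace-separated tokens."""
    return float(len(doc.split()))


def distinct_word_count(doc: str) -> float:
    """Hypothetical feature 2: number of distinct lowercase tokens."""
    return float(len(set(doc.lower().split())))


def delta(f: Feature, a: str, th: float) -> float:
    """Hypothetical tolerance delta_i(f, A, th): shrinks to 0 as th approaches 1."""
    return (1.0 - th) * f(a)


def is_candidate(
    a: str,
    b: str,
    th: float,
    features: Sequence[Feature] = (word_count, distinct_word_count),
) -> bool:
    """Return True if B passes the bound for every feature, i.e. B remains a candidate."""
    return all(abs(f(a) - f(b)) <= delta(f, a, th) for f in features)


doc_a = "the quick brown fox jumps over the lazy dog today"
doc_b = "the quick brown fox jumps over the lazy dog"
doc_c = "hello world"

print(is_candidate(doc_a, doc_b, th=0.8))
print(is_candidate(doc_a, doc_c, th=0.8))
```

A check of this kind only screens candidates: a document that passes every bound may still need a full comparison, while one that fails any bound can be discarded cheaply.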
Publication number: US2009028441(A1) Publication date: 2009.01.29
Application number: US20050572441 Filing date: 2005.07.07
Applicant: EQUIVIO LTD Inventors: MILO AMIR; RAVID YIFTACH
Classification: G06K9/68 Main classification: G06K9/68
Agency: Agent:
Main claim:
Address: