发明名称 ROBUST DETECTOR OF FUZZY DUPLICATE
摘要 <p><P>PROBLEM TO BE SOLVED: To detect fuzzy duplicates, and eliminate such duplicates in at least one implementation. <P>SOLUTION: Fuzzy duplicates are multiple, seemingly distinct tuples (i.e., records) in a database that represent the same real-world entity or phenomenon. A solution to a fuzzy duplicate elimination problem is scale invariant such that a scale of a distance function does impact local structural properties of the tuples. It is split/merge consistent in that shrinking distances between tuples in a group of duplicates, and expanding distances between tuples across groups may only change a partition in limited ways. It has a constrained richness such that a range of a duplicate elimination function allows all groupings that would be useful in practice. <P>COPYRIGHT: (C)2006,JPO&NCIPI</p>
申请公布号 JP2006072985(A) 申请公布日期 2006.03.16
申请号 JP20050221802 申请日期 2005.07.29
申请人 MICROSOFT CORP 发明人 MOTWANI RAJEEV;CHAUDHURI SURAJIT;GANTI VENKATESH
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址