Title of Invention: Enterprise Data Duplication Identification
Abstract: Systems, methods, and computer program products are provided for identifying duplicate data. In one exemplary embodiment, there is provided a method for identifying duplicate data. The method may include identifying one or more reference fields that include one or more data values. The method may include retrieving the one or more reference fields and one or more data values. The method may also include transforming the one or more reference fields into one or more reference fingerprint patterns. The method may also include identifying one or more target fields that include one or more target field values. The method may also include retrieving the one or more target fields. The method may also include transforming the one or more target field values into one or more target fingerprint patterns. The method may also include comparing the one or more reference fingerprint patterns with the one or more target fingerprint patterns. The method may further include determining an overlap between the one or more reference fingerprint patterns and the one or more target fingerprint patterns.
Publication No.: US2012059827(A1) Publication Date: 2012.03.08
Application No.: US20100874391 Filing Date: 2010.09.02
Applicant: BRITTAIN BRIAN;COOPER MARK;YERRAMSETTY VISWANATH Inventor: BRITTAIN BRIAN;COOPER MARK;YERRAMSETTY VISWANATH
Classification: G06F17/30 Main Classification: G06F17/30
Agency: Agent:
Principal Claim:
Address:
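
The method summarized in the abstract (transform reference and target field values into fingerprint patterns, compare them, and determine their overlap) can be illustrated with a minimal sketch. The record does not say how a fingerprint pattern is computed or how overlap is measured, so everything below is an assumption made for illustration: the function names `fingerprint`, `fingerprint_set`, and `overlap` are hypothetical, the fingerprint is taken to be a SHA-256 hash of a whitespace- and case-normalized value, and overlap is taken to be the fraction of distinct reference fingerprints that also occur among the target fingerprints.

```python
import hashlib


def fingerprint(value):
    """Hypothetical fingerprint: SHA-256 of the normalized value.

    Normalization (an assumption, not taken from the patent record):
    collapse runs of whitespace, trim the ends, and lower-case.
    """
    normalized = " ".join(str(value).split()).lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def fingerprint_set(values):
    """Transform a field's values into a set of fingerprints, skipping blanks."""
    return {fingerprint(v) for v in values if v is not None and str(v).strip()}


def overlap(reference_values, target_values):
    """Fraction of distinct reference fingerprints also present in the target.

    Returns 0.0 when the reference field has no usable values.
    """
    ref = fingerprint_set(reference_values)
    tgt = fingerprint_set(target_values)
    if not ref:
        return 0.0
    return len(ref & tgt) / len(ref)


# Illustrative data only; these field values are made up.
reference_field = ["Alice Smith", "Bob Jones", "Carol  White"]
target_field = ["alice smith", "BOB JONES ", "Dave Black"]
score = overlap(reference_field, target_field)
```

With the made-up values above, two of the three reference values have a matching fingerprint in the target field, so `score` is 2/3. A high score between two fields would suggest, under these assumptions, that the target field duplicates the reference field's data.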