发明名称 Method and system for minimizing attribute naming errors in set oriented duplicate detection
摘要 The invention is a method for detecting duplicate records on a list or in a file and comprises a number of steps. The steps include entering a list, comprised of one or more records, to a data processing system; then, applying a nickname lookup table to the records to determine a common first name. Once a common name has been determined, the method matches a first record from the list with a second record from the list by comparing the fields of the first record with the fields of at least one other record; the comparison is based on a set of pre-determined criteria. The matching sequence determines a duplicate set, wherein the duplicate set is comprised of at least two records with fields that match. The method then lists matching records sequentially so that the system can create a new record by filling each empty field with a next available corresponding field from a subsequent record within the duplicate set. The newly created record is then retained on the original list; and the duplicate records are placed on a second list. Pre-sorting of the list can occur just prior to the matching sequence as well as just prior to outputting the final list. Additionally, the system operator can be given a number of options to provide flexibility. These options can include: manually correcting a record on the duplicate records list; deleting an address record from the list of duplicates; or, outputting the record.
申请公布号 US5799302(A) 申请公布日期 1998.08.25
申请号 US19950413579 申请日期 1995.03.30
申请人 PITNEY BOWES INC. 发明人 JOHNSON, ROBERT J.;SZTURMA, SHAWN W.
分类号 G06F12/02;G06F17/00;G06F17/30;G06F19/00;G06Q99/00;(IPC1-7):G06F17/30 主分类号 G06F12/02
代理机构 代理人
主权项
地址