发明名称 Matching data based on numeric difference
摘要 Systems and methods for matching data based on numeric difference are described herein. Input data elements are parsed to identify a first number and a second number. A difference between the first number and the second number is calculated based on a predefined formula. Based on the difference, a matching score between the input data elements is evaluated. The matching score is proportional to a base matching score corresponding to a threshold difference, and a maximum score corresponding to a match between the first number and the second number. A similarity between the input data elements is reported based on the evaluated matching score.
申请公布号 US9229971(B2) 申请公布日期 2016.01.05
申请号 US201012973961 申请日期 2010.12.21
申请人 Business Objects Software Limited 发明人 Woody Jeffrey;Gujjewar Abhiram;Spiess Mark
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer system for matching data based on numeric difference, the computer system comprising a processor, the processor communicating with one or more memory devices storing instructions, the instructions operable to: receive a first data element from a first data source of multiple data sources communicatively accessible at said computer system, wherein said multiple data sources include one or more data sources selected from a group consisting of a file, a database table and an electronic message; receive a second data element from a second data source of said multiple data sources; parse said first data element to identify and convert numeric characters to a first number, wherein said first number includes at least one digit; parse said second data element to identify and convert numeric characters to a second number, wherein said second number includes at least one digit; at a runtime environment of said computer system, generate a matching score based on a numeric difference between the magnitudes of said first number and said second number, wherein said numeric difference is calculated at said computer system based on metadata accessible by said runtime environment; and consolidate said first data element with said second data element into a master record when said matching score is greater than or equal to a base score, wherein said master record includes information selected from one or more of said first data element and said second data element based on one or more priorities selected from a group consisting of source, frequency, completeness and recency.
地址 Dublin IE