发明名称 Behavior based record linkage
摘要 A computer implemented method for matching data records from multiple entities comprising providing respective transaction logs for the entities representing actions performed by or in respect of the entities, determining a matching score using the transaction logs for respective pairs of the entities and for predetermined combinations of merged entities by generating a measure representing a gain in behavior recognition for the entities before and after merging, and using the gain as a matching score.
申请公布号 US9514167(B2) 申请公布日期 2016.12.06
申请号 US201113195319 申请日期 2011.08.01
申请人 QATAR FOUNDATION 发明人 Yakout Mohamed;Elmagarmid Ahmed K.;Elmeleegy Hazem;Ouzzani Mourad;Qi Yuan
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Mossman, Kumar & Tyler PC 代理人 Mossman, Kumar & Tyler PC
主权项 1. A computer implemented method, with at least one step executed by a computer, the method for matching data records from multiple entities to identify if the multiple entities are the same entity, comprising: providing respective transaction logs for the multiple entities representing actions performed by or in respect of the multiple entities; extracting behavior data for the multiple entities from the transaction logs; determining candidate entity matches between pairs of entities using the behavior data of each entity of the pair by generating pairs of entity matches and using those pairs not discarded by a coarse matching function as candidate entity matches; merging the behavior data of each pair of candidate entity matches to generate a merged behavior matrix for each pair; calculating a behavior recognition score for the merged behavior matrix and for each entity of the pair of candidate entity matches; determining a gain from the behavior recognition score for each entity of the pair of candidate entity matches to the recognition score for the merged behavior matrix of the respective pair of candidate entity matches; determining a matching score for each pair of candidate entity matches using the gain in behavior recognition score, wherein the gain in behavior recognition score is indicative of the two entities in the pair of candidate entity matches being the same entity; identifying which entities represent the same entity among the multiple entities if the matching score is above a predetermined threshold; and associating the identified matching entities of the multiple entities as the same entity.
地址 Doha QA
您可能感兴趣的专利