发明名称 Internal Linking Co-Convergence Using Clustering With Hierarchy
摘要 Certain implementations of the disclosed technology include systems and methods for internal co-convergence using clustering when there is hierarchy in the data structure. A method is included for clustering hierarchical database records into a first set of clusters having corresponding first cluster identifications (IDs), each hierarchical database record including one or more field values, the clustering based at least in part on determining similarity among corresponding field values of the hierarchical database records. The method includes receiving parent-child hierarchical relationship information for the hierarchical database records, re-clustering at least a portion of the hierarchical database records into a second set of clusters having corresponding second cluster IDs, the re-clustering based at least in part on the received parent-child hierarchical relationship information, and outputting hierarchical database record information, based at least in part on the re-clustering.
申请公布号 US2016283575(A1) 申请公布日期 2016.09.29
申请号 US201615172969 申请日期 2016.06.03
申请人 LexisNexis Risk Solutions FL Inc. 发明人 Bayliss David Alan
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer-implemented method comprising: clustering hierarchical database records into a first set of clusters having corresponding first cluster identifications (IDs), wherein each hierarchical database record corresponds to an entity representation, each hierarchical database record comprising a plurality of fields, each field configured to contain a field value, and each field value assigned a field value weight corresponding to a specificity of the field value in relation to all field values in a corresponding field of the hierarchical database records, the clustering based at least in part on determining similarity among corresponding field values of the hierarchical database records; determining parent-child hierarchical relationships among the hierarchical database records; associating related hierarchical database records by applying a hierarchal directional linking process, the hierarchal directional linking process comprising selecting and applying at least an upward process based on the determined parent-child hierarchical relationship wherein the upward process comprises: determining, from the parent-child hierarchical relationships, similarity among a plurality of child records having initial separate parent records;in response to determining a threshold similarity among the plurality of child records, inferring that the initial separate parent records correspond to the same entity; andlinking, responsive to the inferring, the initial separate parent records as inferred common parent records; and outputting database record information, based at least in part on associating the related hierarchical database records.
地址 Boca Raton FL US