发明名称 Method, controller, program and data storage system for performing reconciliation processing
摘要 Embodiments include a method for reconciling a target data node with a data graph encoding a plurality of interconnected data nodes. The method comprises filtering an initial candidate set of data nodes from among the plurality of interconnected data nodes by performing a partial comparison process of a member of the initial candidate set with the target data node. The partial comparison process comprises using a first set of hash functions to compare a first set of features extracted from each of the member and the target data node, and if the outcome of the partial comparison process satisfies one or more removal criteria, removing: the member from the initial candidate set; and any other members from the initial candidate set assessed as having a semantic similarity with the member above a semantic similarity threshold. The partial comparison process further comprises repeating the performing, and removing on condition of the removal criterion being satisfied, until each remaining member of the initial candidate set has had the partial comparison process with the target data node completed. The method further comprises performing full comparison processing between the target data node and each remaining member of the initial candidate set following the filtering, the full comparison processing comprising using a second set of hash functions to compare a second set of features extracted from both the remaining member and the target data node. Wherein the second set of hash functions contains more hash functions than the first set of hash functions.
申请公布号 EP3001329(A1) 申请公布日期 2016.03.30
申请号 EP20140186396 申请日期 2014.09.25
申请人 FUJITSU LIMITED 发明人 HU, BO
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址