发明名称 DATA DUPLICATION DETECTION IN AN IN MEMORY DATA GRID (IMDG)
摘要 Embodiments of the invention provide a method, system and computer program product for data duplication detection in an in memory data grid (IMDG). A method for data duplication detection in an IMDG includes computing a hash value for each binary data value in a key value pair of a partition in an IMDG. The method also includes generating a map including an entry for each unique computed hash value and one or more keys corresponding to binary data values of respective key value pairs from which the hash value had been uniquely computed. Thereafter, only those hash values in the map with multiple keys associated therewith are identified and binary data corresponding to the multiple keys of the identified hash values are reported as potential duplicate data in the IMDG.
申请公布号 US2015254267(A1) 申请公布日期 2015.09.10
申请号 US201514658233 申请日期 2015.03.15
申请人 International Business Machines Corporation 发明人 Berg Douglas;Gaur Nitin;Johnson Christopher D.;Martin Brian K.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method for data duplication detection in an in memory data grid (IMDG), the method comprising: computing by a processor of a computer a hash value for each binary data value in a key value pair of a partition in an IMDG; generating a map in memory of the computer the map including an entry for each unique computed hash value and one or more keys corresponding to binary data values of respective key value pairs from which the hash value had been uniquely computed; identifying only those hash values in the map with multiple keys associated therewith; and, reporting binary data corresponding to the multiple keys of the identified hash values as potential duplicate data in the IMDG.
地址 Armonk NY US