发明名称 SYSTEMS AND METHODS FOR EFFICIENT DATA SEARCHING, STORAGE AND REDUCTION
摘要 A computer program product for searching a repository of binary uninterpretted data, according to one embodiment, includes a computer readable storage medium having program instructions executable by a computer to cause the computer to perform a method comprising: analyzing, by the computer, segments of each of the repository and input data to determine a repository segment that is similar to an input segment, the analyzing including searching an index of representation values of the repository data for matching representation values of the input in a time independent of a size of the repository and linear in a size of the input data; and analyzing, by the computer, the similar repository segment with respect to the input segment to determine their common data sections while utilizing at least some of the matching representation values for data alignment, in a time linear in a size of the input segment.
申请公布号 US2016335285(A1) 申请公布日期 2016.11.17
申请号 US201615219127 申请日期 2016.07.25
申请人 International Business Machines Corporation 发明人 Aronovich Lior;Asher Ron;Bachmat Eitan;Bitner Haim;Hirsch Michael;Klein Shmuel T.
分类号 G06F17/30;G06F11/14 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer program product for searching a repository of binary uninterpretted data for a location of common data to an input data, the computer program product comprising a computer readable storage medium having program instructions executable by a computer to cause the computer to perform a method comprising: analyzing, by the computer, segments of each of the repository and input data to determine a repository segment that is similar to an input segment, the analyzing including searching an index of representation values of the repository data for matching representation values of the input in a time independent of a size of the repository and linear in a size of the input data; and analyzing, by the computer, the similar repository segment with respect to the input segment to determine their common data sections while utilizing at least some of the matching representation values for data alignment, in a time linear in a size of the input segment.
地址 Armonk NY US