发明名称 Hierarchical data compression and computation
摘要 According to embodiments of the present invention, machines, systems, methods and computer program products for hierarchical compression of data are presented comprising creating a compression hierarchy of compression nodes, wherein each compression node is associated with a compression operation to produce compressed data. An output of any of the compression nodes may be compressed by another compression node or the same compression node. A path of one or more compression nodes is determined through said compression hierarchy based upon compression statistics to compress data, and the data is compressed by the compression nodes of the path. Various computational techniques are presented herein for manipulating the compression hierarchy to defer or reduce computation during query evaluation.
申请公布号 US9535923(B2) 申请公布日期 2017.01.03
申请号 US201615071376 申请日期 2016.03.16
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Ackerman William B.;Dickie Garth A.;Keller Jeffrey M.
分类号 H03M7/34;G06F17/30;H03M7/30;H03M7/46 主分类号 H03M7/34
代理机构 Edell, Shapiro & Finnan, LLC 代理人 Choi Edward;Edell, Shapiro & Finnan, LLC
主权项 1. A system for hierarchically compressing data comprising: one or more processors configured to: create a compression hierarchy of compression nodes, wherein each compression node is associated with a compression operation to produce compressed data, and wherein creating a compression hierarchy comprises: creating a run length encoding compression node in the compression hierarchy of compression nodes;determine a path of one or more compression nodes through said compression hierarchy, based upon compression statistics, to compress data; andcompress the data by the compression nodes of the path, wherein compressing the data comprises: performing part of a join operation on the data using the run length encoding compression node, wherein the join operation is performed by applying input data to a values field of the run length encoding compression node and by applying repeat counts to a lengths field of the run length encoding compression node.
地址 Armonk NY US