发明名称 REORDERING OF DATABASE RECORDS FOR IMPROVED COMPRESSION
摘要 According to embodiments of the present invention, apparatus, systems, methods and computer program products for sorting and compressing an unordered set of data records from a structured database are provided. Fields of the unordered set of data records are prioritized based on an impact of those fields to a compression scheme for column-oriented compression. The unordered set of data records are sorted based on the prioritized field(s) with a greatest impact on the performance metric. Data of the sorted data records are compressed according to a compression scheme. In some embodiments, prioritizing the fields may be based on an anticipated level of usage of data within those fields and/or a cost function associated with a performance metric as well as optimization of compression. A performance metric may include a faster computational time, reduced I/O computation, faster scan time, etc.
申请公布号 US2015347426(A1) 申请公布日期 2015.12.03
申请号 US201514644762 申请日期 2015.03.11
申请人 International Business Machines Corporation 发明人 Dickie Garth A.;Keller Jeffrey M.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method of compressing an unordered set of data records from a structured database comprising: prioritizing fields of the unordered set of data records based on an impact of those fields to a performance metric for accessing data stored in a column-oriented compressed database; sorting the unordered set of data records based on the one or more prioritized field with a greatest impact on the performance metric; and compressing data of the sorted data records according to a compression scheme.
地址 Armonk NY US