发明名称 |
REORDERING OF DATABASE RECORDS FOR IMPROVED COMPRESSION |
摘要 |
According to embodiments of the present invention, apparatus, systems, methods and computer program products for sorting and compressing an unordered set of data records from a structured database are provided. Fields of the unordered set of data records are prioritized based on an impact of those fields to a compression scheme for column-oriented compression. The unordered set of data records are sorted based on the prioritized field(s) with a greatest impact on the performance metric. Data of the sorted data records are compressed according to a compression scheme. In some embodiments, prioritizing the fields may be based on an anticipated level of usage of data within those fields and/or a cost function associated with a performance metric as well as optimization of compression. A performance metric may include a faster computational time, reduced I/O computation, faster scan time, etc. |
申请公布号 |
US2015347426(A1) |
申请公布日期 |
2015.12.03 |
申请号 |
US201514644762 |
申请日期 |
2015.03.11 |
申请人 |
International Business Machines Corporation |
发明人 |
Dickie Garth A.;Keller Jeffrey M. |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A method of compressing an unordered set of data records from a structured database comprising:
prioritizing fields of the unordered set of data records based on an impact of those fields to a performance metric for accessing data stored in a column-oriented compressed database; sorting the unordered set of data records based on the one or more prioritized field with a greatest impact on the performance metric; and compressing data of the sorted data records according to a compression scheme. |
地址 |
Armonk NY US |