发明名称 Lossless compression of high nominal-range data
摘要 A method for receiving a data stream that includes data samples, each data sample having one of a plurality of actual values. For each data sample in the data stream, a first index in a dictionary is selected. The dictionary includes indices corresponding to each of the plurality of actual values. The first index corresponds to an actual value of the data sample. A predicted value of the data sample is generated in response to previously received data samples in the data stream and to a prediction algorithm. A second index in the dictionary that corresponds to an actual value in the dictionary that is closest to the value of the predicted value is selected. The difference between the first index and the second index is calculated and compressed. The compressed difference between the first index and the second index is then output. This process is performed for each data sample in the data stream.
申请公布号 US8990217(B2) 申请公布日期 2015.03.24
申请号 US201113181880 申请日期 2011.07.13
申请人 International Business Machines Corporation 发明人 Jagmohan Ashish;Knight Joshua W.;Lastras-Montano Luis A.
分类号 G06F17/30;H03M7/30 主分类号 G06F17/30
代理机构 Cantor Colburn LLP 代理人 Cantor Colburn LLP
主权项 1. A method comprising: receiving a data stream comprising data samples, each of the data samples having one of a plurality of actual values; classifying the data stream into a plurality of different classes; and performing for each of the data samples: selecting a first index in a dictionary, the dictionary comprising indices corresponding to each of the plurality of actual values in a sorted order of the actual values, the first index corresponding to an actual value of the data sample quantized to the dictionary and assigned an index value in a sorted order according to the sorted order of the actual values;using for each of the different classes a different dictionary for selecting the first index;generating a predicted value of the data sample in response to previously received data samples in the data stream and to a prediction algorithm, wherein the prediction algorithm is selected in response to the class of the data sample and the value is constructed based on a class based unique value dictionary;selecting a second index of the predicted value in the dictionary such that the second index is an index value that corresponds to an actual value in the sorted order of the actual values in the dictionary that is closest to a quantization of the predicted value;calculating a difference between the first index and the second index as a quantized label difference;compressing the difference between the first index and the second index; andoutputting the compressed difference between the first index and the second index.
地址 Armonk NY US