发明名称 System and method for data compression using multiple small encoding tables
摘要 A system and method for compressing and decompressing multiple types of character data. The system and method employ multiple encoding tables, each designed for encoding a subset of character data, such as numeric data, uppercase letters, lowercase letters, Latin, or UNICODE data, to perform compressions and decompression of character data, and. The compression solution also provides for the creation of new encoding tables on the fly, and accommodates the compression of lengthy character streams using multiple different general compression algorithms, automatically choosing a suitable general compression algorithm for specific input data.
申请公布号 US8872679(B1) 申请公布日期 2014.10.28
申请号 US201213705217 申请日期 2012.12.05
申请人 Teradata US, Inc. 发明人 Roberts Gary;Wang Guilian
分类号 H03M7/00;H03M7/42 主分类号 H03M7/00
代理机构 代理人 Stover James
主权项 1. A method for compressing character data, the method comprising the steps of: maintaining within a computer system a plurality of encoding tables corresponding to a plurality of character alphabets; receiving, by said computing system, an input stream containing uncompressed character data; determining, by said computing system, the length of said input string containing uncompressed character data; when the length of said input stream exceeds a predefined length value, compressing, by said computing system, said uncompressed character data using a general purpose compression algorithm; and when the length of said input stream does not exceed said predefined length value, selecting, by said computing system, an encoding table from said plurality of encoding tables, said selected encoding table corresponding to the character alphabet of said uncompressed character data contained within said input stream, and compressing, by said computing system, said uncompressed character data using said selected encoding table to provide compressed character data.
地址 Dayton OH US