摘要 |
<p>A method and apparatus for compressing inherently redundant data. A Unicode file is comprised of prefix group indicator bytes and suffix character indicator bytes and can therefore be separated into two files, one containing the prefixes and one containing the suffix characters. Then, each separate file can be separately compressed using means best suited to the characteristics of each. Because of the high degree of redundancy across the prefix group indicator bytes they can be more greatly compressed which in turn results in greater compression of the entire Unicode file. Multiple compression methodologies, equally applicable to any inherently redundant data file, can be applied to the prefix group indicator bytes to yield the best compression results.</p> |