发明名称 METHODS OF CREATING A DICTIONARY FOR DATA COMPRESSION
摘要 Some aspects of the invention provide methods, systems, and computer program products for creating a static dictionary in which longer byte-strings are preferred. To that end, in accordance with aspects of the present invention, a new heuristic is defined to replace the aforementioned frequency count metric used to record the number of times a particular node in a data tree is visited. The new heuristic is based on counting the number of times an end-node of a particular byte-string is visited, while not incrementing a count for nodes storing characters in the middle of the byte-string as often as each time such nodes are visited. The result is an occurrence count metric that favours longer byte-strings, by being biased towards not incrementing the respective occurrence count values for nodes storing characters in the middle of a byte-string.
申请公布号 US2007229323(A1) 申请公布日期 2007.10.04
申请号 US20060278118 申请日期 2006.03.30
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 PLACHTA PIOTR M.;SAUER WOLFRAM;IYER BALAKRISHNA R.;WHITE STEVEN W.
分类号 H03M7/34 主分类号 H03M7/34
代理机构 代理人
主权项
地址