发明名称 TEXT FILE COMPRESSION SYSTEM
摘要 A system for compressing an ASCII or similarly encoded text file is described. The system creates an alphabetically ordered main dictionary listing all unique words appearing in the text file. A text file "word" is defined as a sequence of characters ending with one or more "word terminators" such as spaces, commas, periods and carriage returns. The compression system also creates a common word dictionary referencing words most often encountered in the text file. The sequence of words forming the text file is represented by a word index, a list of one byte and two byte references to common and main dictionary words, respectively. The system compresses the main dictionary using three complementary techniques. First, leading characters of each dictionary word matching leading characters of a next preceding dictionary word are represented by data indicating the number of matching characters. Second, commonly encountered dictionary word suffixes are represented by data referencing entries of a small suffix dictionary. Third, remaining characters of main dictionary words are represented by bytes encoded to represent commonly encountered characters and groups of characters. The system also compresses style data structures often included in word processing text files.
申请公布号 WO9840969(A2) 申请公布日期 1998.09.17
申请号 WO1998US05134 申请日期 1998.03.16
申请人 J.STREAM, INC. 发明人 CRANDALL, GARY, E.
分类号 H03M7/30 主分类号 H03M7/30
代理机构 代理人
主权项
地址