发明名称 Method for compressing character-based markup language files including non-standard characters
摘要 A method for compressing character-based markup language files in a web document prior to compression of the entire web document. The method first includes converting the tags and the attributes of the tags to a single case format. Then, the attributes are placed in a specified order within the tags in order to make the tags more uniform and to enable larger strings of common text to be found. Finally, any unnecessary white spaces and end-of-line characters are eliminated to decrease the size of the file. Then, the shorter of two alternative text string representations of any non-standard characters will be determined and used in order to further decrease the size of the file. The document that results from the method of the invention will compress more efficiently, yet the content is semantically identical to its original form.
申请公布号 US2002107866(A1) 申请公布日期 2002.08.08
申请号 US20010800846 申请日期 2001.03.06
申请人 COUSINS ROBERT E.;SILVA JENNIFER N. 发明人 COUSINS ROBERT E.;SILVA JENNIFER N.
分类号 H03M7/30;(IPC1-7):G06F7/00 主分类号 H03M7/30
代理机构 代理人
主权项
地址