Title of Invention |
ENHANCED COMPRESSION, ENCODING, AND NAMING FOR RESOURCE STRINGS |
Abstract |
Technology is disclosed herein for compressing, encoding, and otherwise reducing the size of resource files. In at least one implementation, similarity compression is employed to reduce the size of a resource file. In another implementation, map-less encoding is employed to reduce the number of bytes used to represent a resource string. Bit-level compression is employed in another implementation to reduce the quantity of bits used to encode each character in a string. In addition, implementations are disclosed related to technology for naming strings and accelerated string location and retrieval. |
Publication Number |
US2016203152(A1) |
Publication Date |
2016.07.14 |
Application Number |
US201514594559 |
Filing Date |
2015.01.12 |
Applicant |
Microsoft Technology Licensing, LLC |
Inventors |
Burukhin Anatoliy;Moore Thomas Gary;Lavoy Thomas Jeffrey;Keary Rory |
Classification |
G06F17/30 |
Main Classification |
G06F17/30 |
Agency |
|
Agent |
|
Main Claim |
1. A method to facilitate enhanced resource file compression comprising:
ordering a set of resource strings in a resource file to produce an ordered set of resource strings in the resource file, wherein each of the ordered set of resource strings comprises a set of characters; and
reducing a size of the resource file by, for any resource string of the ordered set of resource strings that qualifies for map-less encoding, at least:
identifying a double-byte Unicode representation of each character in the set of characters in the resource string, wherein the double-byte Unicode representation comprises a lower byte and an upper byte;
identifying at least one character in the set of characters for which the upper byte of the one character comprises a non-zero value, wherein the non-zero value indicates an occurrence of non-Latin characters;
setting a value of an encoding byte to the non-zero value and retaining the encoding byte in the resource file to reflect the occurrence of the non-Latin characters;
discarding the upper byte from the resource file for each of the set of characters; and
retaining the lower byte in the resource file for each of the set of characters. |
Address |
Redmond WA US |
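
The map-less encoding steps recited in the main claim can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names `mapless_encode` and `mapless_decode` are hypothetical, and the qualification rule used here (every character fits in one 16-bit code unit and all characters share the same upper byte) is an assumption, since the claim does not spell out when a string qualifies. Placing the encoding byte in front of the lower bytes is likewise an illustrative layout choice.

```python
from typing import Optional


def mapless_encode(text: str) -> Optional[bytes]:
    """Return one encoding byte followed by the lower bytes, or None if the
    string does not qualify under the assumed rule."""
    if not text:
        return None
    code_points = [ord(ch) for ch in text]
    # Only characters that fit in a single 16-bit unit have one upper/lower byte pair.
    if any(cp > 0xFFFF for cp in code_points):
        return None
    upper_bytes = {cp >> 8 for cp in code_points}
    # Assumed qualification rule: the whole string shares a single upper byte.
    if len(upper_bytes) != 1:
        return None
    # A non-zero value here signals non-Latin characters (for example 0x04 for Cyrillic).
    encoding_byte = upper_bytes.pop()
    # Discard every upper byte and retain only the lower byte of each character.
    lower_bytes = bytes(cp & 0xFF for cp in code_points)
    return bytes([encoding_byte]) + lower_bytes


def mapless_decode(data: bytes) -> str:
    """Rebuild the string by pairing the encoding byte with each lower byte."""
    encoding_byte = data[0]
    return "".join(chr((encoding_byte << 8) | low) for low in data[1:])


if __name__ == "__main__":
    encoded = mapless_encode("Привет")
    if encoded is not None:
        print(len(encoded), mapless_decode(encoded))
```

Under these assumptions, a six-character Cyrillic string occupies seven bytes (one encoding byte plus six lower bytes) instead of the twelve bytes of its double-byte representation. A string that mixes upper bytes, such as Latin and Cyrillic together, is rejected by this sketch and would need a different encoding.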