Abstract |
A method and apparatus for performing compositions and decompositions of Unicode combined character sequences uses a preprocessor to generate a plurality of mapping tables and a runtime processor to access those tables. A decomposition mapping table, created from a Unicode database and rules, maps precomposed Unicode characters to their respective decompositions. A composition mapping table, derived from the decomposition mapping table, includes canonically equivalent combined character sequences of the mapped decompositions. Additionally, a normalized mapping table, created from the composition mapping table, maps valid combined character sequences consisting of the same characters, wherein one of the sequences is defined as the normalized form. When a system entity requests a decomposition or composition of Unicode characters, the runtime processor accesses the mapping tables to provide the appropriate decomposition or composition.
|
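The relationship between the tables described in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation: it assumes Python's standard `unicodedata` module as a stand-in for the Unicode database, covers only the Latin-1 Supplement and Latin Extended-A range, and the names `build_decomposition_table`, `build_composition_table`, and `normalized_form` are hypothetical. The normalized form shown here is simply the canonical ordering of combining marks by combining class, which is one plausible choice and an assumption of this sketch.

```python
import unicodedata


def build_decomposition_table(code_points):
    """Preprocessor step (sketch): precomposed character -> canonical decomposition."""
    table = {}
    for cp in code_points:
        ch = chr(cp)
        raw = unicodedata.decomposition(ch)
        # Skip characters with no decomposition, and compatibility
        # decompositions, which carry a tag such as "<compat>".
        if not raw or raw.startswith("<"):
            continue
        # One-level mapping only; a full table would decompose recursively.
        table[ch] = "".join(chr(int(h, 16)) for h in raw.split())
    return table


def build_composition_table(decomposition_table):
    """Derive the inverse table: combined sequence -> precomposed character."""
    composition = {}
    for precomposed, sequence in decomposition_table.items():
        composition.setdefault(sequence, precomposed)
    return composition


def normalized_form(sequence):
    """Reorder the combining marks after the base character by combining class.

    Assumes the sequence is one base character followed by combining marks.
    sorted() is stable, so marks of the same class keep their relative order.
    """
    base, marks = sequence[0], sequence[1:]
    return base + "".join(sorted(marks, key=unicodedata.combining))


# Tables built once, ahead of time, for an assumed code point range.
DECOMPOSITION = build_decomposition_table(range(0x00C0, 0x0180))
COMPOSITION = build_composition_table(DECOMPOSITION)


def decompose(ch):
    """Runtime lookup: return the decomposition, or the character itself."""
    return DECOMPOSITION.get(ch, ch)


def compose(sequence):
    """Runtime lookup: return the precomposed character, or the sequence itself."""
    return COMPOSITION.get(normalized_form(sequence), sequence)
```

In this sketch, `decompose("\u00e9")` looks up the precomposed character é and `compose("e\u0301")` performs the reverse lookup, while `normalized_form` gives two sequences built from the same characters a single representative ordering.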