摘要 |
Systems, methods and apparatus provide a character process for processing markup language data, such as XML data, by receiving a character stream of markup language data and applying sequences of characters of the character stream to a set of state machines. The set of state machines includes a plurality of construct state machines responsible for processing respective markup language constructs identified by the sequences of characters. The character processor produces, from application of the sequences of characters to the set of state machines, an intermediate representation of the markup language constructs identified by the sequence of characters of the character stream of markup language data. The intermediate representation contains encoded items containing type, length, value representations representative of constructs within the character stream of markup language data.
|