摘要 |
A structured document encoder for encoding a structured document which defines a tree structure including nodes includes: a node identifier assigning unit for assigning a node identifier to each of the nodes; a node position information generator for generating node position information for each of the nodes, node position information of an given node from the nodes comprising at least an identifier of the given node, an identifier of a child node of the given node, and an identifier of a next sibling node which has the same parent node as the given node; and a structured document encoded representation generator for generating a structured document encoded representation by combining the node position information and the node content information of all of the nodes.
|