摘要 |
An information structuring system for analyzing the structure of a document, comprising: an extraction unit which extracts a noun from a document and establishes an association between the extracted noun and at least one node stored in a database, thus associating the at least one node with the extracted noun; a candidate enumeration unit which, if a plurality of node candidates are associated with the extracted noun, searches for relay nodes for connecting the node candidates to nouns having identified identification information; a calculation unit which calculates first relevancy between each found relay node and each noun having identified identification information and second relevancy between each found relay node and each node candidate; a limiting unit which determines a relay node for which the first relevancy is high and the second relevancy is low; and a determination unit which determines nodes associated with the extracted noun, on the basis of node candidates associated with the determined relay node. |