发明名称 Method and system for analyzing the logical structure of a document
摘要 An input document is matched with predetermined patterns on a line-by-line basis, whereby it can be assigned a plurality of pairs of attributes and costs. When the process for the whole document is completed, in accordance with a rule specifying the combination of attributes between the adjacent lines, the nodes of a graph are generated, the nodes are linked with each other, and costs are given to the node and links. There is a plurality of paths for traveling the graph from the root node to the final node, and each of them means the interpretation of a possible logical structure of the document. By summing the costs for the traveled nodes and links, a total cost value can be associated with each path, and by prioritizing by this total cost value, a plurality of logical structure interpretations can be sequentially shown from the most plausible path (logical structure interpretation). A chosen logical structure is tagged as required.
申请公布号 US5669007(A) 申请公布日期 1997.09.16
申请号 US19950395559 申请日期 1995.02.28
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 TATEISHI, YUKA
分类号 G06K9/62;G06F17/21;G06K9/20;G06T11/60;(IPC1-7):G06F17/27 主分类号 G06K9/62
代理机构 代理人
主权项
地址