摘要 |
METHODS ARE DISCLOSED FOR RECOVERING OR DETERMINING LOGICAL STRUCTURE OF A DOCUMENT BY ASSESSING DIFFERENT COMBINATIONS OF VERTICAL AND HORIZONTAL CUTS ACROSS A BLOCK OF THE DOCUMENT. THE BLOCK IS SEGMENTED USING A SCORING FUNCTION THAT DISCARDS HORIZONTAL CUTS IN FAVOR OF VERTICAL CUTS SHARED AMONG NEIGHBORING SUB-BLOCKS. THE ORDER IN WHICH THE BLOCKS AND SUB-BLOCKS ARE SEGMENTED IS THEN USED TO DEFINE THE LOGICAL STRUCTURE OF THE DOCUMENT, SUCH AS ITS READING ORDER. |