Abstract |
PROBLEM TO BE SOLVED: To provide a structured document management apparatus that generates an index enabling fast search of structured documents. SOLUTION: The structured document management apparatus comprises: a vocabulary index storage part 143 that stores vocabulary indexes in pages, each page being a fixed-length storage area, where a vocabulary index associates a vocabulary identifier, which identifies a vocabulary item included in structured documents having a hierarchical logical structure, with identification information identifying the positions where that vocabulary item appears; a feature analysis part 114 that analyzes features of the distribution of the identification information included in the vocabulary indexes stored in the pages; and a block division part 115 that divides each page into a plurality of blocks, each containing one or more vocabulary indexes, according to the analyzed features, and saves a first area calculated for each divided block in the vocabulary index storage part 143. COPYRIGHT: (C)2009,JPO&INPIT |
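
The flow described in the abstract (analyze the distribution of identification information on a page, divide the page into blocks, record an area per block) can be illustrated with a minimal sketch. The abstract does not specify the feature, the division rule, or how the "first area" is calculated, so everything concrete here is an assumption made for illustration: identification information is simplified to a single integer position, the analyzed feature is the gap between consecutive positions, a page is split wherever a gap exceeds `factor` times the mean gap, and the area of a block is its (minimum, maximum) position range. The names `Entry`, `analyze_gaps`, `divide_page`, and `block_area` are hypothetical and do not come from the source.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class Entry:
    """One vocabulary index entry (hypothetical, simplified layout)."""
    vocab_id: int   # vocabulary identifier
    position: int   # ASSUMPTION: integer stand-in for the identification information


def analyze_gaps(page):
    """ASSUMED feature analysis: gaps between consecutive sorted positions."""
    positions = sorted(e.position for e in page)
    return [b - a for a, b in zip(positions, positions[1:])]


def divide_page(page, factor=2.0):
    """ASSUMED division rule: start a new block where a gap exceeds factor * mean gap."""
    entries = sorted(page, key=lambda e: e.position)
    if not entries:
        return []
    gaps = analyze_gaps(entries)
    threshold = factor * mean(gaps) if gaps else 0
    blocks = []
    current = [entries[0]]
    for prev, cur in zip(entries, entries[1:]):
        if cur.position - prev.position > threshold:
            blocks.append(current)
            current = []
        current.append(cur)
    blocks.append(current)
    return blocks


def block_area(block):
    """ASSUMED 'first area': the (min, max) position range covered by a block."""
    return (block[0].position, block[-1].position)


# Example page: two clusters of positions separated by a large gap.
page = [Entry(vocab_id=7, position=p) for p in (1, 2, 3, 50, 51, 52)]
blocks = divide_page(page)
areas = [block_area(b) for b in blocks]
```

Under these assumptions, a lookup for a given position range could skip any block whose stored area does not overlap the range, which is one plausible way a per-block area could speed up search. The actual apparatus may use a different feature, rule, and area definition.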