Abstract |
PROBLEM TO BE SOLVED: To provide a database management system that keeps the growth in data volume to a minimum even when character strings of different lengths are treated as the same index word because of notational variation, and that correctly locates where each character string appears in the original document. SOLUTION: The database management system has a document database that manages documents and an index database that manages the positions in each document at which index words occur. A document that has been read is registered in the document database as the original document. A correspondence table is prepared that maps positions in a normalized document, obtained by normalizing the original document with representative index words, to positions in the original document. For every index word extracted from the normalized document, the start position and end position at which the index word is detected are converted, by referring to the correspondence table, into the start position and end position in the original document, and an index is prepared from them. A converted index of this index is then prepared and stored in the index database. COPYRIGHT: (C)2006,JPO&NCIPI
|
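The position conversion described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names `normalize` and `build_index`, the `variants` dictionary (notational variant to representative index word), and the use of two per-character arrays as the correspondence table are all assumptions made for illustration, and plain dictionaries stand in for the document and index databases.

```python
def normalize(original, variants):
    """Replace each notational variant with its representative index word.

    Returns the normalized text plus a correspondence table in two parts:
    start_map[k] is the original start offset of the unit that produced
    normalized character k, and end_map[k] is that unit's original end
    offset (exclusive). Both names are hypothetical.
    """
    normalized, start_map, end_map = [], [], []
    # Try longer variants first; ignore empty keys to avoid an endless loop.
    keys = sorted((v for v in variants if v), key=len, reverse=True)
    i = 0
    while i < len(original):
        match = next((v for v in keys if original.startswith(v, i)), None)
        if match is None:
            # Unchanged character: maps to itself in the original.
            normalized.append(original[i])
            start_map.append(i)
            end_map.append(i + 1)
            i += 1
        else:
            # Replaced span: every character of the representative word
            # maps back to the whole variant in the original.
            rep = variants[match]
            normalized.append(rep)
            start_map.extend([i] * len(rep))
            end_map.extend([i + len(match)] * len(rep))
            i += len(match)
    return "".join(normalized), start_map, end_map


def build_index(original, variants, index_words):
    """Detect index words in the normalized text and record their
    (start, end) positions converted to offsets in the original text."""
    normalized, start_map, end_map = normalize(original, variants)
    index = {}
    for word in index_words:
        if not word:
            continue  # an empty index word has no meaningful position
        positions = []
        pos = normalized.find(word)
        while pos != -1:
            start = start_map[pos]
            end = end_map[pos + len(word) - 1]
            positions.append((start, end))
            pos = normalized.find(word, pos + 1)
        index[word] = positions
    return index


original = "A data base and a database."
variants = {"data base": "database"}  # variant -> representative index word
index = build_index(original, variants, ["database", "and"])
print(index["database"])  # [(2, 11), (18, 26)]
print(original[2:11])     # data base
```

Both spellings are indexed under the single word "database", yet each stored position points at the exact span in the original document, including the 9-character variant. Only the original text needs to be kept; the normalized text and the correspondence table can be discarded once the index is built.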