摘要 |
PROBLEM TO BE SOLVED: To reduce misretrieval in the case of registering a long document. SOLUTION: At the time of registering a document, a bit string component is calculated from each character code component and adjacent characters more than two characters, a single character component table indicating whether each document includes respective components or not and one or more adjacent character component tables are generated, the entry of each character component in the character component table is divided into plural blocks and collectively registered in a secondary storage. At the time of starting batch registration, a memory area with size specified by a user is secured and a block is allocated to each of all character entries stored in the secured memory area. When there is no writing space in a block at the time of registration, the contents of the block are stored in the secondary storage to empty the block and then data are written in the block so that registration can be prevented from being interrupted due to the shortage of memory capacity on the way of registration. |