摘要 |
PROBLEM TO BE SOLVED: To store the data of large capacity with high compressibility by extracting document information components from inputted document information, to which morpheme analytic processing is performed, and performing encoding and compressing processing. SOLUTION: A document information morpheme analytic part 1 extracts a word (including a morpheme) by performing morpheme analysis to the inputted document information. A morpheme analyzed data encoding part 2 encodes the extracted word into numerical value. An encoded data compressing part 3 further encodes the encoded morpheme data into different numerical value and compresses it. A data base 4 stores the compressed result of compressing processing. Based on the encoded morpheme encoded by the encoding part 2, a document information index preparing part 5 prepares a document information index corresponding to the document information stored in an information storage and retrieval device 100. This document information index is used for retrieving the document information or the like and recorded in a document information index storage part 6. |