摘要 |
After three kinds of data, i.e., a keyword frequency-of-appearance (103), a document length (105), and a keyword weight (107) are produced, a document profile vector (111) and a keyword profile vector (109) are calculated. Then, by independently performing the weighted principal component analysis (112,114) considering the document length and the keyword weight, a document feature vector and a keyword feature vectors are obtained. Then, documents and keywords having higher similarity to the feature vectors calculated with reference to the retrieval and extracting conditions are obtained and displayed. <IMAGE>
|