摘要 |
An object of the present invention is to provide a document processing device and document processing method that can provide a search result satisfactory to a user with respect to WWW documents in which a number of links among WWW documents is low and a number of accesses by users is low. An access pattern collection unit 101 generates an access user vector uj of one WWW document Dj and an access user vector uje of another document Dje. A user similarity computing unit 105 computes a document similarity sim (uj, uje) which indicates a user similarity between the WWW document Dj and WWW document Dje. A keyword vector smoothing unit 106 acquires a smoothed keyword weight vector w'j by correcting a keyword weight vector wj in one document, using the computed document similarity sim (uj, uje). An rearranging unit 110 calculates an evaluation value B_SCORE for input information for searching, based on the smoothed keyword weight vector w'j.
|