摘要 |
PROBLEM TO BE SOLVED: To provide a new method for finding similarity between documents. SOLUTION: A similarity between documents is found by using at least either a co-occurrence vector or a sentence type vector in addition to a TF-IDF vector. Accordingly, the similarity, which more reflects the meaning and contents of the documents, can be found. COPYRIGHT: (C)2008,JPO&INPIT
|