摘要 |
<P>PROBLEM TO BE SOLVED: To provide a document classifying device for automatically and accurately classifying a document to a proper person in charge. <P>SOLUTION: This document classifying device(mail analyzing device) 2 is provided with a dictionary 4 constituted of a significance dictionary 41 in which a tf/idf value is stored for each word appearing in the category and a simultaneous appearance dictionary 42 in which an idf/conf value is stored for each of the combination of the first word and the second word for the word appearing in the category. The document classifying device 2 is configured to calculate the tf/idf value and idf/conf value of each word of the dictionary by collating the word appearing in an inputted document in the dictionary 4, and to calculate the scores of each category on the basis of the scores of each word calculated by performing a predetermined arithmetic operation based on them, and to classify the inputted document into any of the plurality of categories on the basis of this. <P>COPYRIGHT: (C)2005,JPO&NCIPI |