摘要 |
PROBLEM TO BE SOLVED: To provide a document sorting device and method for sorting a document by dividing a document set into subsets by using attribute information and outputting a group characteristic to the subsets. SOLUTION: The device consists of a document record holding part for storing the set of a text field and an attribute information field, a document record set dividing part for dividing the document record set held there into a plurality of subsets, a text group holding part for holding the recognition number of a field which is used for dividing here and dividing condition information determined in advance for division, a text group generation part for generating a document group by each divided subset based on the text of the text field designated by the text group holding part, a group characteristic degree calculation part for calculating an index showing whether the group generated by the text group generation part is characteristic to the divided subsets by referring to the text group holding part, and a document group output part for outputting each document group with the identification information of the divided subsets based on the characteristic degree calculated by the group characteristic degree calculation part.
|