摘要 |
PROBLEM TO BE SOLVED: To provide a document processing apparatus and program that can create rules enabling highly accurate classification of documents.SOLUTION: A document processing apparatus includes a reception unit 32 that receives image information on documents, a character information extraction unit 34 that extracts character information including character strings from the received image information on the documents, a classification unit 36 that classifies the documents received by the reception unit 32 on the basis of the character information extracted by the character information extraction unit 34, and a classification rule creation unit 38 that creates classification rules for the classification unit 36 so as to adjust the redundancy of character string recognition by the character information extraction unit 34. |