Abstract |
A document processing apparatus includes: a symbol-related information acquisition unit that identifies a text area in a scanned document, extracts symbols from the identified text area, and acquires symbol-related information for each extracted symbol; a symbol division unit that divides the extracted symbols into several groups based on a preset reference value for the symbol-related information; and a key index generation unit that generates a key index by arranging the symbols of one of the divided groups. As a result, a user can find a desired document more easily and conveniently. |
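
The division and key index generation steps described in the abstract can be illustrated with a minimal sketch. The abstract does not say what the symbol-related information is or how the symbols of a group are arranged, so this example assumes the information is a glyph height, that a single reference value yields two groups, and that arranging means ordering by reading position. The names `Symbol`, `divide_symbols`, and `generate_key_index` are hypothetical and chosen only for illustration; text area identification and symbol extraction from the scanned image are not shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Symbol:
    text: str       # extracted symbol (here, a word)
    height: float   # ASSUMED symbol-related information: glyph height in pixels
    position: int   # ASSUMED reading order within the text area


def divide_symbols(symbols, reference_value):
    """Divide symbols into two groups around a preset reference value."""
    at_or_above = [s for s in symbols if s.height >= reference_value]
    below = [s for s in symbols if s.height < reference_value]
    return at_or_above, below


def generate_key_index(group):
    """Arrange one group of symbols in reading order to form a key index."""
    ordered = sorted(group, key=lambda s: s.position)
    return " ".join(s.text for s in ordered)


# Hypothetical symbols, as if already extracted from a scanned page.
symbols = [
    Symbol("prepared", 10.0, 2),
    Symbol("Quarterly", 24.0, 0),
    Symbol("by", 10.0, 3),
    Symbol("Report", 24.0, 1),
    Symbol("finance", 10.0, 4),
]

large, small = divide_symbols(symbols, reference_value=18.0)
key_index = generate_key_index(large)
print(key_index)  # Quarterly Report
```

Under these assumptions the larger symbols, which are likely to be a title, become the key index for the document. A different kind of symbol-related information or more than one reference value would change only `divide_symbols`.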