Abstract |
<P>PROBLEM TO BE SOLVED: To eliminate unnecessary processing and improve analysis accuracy when extracting semantic information, by optimizing the selection and configuration of the analysis components that extract semantic information from image data according to the features of that image data. <P>SOLUTION: A semantic information analysis section 23 of this document processing apparatus 230 includes: a text area information calculation section 24 that calculates position information of the text areas in the image data; a feature extraction section 25 that extracts features of the image data on the basis of the result calculated by the text area information calculation section 24; a component formation section 26 that selects the analysis components to be applied on the basis of the extracted features and, when a plurality of analysis components is selected, determines the order in which they are applied; and an analysis execution section 27 that dynamically applies the selected modules to analyze the semantic information. <P>COPYRIGHT: (C)2009,JPO&INPIT |
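The four sections described in the abstract can be read as a small pipeline: locate text areas, derive features from them, select and order the applicable analysis components, then run only those. The following is a minimal sketch of that flow, not the apparatus's actual implementation. Everything in it is an assumption made for illustration: the page dictionary layout, the two features (`area_count`, `text_coverage`), the 0.3 coverage threshold, and the hypothetical `title` and `body` components. Text area detection is stubbed out by reading boxes that are already stored on the page.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) of one text area


@dataclass
class Features:
    area_count: int       # number of text areas found
    text_coverage: float  # fraction of the page covered by text areas


@dataclass
class Component:
    name: str
    order: int                            # lower values run first
    applies: Callable[[Features], bool]   # selection rule (assumed)
    run: Callable[[List[Box]], Dict[str, Box]]


def calculate_text_areas(page: dict) -> List[Box]:
    # Stand-in for section 24: a real system would detect these boxes.
    return list(page["text_areas"])


def extract_features(page: dict, areas: List[Box]) -> Features:
    # Stand-in for section 25: features come from the text area positions.
    page_area = page["width"] * page["height"]
    covered = sum(w * h for _, _, w, h in areas)
    return Features(len(areas), covered / page_area if page_area else 0.0)


# Hypothetical analysis components, listed in arbitrary order.
REGISTRY = [
    Component("body", 2, lambda f: f.text_coverage > 0.3,
              lambda areas: {"body": max(areas, key=lambda b: b[2] * b[3])}),
    Component("title", 1, lambda f: f.area_count > 0,
              lambda areas: {"title": min(areas, key=lambda b: b[1])}),
]


def form_components(features: Features,
                    registry: List[Component] = REGISTRY) -> List[Component]:
    # Stand-in for section 26: keep applicable components, then order them.
    selected = [c for c in registry if c.applies(features)]
    return sorted(selected, key=lambda c: c.order)


def analyze(page: dict) -> Dict[str, Box]:
    # Stand-in for section 27: run only the selected components, in order.
    areas = calculate_text_areas(page)
    features = extract_features(page, areas)
    result: Dict[str, Box] = {}
    for component in form_components(features):
        result.update(component.run(areas))
    return result


dense_page = {"width": 100, "height": 100,
              "text_areas": [(10, 5, 80, 10), (10, 20, 80, 60)]}
sparse_page = {"width": 100, "height": 100, "text_areas": [(10, 5, 20, 10)]}
```

In this sketch, `analyze(dense_page)` runs both components, while `analyze(sparse_page)` skips the `body` component because its selection rule fails. That skipping is the "unnecessary processing" the abstract aims to avoid; a page with no text areas runs no component at all.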