发明名称 DOCUMENT PROCESSING APPARATUS, DOCUMENT PROCESSING METHOD AND PROGRAM OF DOCUMENT PROCESSING APPARATUS
摘要 <P>PROBLEM TO BE SOLVED: To omit a useless process and to improve analytical precision when extracting semantic information by optimizing selection and formation of an analysis component of extracting the semantic information of image data according to the features of the image data. <P>SOLUTION: A semantic information analysis section 23 of this document processing apparatus 230 includes: a text area information calculation section 24 for calculating position information of a text area in the image data; a feature extraction section 25 for extracting features of the image data on the basis of a calculation result in a text area information calculation; a component formation section 26 for selecting an analysis component to be applied on the basis of the extracted features, and determining an order to apply analysis components when selecting a plurality of analysis components; and an analysis execution section 27 for actually and dynamically applying a module to analyze semantic information. <P>COPYRIGHT: (C)2009,JPO&INPIT
申请公布号 JP2009110500(A) 申请公布日期 2009.05.21
申请号 JP20080199231 申请日期 2008.08.01
申请人 TOSHIBA CORP;TOSHIBA TEC CORP 发明人 FUJIWARA AKIHIKO
分类号 G06F17/21 主分类号 G06F17/21
代理机构 代理人
主权项
地址