Title of invention DOCUMENT PROCESSING DEVICE AND DOCUMENT PROCESSING METHOD
Abstract PROBLEM TO BE SOLVED: To improve the accuracy of information extraction from a document file. SOLUTION: Each word in a learning corpus 200 is classified into one of a plurality of classes. For each class, the document processing device 100 holds the identity of the words in the learning corpus 200 as class identity information in a class identity holding section 170. The document processing device 100 extracts a word from a pre-processing document 210 to be inspected, calculates, for each of the plurality of classes, the adaptation between the identity of the word in the pre-processing document 210 and the class identity information, adjusts the adaptation calculated for a predetermined class, and specifies the class corresponding to the extracted word based on the adaptation for each class. The name of the specified class is added as a tag, thereby creating a post-processing document 212 to be inspected. COPYRIGHT: (C)2008, JPO&INPIT
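The flow described in the abstract (compute an adaptation per class, adjust it for a predetermined class, pick a class, add a tag) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the feature set in `word_features`, the weighted-sum form of `adaptation`, the multiplicative form of the adjustment, the choice of the highest-scoring class, and the class names `PERSON` and `OTHER`.

```python
from typing import Dict, List

# Hypothetical stand-in for the "class identity information":
# for each class name, a mapping from feature string to weight.
ClassIdentities = Dict[str, Dict[str, float]]


def word_features(words: List[str], i: int) -> List[str]:
    """Assumed 'identity' of the word at position i: its surface form,
    the preceding word, and whether it starts with a capital letter."""
    prev_word = words[i - 1] if i > 0 else "<s>"
    return [
        f"self={words[i]}",
        f"prev={prev_word}",
        f"cap={words[i][:1].isupper()}",
    ]


def adaptation(features: List[str], identity: Dict[str, float]) -> float:
    """Assumed scoring rule: sum the class weights of the word's features."""
    return sum(identity.get(f, 0.0) for f in features)


def specify_class(words: List[str], i: int,
                  identities: ClassIdentities,
                  adjustment: Dict[str, float]) -> str:
    """Score every class, scale the score of each 'predetermined' class
    listed in `adjustment`, and return the highest-scoring class name."""
    feats = word_features(words, i)
    scores = {
        name: adaptation(feats, identity) * adjustment.get(name, 1.0)
        for name, identity in identities.items()
    }
    return max(scores, key=scores.get)


def tag_document(words: List[str], identities: ClassIdentities,
                 adjustment: Dict[str, float],
                 untagged: str = "OTHER") -> str:
    """Wrap each word in a tag named after its class; words assigned to
    the `untagged` class are left bare."""
    out = []
    for i, word in enumerate(words):
        cls = specify_class(words, i, identities, adjustment)
        out.append(word if cls == untagged else f"<{cls}>{word}</{cls}>")
    return " ".join(out)


# Toy class identity information (invented values, not learned from a corpus).
identities = {
    "PERSON": {"prev=Mr.": 2.0, "cap=True": 0.5},
    "OTHER": {"cap=True": 1.0, "cap=False": 1.0},
}
tagged = tag_document(["Mr.", "Smith", "arrived"], identities, {"OTHER": 0.8})
print(tagged)
```

In this sketch, lowering the factor for `OTHER` makes the tagger more willing to assign a named class, which is one plausible reading of why the adaptation of a predetermined class would be adjusted; the abstract does not state the direction or form of the adjustment.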
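Publication number JP2007304950(A) Publication date 2007.11.22
Application number JP20060133828 Filing date 2006.05.12
Applicant JUST SYST CORP Inventor KASHIMOTO SEIJI
Classification G06F17/28;G06F17/30 Main classification G06F17/28
Agency Agent
Principal claim
Address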