发明名称 DOCUMENT PROCESSING APPARATUS AND PROGRAM
摘要 PROBLEM TO BE SOLVED: To increase the identification accuracy of identifying the title of a document according to document data computerizing the document. SOLUTION: A document processing apparatus has: a storage means storing syntactic data representing the syntax of character strings likely or unlikely to be titles of documents; an input means for inputting document data computerizing a document; an extraction means for analyzing the document data input into the input means to extract character string data representing character strings; a parsing means for analyzing each character string data extracted by the extraction means to identify the syntax of a character string described on the document corresponding to the document data for every character string; and a specifying means for specifying character string data representing the title of the document corresponding to the document data from the character string data extracted by the extraction means according to the specifying results by the parsing means and the storage contents of the storage means. COPYRIGHT: (C)2006,JPO&NCIPI
申请公布号 JP2006085582(A) 申请公布日期 2006.03.30
申请号 JP20040271734 申请日期 2004.09.17
申请人 FUJI XEROX CO LTD 发明人 MASUICHI HIROSHI;RYU TSUGUAKI;TAMUNE MICHIHIRO;TAGAWA MASATOSHI;TASHIRO KIYOSHI;ITO ATSUSHI;ISHIKAWA KYOSUKE;SATO NAOKO
分类号 G06F17/21;G06F17/30;G06K9/20 主分类号 G06F17/21
代理机构 代理人
主权项
地址
您可能感兴趣的专利