发明名称 |
DOCUMENT PROCESSING APPARATUS AND PROGRAM |
摘要 |
PROBLEM TO BE SOLVED: To increase the identification accuracy of identifying the title of a document according to document data computerizing the document. SOLUTION: A document processing apparatus has: a storage means storing syntactic data representing the syntax of character strings likely or unlikely to be titles of documents; an input means for inputting document data computerizing a document; an extraction means for analyzing the document data input into the input means to extract character string data representing character strings; a parsing means for analyzing each character string data extracted by the extraction means to identify the syntax of a character string described on the document corresponding to the document data for every character string; and a specifying means for specifying character string data representing the title of the document corresponding to the document data from the character string data extracted by the extraction means according to the specifying results by the parsing means and the storage contents of the storage means. COPYRIGHT: (C)2006,JPO&NCIPI
|
申请公布号 |
JP2006085582(A) |
申请公布日期 |
2006.03.30 |
申请号 |
JP20040271734 |
申请日期 |
2004.09.17 |
申请人 |
FUJI XEROX CO LTD |
发明人 |
MASUICHI HIROSHI;RYU TSUGUAKI;TAMUNE MICHIHIRO;TAGAWA MASATOSHI;TASHIRO KIYOSHI;ITO ATSUSHI;ISHIKAWA KYOSUKE;SATO NAOKO |
分类号 |
G06F17/21;G06F17/30;G06K9/20 |
主分类号 |
G06F17/21 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|