发明名称 |
METHOD AND SYSTEM FOR PROCESSING CHARACTER FOR DOCUMENT RETRIEVAL |
摘要 |
PROBLEM TO BE SOLVED: To provide a method and system for processing characters for document retrieval which prepares indexes by which full text retrieval of a document including Rubis (kanas placed alongside Chinese characters) can be performed. SOLUTION: This character processing method and system for preparing the indexes for document retrieval is provided, which performs the steps for: acquiring text data with font types and size information attached thereto from an electronic document; reading the text data in a series of character string units; determining a character type in a read character string; storing character strings in their own storage places on the basis of determined character types; and arranging the respective storage places according to a prescribed sequence to prepare indexes after all determination and storage are completed in the character strings. COPYRIGHT: (C)2004,JPO
|
申请公布号 |
JP2004013863(A) |
申请公布日期 |
2004.01.15 |
申请号 |
JP20020170768 |
申请日期 |
2002.06.12 |
申请人 |
DAINIPPON PRINTING CO LTD |
发明人 |
ITO TAKAKO;ISHII HIROAKI |
分类号 |
G06F17/30;(IPC1-7):G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|