Title of invention: METHOD AND SYSTEM FOR PROCESSING CHARACTER FOR DOCUMENT RETRIEVAL
Abstract: PROBLEM TO BE SOLVED: To provide a character processing method and system for document retrieval that prepares indexes enabling full-text retrieval of documents containing ruby (kana printed alongside Chinese characters). SOLUTION: The character processing method and system for preparing indexes for document retrieval performs the steps of: acquiring, from an electronic document, text data with font type and size information attached; reading the text data one character string at a time; determining the character type of each read character string; storing each character string in the storage location assigned to its determined character type; and, once every character string has been classified and stored, arranging the storage locations in a prescribed sequence to prepare the indexes. COPYRIGHT: (C)2004,JPO
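The steps in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the character type is determined or what the prescribed sequence is, so everything specific here is an assumption made for illustration: the `Run` record, the `RUBY_RATIO` threshold, the rule that a kana-only string set much smaller than the body text is ruby, and the "base text first, ruby second" ordering. It is not the patented implementation.

```python
from dataclasses import dataclass
import unicodedata

RUBY_RATIO = 0.6  # assumed threshold: ruby is typically set at about half the body size


@dataclass
class Run:
    """One character string read from the document, with its font attributes."""
    text: str
    font: str
    size: float


def is_kana(text: str) -> bool:
    """True if every character is hiragana or katakana (judged by Unicode name)."""
    return bool(text) and all(
        "HIRAGANA" in unicodedata.name(ch, "") or "KATAKANA" in unicodedata.name(ch, "")
        for ch in text
    )


def classify(run: Run, body_size: float) -> str:
    """Assumed rule: a kana-only run much smaller than the body text is ruby."""
    if is_kana(run.text) and run.size <= body_size * RUBY_RATIO:
        return "ruby"
    return "base"


def sort_into_stores(runs, body_size):
    """Store each string in the storage place for its determined character type."""
    stores = {"base": [], "ruby": []}
    for run in runs:
        stores[classify(run, body_size)].append(run.text)
    return {kind: "".join(parts) for kind, parts in stores.items()}


def build_index_text(runs, body_size, order=("base", "ruby")):
    """After all runs are stored, arrange the stores in the (assumed) sequence."""
    stores = sort_into_stores(runs, body_size)
    return "\n".join(stores[kind] for kind in order)


# Hypothetical extracted runs: the ruby reading sits between the base strings.
runs = [
    Run("東京", "Mincho", 10.0),
    Run("とうきょう", "Mincho", 5.0),
    Run("へ行く", "Mincho", 10.0),
]
index_text = build_index_text(runs, body_size=10.0)
print("東京へ" in index_text)      # True: the base text is contiguous again
print("とうきょう" in index_text)  # True: the reading stays searchable
```

In the raw extraction order the text would read 東京とうきょうへ行く, so a query for 東京へ would miss. Keeping ruby in its own store is what lets both the base text and the reading be matched.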
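Publication number: JP2004013863(A)  Publication date: 2004.01.15
Application number: JP20020170768  Filing date: 2002.06.12
Applicant: DAINIPPON PRINTING CO LTD  Inventor: ITO TAKAKO;ISHII HIROAKI
Classification: G06F17/30;(IPC1-7):G06F17/30  Main classification: G06F17/30
Agency:  Agent:
Main claim:
Address: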