发明名称 METHOD FOR RETRIEVING DOCUMENT
摘要 PROBLEM TO BE SOLVED: To easily and fast retrieve a document including a designated character string from a registered document group. SOLUTION: This document retrieving method comprises a text dividing means for disassembling text being a registered document or a retrieval character string into n-grams (n character set) and words, an n-gram index for holding appearance information about n-grams in the registered document in each n- gram, a word boundary index for holding appearance information about a word boundary in the registered document, a character string unit retrieving means for retrieving a document including the retrieval character string or an appearance position in the document by referring to the n-gram index on the basis of results obtained by dividing the retrieval character string to the n-grams, and a word unit retrieving means for deciding whether the retrieval character string appears as a word by referring to the word boundary index on the basis of results obtained by dividing the retrieval character string into words with respect to results of the character string unit retrieving means and retrieving a document including the retrieval character string as a word.
申请公布号 JP2002269139(A) 申请公布日期 2002.09.20
申请号 JP20010064404 申请日期 2001.03.08
申请人 RICOH CO LTD 发明人 OGAWA YASUTSUGU
分类号 G06F17/30;(IPC1-7):G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址