Title of invention: A SEARCH SYSTEM AND METHOD FOR RETRIEVAL OF DATA, AND THE USE THEREOF IN A SEARCH ENGINE
Abstract: A search system for information retrieval comprises: a data structure in the form of a non-evenly spaced sparse suffix tree for storing suffixes of words and/or symbols, or sequences thereof, in a text T; a metric M that combines edit distance metrics for an approximate degree of matching between words and/or symbols, or between sequences thereof, in the text T and a query Q, the latter (sequence-level) distance metric including weighting cost functions for the edit operations which transform a sequence S of the text into a sequence P of the query Q; and search algorithms for determining the degree of matching between words and/or symbols, or between sequences thereof, in the text T and the query Q, such that information R is retrieved with a specified degree of matching with the query Q. Optionally, the search system also comprises algorithms for determining exact matching, such that information R may be retrieved with an exact degree of matching with the query Q. A method in the search system comprises generating the data structure as a word-spaced sparse suffix tree, storing sequence information of the words in the text T in the generated suffix tree, generating a combined edit distance metric for words or sequences thereof in the text T and a query word q or sequences thereof in the query Q, including word-weighting cost functions for the sequence-transforming edit operations, and determining the degree of matching between retrieved information R and a query Q. Use in an approximate search engine.
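The abstract's two ingredients, a word-spaced suffix structure and a combined word/sequence edit distance with weighted edit operations, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the names `char_distance`, `word_sub_cost`, `build_word_suffix_trie`, and `approx_search`, the unit insert/delete costs, the normalised substitution weight, and the use of a depth-limited word trie in place of a true sparse suffix tree.

```python
from typing import Dict, List, Sequence

INS = DEL = 1.0  # assumed unit cost for inserting/deleting a whole word


def char_distance(a: str, b: str) -> float:
    """Unit-cost character edit distance between two words."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return float(prev[-1])


def word_sub_cost(w: str, q: str) -> float:
    """Assumed weighting: character distance normalised to [0, 1]."""
    return 0.0 if w == q else char_distance(w, q) / max(len(w), len(q))


class Node:
    def __init__(self) -> None:
        self.children: Dict[str, "Node"] = {}
        self.starts: List[int] = []  # word positions whose suffix passes here


def build_word_suffix_trie(words: Sequence[str], depth: int) -> Node:
    """Index only suffixes that begin at word boundaries, up to `depth` words."""
    root = Node()
    for start in range(len(words)):
        node = root
        for w in words[start:start + depth]:
            node = node.children.setdefault(w, Node())
            node.starts.append(start)
    return root


def approx_search(root: Node, query: Sequence[str], k: float) -> Dict[int, float]:
    """Return {start position: best cost} for text sequences within cost k."""
    n = len(query)
    hits: Dict[int, float] = {}

    def visit(node: Node, row: List[float]) -> None:
        for w, child in node.children.items():
            new = [row[0] + DEL]  # text word deleted before any query word
            for j in range(1, n + 1):
                new.append(min(row[j] + DEL,
                               new[j - 1] + INS,
                               row[j - 1] + word_sub_cost(w, query[j - 1])))
            if new[n] <= k:
                for s in child.starts:
                    hits[s] = min(hits.get(s, new[n]), new[n])
            if min(new) <= k:  # prune branches that can no longer match
                visit(child, new)

    visit(root, [j * INS for j in range(n + 1)])
    return hits
```

For example, indexing `"the quick brown fox jumps over the lazy dog".split()` with `depth=4` and searching for `["quick", "brwn", "fox"]` with `k=0.5` finds the sequence starting at word position 1, because only the misspelled middle word contributes a small substitution cost.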
Publication number: CA2337079(C) Publication date: 2006.07.04
Application number: CA19992337079 Filing date: 1999.07.09
Applicant: FAST SEARCH & TRANSFER ASA Inventor: RISVIK, KNUT MAGNE
Classification: G06F17/30 Main classification: G06F17/30
Agency: Agent:
Principal claim:
Address: