发明名称 Efficient computation of document similarity
摘要 Systems, methodologies, media, and other embodiments associated with efficiently computing document similarity are described. One exemplary system embodiment includes logic to produce a gram from a string and logic to identify candidate documents based on identifying matches between query grams and document grams stored in an inverted index that relates grams to documents. The example system may also include logic to selectively partially reconstruct a candidate document from entries in the inverted index and logic to compute an edit distance between a string associated with a query and a string associated with the partially reconstructed candidate document. The example system may also include a signal logic configured to provide a signal corresponding to the edit distance.
申请公布号 US7610281(B2) 申请公布日期 2009.10.27
申请号 US20060606213 申请日期 2006.11.29
申请人 ORACLE INTERNATIONAL CORP. 发明人 GANDHI RIKIN;MATSUDA YASUHIRO;FAISAL MOHAMMAD
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址