发明名称 RANKING SEARCH RESULTS USING EDIT DISTANCE AND DOCUMENT INFORMATION
摘要 FIELD: information technology.SUBSTANCE: edit distance is employed in determining relevance of the document as result ranking by detecting near-matches of a whole query or part of the query. The edit distance evaluates how close the query string is to a given data stream that includes document information such as TAUC (title, anchor text, URL, clicks) information, etc. The architecture includes the index-time splitting of compound terms in the URL to allow the more effective discovery of query terms. Additionally, index-time filtering of anchor text is used to find the top N anchors of one or more of the document results. The TAUC information can be input to a neural network (e.g., 2-layer) to improve relevance metrics for ranking the search results.EFFECT: improved relevance of search results.19 cl, 12 dwg
申请公布号 RU2501078(C2) 申请公布日期 2013.12.10
申请号 RU20100141559 申请日期 2009.03.10
申请人 MAJKROSOFT KORPOREJSHN 发明人 TANKOVICH VLADIMIR;LI KHAN;MEJERZON DMITRIJ;SJUJ TSZJUN'
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址