Title of invention SYSTEM AND METHOD FOR INDEXING WEB CONTENT USING CLICK-THROUGH FEATURES
Abstract The system and method of the present invention allow for the determination of the relevance of a content item to a query through the use of a machine-learned relevance function that incorporates click-through features of the content items. A method for selecting a relevance function to determine a relevance of a query-content item pair comprises generating a training set having one or more query-URL pairs labeled for relevance based on their click-through features. The labeled query-URL pairs are used to determine the relevance function by minimizing a loss function that accounts for click-through features of the content item. The computed relevance function is then applied to the click-through features of unlabeled content items to assign relevance scores thereto. An inverted click-through index of query-score pairs is formed and combined with the content index to improve relevance of search results.
Publication number WO2007127676(A1) Publication date 2007.11.08
Application number WO2007US67075 Filing date 2007.04.20
Applicant YAHOO!, INC.;SUN, GORDON;ZHENG, ZHAOHUI Inventor SUN, GORDON;ZHENG, ZHAOHUI
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Main claim
Address
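
The pipeline summarized in the abstract (labeled query-URL pairs, a relevance function fit by minimizing a loss, scoring of unlabeled pairs, and an inverted click-through index) can be illustrated with a small sketch. Nothing below is taken from the application itself: the linear model, the squared loss, the gradient-descent fit, the two feature names, the example queries and URLs, and the choice to key the index by URL are all assumptions made for illustration only.

```python
from collections import defaultdict

# Hypothetical click-through features per query-URL pair:
# [click-through rate, fraction of clicks with a long dwell time].
# Labels are hypothetical relevance grades in [0, 1].
LABELED = [
    ([0.90, 0.80], 1.0),
    ([0.70, 0.60], 0.8),
    ([0.20, 0.10], 0.2),
    ([0.05, 0.05], 0.0),
]

# Unlabeled query-URL pairs with the same (assumed) feature layout.
UNLABELED = {
    ("jaguar speed", "example.com/cats"): [0.80, 0.70],
    ("jaguar speed", "example.com/cars"): [0.125, 0.075],
    ("big cats", "example.com/cats"): [0.45, 0.35],
}


def fit_linear_relevance(examples, lr=0.1, epochs=2000):
    """Fit score = bias + w . x by batch gradient descent on mean squared loss.

    The abstract only says a loss function is minimized; squared loss and a
    linear model are stand-ins chosen for brevity.
    """
    n_features = len(examples[0][0])
    weights = [0.0] * n_features
    bias = 0.0
    m = len(examples)
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for features, label in examples:
            pred = bias + sum(w * x for w, x in zip(weights, features))
            err = pred - label
            for i, x in enumerate(features):
                grad_w[i] += err * x
            grad_b += err
        weights = [w - lr * g / m for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / m
    return weights, bias


def score(model, features):
    """Apply the fitted relevance function to one feature vector."""
    weights, bias = model
    return bias + sum(w * x for w, x in zip(weights, features))


def build_click_index(model, unlabeled):
    """Map each URL to its (query, score) pairs, highest score first."""
    index = defaultdict(list)
    for (query, url), features in unlabeled.items():
        index[url].append((query, score(model, features)))
    for pairs in index.values():
        pairs.sort(key=lambda pair: pair[1], reverse=True)
    return dict(index)


if __name__ == "__main__":
    model = fit_linear_relevance(LABELED)
    for url, pairs in build_click_index(model, UNLABELED).items():
        print(url, [(q, round(s, 3)) for q, s in pairs])
```

In this sketch the resulting index would be consulted alongside a conventional content index at query time; how the two are combined is not specified in the abstract and is left out here.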