发明名称 IDENTIFICATION OF SAMPLE DATA ITEMS FOR RE-JUDGING
摘要 Described is a technology for identifying sample data items (e.g., documents corresponding to query-URL pairs) having the greatest likelihood of being mislabeled when previously judged, and selecting those data items for re-judging. In one aspect, lambda gradient scores (information associated with ranked sample data items that indicates a relative direction and how “strongly” to move each data item for lowering a ranking cost) are summed for pairs of sample data items to compute re-judgment scores for each of those sample data items. The re-judgment scores indicate a relative likelihood of mislabeling. Once the selected sample data items are re-judged, a new training set is available, whereby a new ranker may be trained.
申请公布号 US2010318540(A1) 申请公布日期 2010.12.16
申请号 US20090484256 申请日期 2009.06.15
申请人 MICROSOFT CORPORATION 发明人 SVORE KRYSTA M.;ABIB ELBIO RENATO TORRES;BURGES CHRISTOPHER J.C.;MIDDHA BHUVAN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址