Title of invention Unsupervised learning tool for feature correction
Abstract Techniques for correcting miscategorized features excerpted from web pages are provided. For each of several categories and several pages on a particular web site, a separate feature may be excerpted from that page and associated with that page in relation to that category. Often, many of the "high confidence" features that have been associated with the same category are found to be associated with similar characteristics regardless of the pages from which those features were excerpted. Thus, a set of category characteristics, which are often found associated with the "high confidence" features in a particular category, may be determined. For each page, a candidate feature that is associated with the set of category characteristics may be identified in that page. If, in relation to the particular category, a feature other than the candidate feature is associated with that page, then that other feature may be replaced by the candidate feature.
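The correction step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Feature` type, the string encoding of characteristics (such as "tag:h1"), the confidence scores, and the `threshold` and `min_share` parameters are all assumptions made for illustration. The sketch derives the category characteristics from the high-confidence features, then, for each page, replaces the associated feature with a candidate that carries all of those characteristics.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    # Hypothetical representation: the excerpted text plus a set of
    # characteristics such as {"tag:h1", "class:title"}.
    text: str
    characteristics: frozenset


def category_characteristics(high_confidence, min_share=0.8):
    """Characteristics found on at least `min_share` of the given features."""
    counts = Counter(c for f in high_confidence for c in f.characteristics)
    needed = min_share * len(high_confidence)
    return frozenset(c for c, n in counts.items() if n >= needed)


def correct_features(assigned, candidates, confidence,
                     threshold=0.9, min_share=0.8):
    """Return a corrected copy of `assigned` (page -> Feature) for one category.

    assigned:   page -> feature currently associated with the category
    candidates: page -> list of all features excerpted from that page
    confidence: page -> confidence score of the assigned feature (assumed input)
    """
    trusted = [f for page, f in assigned.items()
               if confidence[page] >= threshold]
    if not trusted:
        return dict(assigned)
    profile = category_characteristics(trusted, min_share)
    if not profile:
        # No shared characteristics, so there is nothing to correct against.
        return dict(assigned)

    corrected = dict(assigned)
    for page, features in candidates.items():
        # First candidate on the page carrying every category characteristic.
        match = next((f for f in features
                      if profile <= f.characteristics), None)
        if match is not None and match != corrected.get(page):
            corrected[page] = match
    return corrected
```

Requiring a candidate to carry every category characteristic is one simple matching rule chosen for this sketch; a similarity score over characteristics would be an equally plausible reading of "associated with the set of category characteristics".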
Publication number US7483903(B2) Publication date 2009.01.27
Application number US20050253023 Filing date 2005.10.17
Applicant YAHOO! INC. Inventors KULKARNI PARASHURAM;RAJ BINU
Classification G06F7/00;G06F12/00;G06F17/00;G06F17/30 Main classification G06F7/00
Agency Agent
Main claim
Address