发明名称 Sensitivity Categorization of Web Pages
摘要 Methods, systems, and computer programs for categorizing the sensitivity of web pages are presented. In one method, a space of sensitive pages is identified based on the sensitivity categorization of a first plurality of web pages and a second plurality of web pages. The first plurality of web pages is obtained by performing search queries using known sensitive words, and the second plurality of web pages includes randomly selected web pages. Additionally, the method identifies a third plurality of web pages that includes web pages on or near the boundary between the space of sensitive pages and the space of non-sensitive pages. The space of sensitive pages is then redefined based on the sensitivity categorization of the first, second, and third pluralities of web pages. Once the space of sensitive pages is defined, the method is used to determine that a given web page is sensitive when the given web page is in the space of sensitive pages. Web pages are included in a marketing operation when the web pages are not sensitive.
申请公布号 US2011184817(A1) 申请公布日期 2011.07.28
申请号 US20100696006 申请日期 2010.01.28
申请人 YAHOO!, INC. 发明人 YANKOV DRAGOMIR;RAJAN SUJU;GAFFNEY SCOTT J.;LIU DUN;PANG WANLIN;RATNAPARKHI ADWAIT
分类号 G06Q30/00;G06F15/18;G06F17/00;G06F17/30 主分类号 G06Q30/00
代理机构 代理人
主权项
地址