发明名称 System and method for identifying web communities from seed sets of web pages
摘要 An improved system and method is provided for identifying web communities from seed sets of web pages. A seed set of web pages may be represented as a set of seed vertices of a graph representing a collection of web pages. An initial probability distribution may be constructed on vertices of the graph by assigning a nonzero value to the vertices belonging to the seed set. Then a sequence of probability distributions may be produced on the vertices of the graph by modifying the probability distribution over a series of one-step walks of the probability distribution over the vertices of the graph. For each probability distribution produced in the sequence, level sets of vertices may be generated, and a level set with minimal conductance may be selected for each probability distribution. The level set with the least conductance may then be output representing a community of web pages.
申请公布号 US2008052263(A1) 申请公布日期 2008.02.28
申请号 US20060510412 申请日期 2006.08.24
申请人 YAHOO! INC. 发明人 ANDERSEN REID MARLOW;LANG KEVIN JOHN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址