Title of invention |
LINK SPAM DETECTION USING SMOOTH CLASSIFICATION FUNCTION |
Abstract |
A collection of web pages is modeled as a directed graph in which the pages are nodes and the hyperlinks between them are directed edges. A trusted entity identifies training examples of spam pages and normal pages. A random walk is conducted through the directed graph, and the stationary probabilities of the nodes and the transition probabilities among them are obtained. A classifier training component estimates a classification function that changes slowly on densely connected subgraphs within the directed graph. The classification function assigns a value to each node in the directed graph, and each node is identified as a spam page or a normal page based upon whether its value meets a given threshold value. |
Publication number |
WO2008137360(A1) |
Publication date |
2008.11.13 |
Application number |
WO2008US61637 |
Filing date |
2008.04.25 |
Applicant |
MICROSOFT CORPORATION |
Inventors |
ZHOU, DENGYONG;BURGES, CHRISTOPHER J.C.;TAO, TAO |
Classification |
G06K13/00;H04L12/66 |
Main classification |
G06K13/00 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|
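
The pipeline described in the abstract (random walk, stationary and transition probabilities, a classification function that varies slowly on densely connected subgraphs, then a threshold) can be illustrated with a minimal sketch. The abstract does not give the objective, so everything specific here is an assumption: the smoothness term is taken to be the sum over edges of pi(u) p(u,v) (f(u) - f(v))^2, the fit to the training examples is a squared loss weighted by a hypothetical constant `C`, labels are +1 for normal and -1 for spam, a small uniform `teleport` keeps the walk irreducible, and the threshold is 0. All function names are illustrative and not taken from the patent.

```python
def transition_matrix(edges, n, teleport=0.01):
    """Row-stochastic random-walk matrix; the uniform teleport is an assumption."""
    counts = [[0.0] * n for _ in range(n)]
    for u, v in edges:
        counts[u][v] += 1.0
    P = []
    for row in counts:
        deg = sum(row)
        # A page without out-links jumps uniformly (assumption).
        walk = [x / deg for x in row] if deg else [1.0 / n] * n
        P.append([(1 - teleport) * w + teleport / n for w in walk])
    return P


def stationary(P, iters=1000, tol=1e-12):
    """Stationary distribution of the walk by power iteration."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        nxt = [sum(pi[u] * P[u][v] for u in range(n)) for v in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    return pi


def solve(A, b):
    """Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            factor = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= factor * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - tail) / M[r][r]
    return x


def classify(edges, n, labels, C=1.0, threshold=0.0):
    """Estimate a smooth f on the graph and threshold it (assumed objective)."""
    P = transition_matrix(edges, n)
    pi = stationary(P)
    # Directed-graph Laplacian: Pi - (Pi P + P^T Pi) / 2.
    A = [[-(pi[u] * P[u][v] + pi[v] * P[v][u]) / 2 for v in range(n)]
         for u in range(n)]
    for u in range(n):
        A[u][u] += pi[u]
    b = [0.0] * n
    for u, y in labels.items():  # squared-loss fit on the training examples
        A[u][u] += C
        b[u] = C * y
    f = solve(A, b)
    return f, ["spam" if x < threshold else "normal" for x in f]


# Toy graph: pages 0-2 link densely to each other, as do pages 3-5;
# a single pair of links (2 <-> 3) joins the two groups.
edges = [(0, 1), (1, 0), (0, 2), (2, 0), (1, 2), (2, 1),
         (3, 4), (4, 3), (3, 5), (5, 3), (4, 5), (5, 4),
         (2, 3), (3, 2)]
# Training examples: page 0 is normal (+1), page 5 is spam (-1).
f, tags = classify(edges, 6, {0: 1.0, 5: -1.0})
```

With one labeled page per group, the unlabeled pages take the sign of the labeled page in their own densely connected group, which is the behavior the abstract attributes to a slowly changing classification function. The dense solver is only suitable for toy graphs; a web-scale graph would need sparse, iterative methods.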