发明名称 |
Link spam detection using smooth classification function |
摘要 |
A spam detection system is disclosed. The system includes a classifier training component that receives a first set of training pages labeled as normal pages and a second set of training pages labeled as spam pages. The training component trains a web page classifier based on both the first set of training pages and the second set of training pages. A spam detector then receives unlabeled web pages uses the web page classifier to classify the unlabeled web pages as spam pages or normal pages. |
申请公布号 |
US8805754(B2) |
申请公布日期 |
2014.08.12 |
申请号 |
US201313921862 |
申请日期 |
2013.06.19 |
申请人 |
Microsoft Corporation |
发明人 |
Zhou Dengyong;Burges Christopher;Tao Tao |
分类号 |
G06F15/18 |
主分类号 |
G06F15/18 |
代理机构 |
|
代理人 |
Choi Dan;Boelitz Carole;Minhas Micky |
主权项 |
1. A spam detection system, comprising:
a classifier training component that receives a first set of training pages labeled as normal pages and a second set of training pages labeled as spam pages, wherein the classifier training component trains a web page classifier based on both the first set of training pages and the second set of training pages; and a spam detector that receives unlabeled web pages and, utilizing a computer processor, applies the web page classifier so as to classify the unlabeled web pages as either spam pages or normal pages. |
地址 |
Redmond WA US |