发明名称 |
Apparatus and method for detecting spam |
摘要 |
Provided is a process of detecting spam in websites, the process including: obtaining text from a website; detecting an amount of transitions between character sets in the text, wherein the character sets each correspond to different alphabets; calculating, with a computer, a score indicative of the likelihood that the text is spam based on the amount of transitions; and labeling the text as spam based on the score. |
申请公布号 |
US9465789(B1) |
申请公布日期 |
2016.10.11 |
申请号 |
US201313851199 |
申请日期 |
2013.03.27 |
申请人 |
GOOGLE INC. |
发明人 |
Chen Roger;Frew Kevin Thomas |
分类号 |
G06F15/16;G06F17/27 |
主分类号 |
G06F15/16 |
代理机构 |
Middleton Reutlinger |
代理人 |
Middleton Reutlinger |
主权项 |
1. A method of detecting spam in web sites, the method comprising:
obtaining a text entry; detecting a number of character transitions between character sets in the text entry, wherein the character sets (i) each correspond to different alphabets and (ii) are different subsets of a character encoding that maps characters from multiple alphabets to respective binary numbers for use by computers; calculating, with a computer, a score indicative of the likelihood that the text entry is spam based on the number of character transitions; and labeling the text entry as spam based on the score. |
地址 |
Mountain View CA US |