Title of invention: SYSTEMS AND METHODS FOR SPAM DETECTION USING CHARACTER HISTOGRAMS
Abstract: The described spam detection techniques, which include string identification, pre-filtering, and character histogram and timestamp comparison steps, facilitate accurate, computationally efficient detection of rapidly changing spam arriving in short-lasting waves. In some embodiments, a computer system extracts a target character string from an electronic communication such as a blog comment, transmits it to an anti-spam server, and receives from the anti-spam server an indicator of whether the respective electronic communication is spam or non-spam. The anti-spam server determines whether the electronic communication is spam or non-spam according to certain features of the character histogram of the target string. Some embodiments also perform an unsupervised clustering of incoming target strings into clusters, wherein all members of a cluster have similar character histograms.
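The histogram comparison and clustering mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the patented method: the distance metric (normalized Manhattan distance), the greedy first-match clustering, the 0.2 threshold, and the names `char_histogram`, `histogram_distance`, and `assign_to_cluster` are all assumptions made for illustration. The pre-filtering and timestamp comparison steps are not covered.

```python
from collections import Counter


def char_histogram(text: str) -> Counter:
    """Count how often each character occurs in the target string."""
    return Counter(text)


def histogram_distance(a: Counter, b: Counter) -> float:
    """Normalized Manhattan distance in [0, 1] (an assumed metric, not from the source)."""
    total = sum(a.values()) + sum(b.values())
    if total == 0:
        return 0.0
    # Counter returns 0 for characters missing from one of the histograms.
    diff = sum(abs(a[ch] - b[ch]) for ch in a.keys() | b.keys())
    return diff / total


def assign_to_cluster(target: str, clusters: list, threshold: float = 0.2) -> int:
    """Greedy clustering (hypothetical): join the first cluster whose first
    member has a similar histogram, otherwise start a new cluster.
    Returns the index of the cluster the target string was placed in."""
    hist = char_histogram(target)
    for index, members in enumerate(clusters):
        if histogram_distance(hist, char_histogram(members[0])) <= threshold:
            members.append(target)
            return index
    clusters.append([target])
    return len(clusters) - 1
```

With these assumed settings, two lightly obfuscated variants such as "buy cheap pills now" and "buy che4p pills n0w" land in the same cluster, because swapping a few characters changes the histogram only slightly, while an unrelated comment starts a new cluster.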
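Publication number: HK1198850(A1) Publication date: 2015.06.12
Application number: HK20140112331 Filing date: 2014.12.08
Applicant: Inventors: DICHIU, DANIEL; LUPSESCU, Z. LUCIAN Z
Classification: H04L Main classification: H04L
Agency: Agent:
Main claim:
Address: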