Title of invention: Detecting image spam
Abstract: Methods and systems for operation upon one or more data processors for detecting image spam by detecting an image and analyzing the content of the image to determine whether the incoming communication comprises an unwanted communication.
Publication number: US8763114(B2) Publication date: 2014.06.24
Application number: US200711626568 Filing date: 2007.01.24
Applicant: McAfee, Inc. Inventors: Alperovitch Dmitri; Black Nick; Gould Jeremy; Judge Paul; Krasser Sven; Schneck Phyllis Adele; Tang Yuchun; Trivedi Aarjav Jyotindra Neeta; Willis Lamar Lorenzo; Yang Weilai; Zdziarski Jonathan Alexander
Classification: G08B23/00; G06K9/62; H04L29/06; G06F15/16 Main classification: G08B23/00
Agency: Patent Capital Group Agent: Patent Capital Group
Principal claim: 1. A computer implemented method comprising:
  receiving an incoming communication associated with a particular entity;
  identifying that the communication contains one or more images;
  determining whether one or more of the images includes a graphic encoding of a textual spam message, wherein the determining includes, for each of the one or more images:
    normalizing the image to generate a normalized image, wherein the normalized image is a representation of the image with at least some noise removed from the image and the normalized image comprises overlapping sub-regions having image data;
    determining whether the image includes graphical encoding of text corresponding to text in known spam;
    determining whether aspect ratio of the image corresponds to a known aspect ratio corresponding to known spam;
    generating fingerprints of the image data from the overlapping sub-regions of the normalized image, wherein the fingerprints specify attributes of the normalized image;
    comparing the fingerprints from the normalized image with fingerprints of known spam images, wherein the comparison comprises for at least one of the fingerprints:
      determining a first measure that the at least one of the fingerprints is similar to at least one fingerprint of a known spam image;
      determining a second measure that the at least one of the fingerprints is similar to at least one fingerprint of a known non-spam image; and
    classifying the image as a spam image or a non-spam image based at least in part on the determination of whether the image includes graphical encoding of text corresponding to text in known spam, the determination of whether the aspect ratio of the image corresponds to a known aspect ratio corresponding to known spam, and the comparison of the fingerprints; and
  updating a reputation score for the particular entity based at least in part upon a result of the classification, wherein the reputation score for the particular entity is further based on a strength of relationship between the particular entity and a first entity having a reputable reputation score and the strength of relationship is based on similarities between content in messages sent by the particular entity and content of messages sent by the first entity.
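The steps recited in claim 1 can be sketched in code. The Python below is a minimal illustration only and is not the patented implementation. The quantization-based normalization, the SHA-1 hashes over sliding windows, the two-of-three vote, and the fixed-step reputation update are all assumptions chosen for brevity, and every function name and parameter is hypothetical. The text determination is taken as a precomputed boolean, and the relationship-strength component of the reputation score is not modeled.

```python
import hashlib
from typing import List, Sequence, Set

Image = List[List[int]]  # grayscale rows, pixel values 0-255 (assumed representation)


def normalize(image: Image, levels: int = 4) -> Image:
    """Quantize pixels to a few levels so small noise disappears (assumed denoising)."""
    step = 256 // levels
    return [[pixel // step for pixel in row] for row in image]


def subregion_fingerprints(image: Image, size: int = 4, stride: int = 2) -> Set[str]:
    """Hash every size x size window; a stride smaller than size makes windows overlap."""
    height, width = len(image), len(image[0])
    prints: Set[str] = set()
    for top in range(0, height - size + 1, stride):
        for left in range(0, width - size + 1, stride):
            window = bytes(
                image[r][c]
                for r in range(top, top + size)
                for c in range(left, left + size)
            )
            prints.add(hashlib.sha1(window).hexdigest())
    return prints


def similarity(prints: Set[str], known: Set[str]) -> float:
    """Fraction of this image's fingerprints that also occur in a known set."""
    return len(prints & known) / len(prints) if prints else 0.0


def aspect_ratio_matches(image: Image, known_ratios: Sequence[float],
                         tolerance: float = 0.05) -> bool:
    """Compare width/height against aspect ratios previously seen in spam."""
    ratio = len(image[0]) / len(image)
    return any(abs(ratio - known) <= tolerance for known in known_ratios)


def classify(image: Image, spam_prints: Set[str], ham_prints: Set[str],
             known_ratios: Sequence[float], text_matches_spam: bool) -> bool:
    """Return True for spam when at least two of the three signals agree (assumed rule)."""
    prints = subregion_fingerprints(normalize(image))
    spam_measure = similarity(prints, spam_prints)  # "first measure" in the claim
    ham_measure = similarity(prints, ham_prints)    # "second measure" in the claim
    votes = int(spam_measure > ham_measure)
    votes += int(aspect_ratio_matches(image, known_ratios))
    votes += int(text_matches_spam)
    return votes >= 2


def update_reputation(score: float, is_spam: bool, step: float = 0.1) -> float:
    """Move a score in [0, 1] down for spam and up otherwise (assumed update rule)."""
    score += -step if is_spam else step
    return min(1.0, max(0.0, score))
```

In this sketch, a sender's image is classified against fingerprint sets built from previously labeled images, and the boolean result is passed to `update_reputation` for that sender. Because the windows overlap, a small edit to one part of an image changes only the fingerprints that cover it, so the remaining fingerprints can still match a known spam image.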
Address: Santa Clara, CA, US