Title: System and method for identifying text-based SPAM in images using grey-scale transformation
Abstract: A system, method and computer program product for identifying spam in an image using a grey-scale representation of the image, including: identifying a plurality of contours in the image, the contours corresponding to probable symbols (letters, numbers, punctuation signs, etc.); ignoring contours that are too small or too large given the specified limits; identifying text lines in the image based on the remaining contours; parsing the text lines into words; ignoring words in the identified text lines that are too short or too long; ignoring text lines that are too short; verifying that the image contains text by comparing the number of pixels of the symbol color within the remaining contours to the total number of pixels of the symbol color in the image; and, if the image contains text, rendering a spam/no-spam verdict based on comparing a signature of the remaining text against a spam template.
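The filtering steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes contours have already been extracted and reduced to bounding boxes, and every name and threshold in it (`Box`, `min_side`, `max_gap`, `min_word`, `threshold`, and so on) is hypothetical and chosen only for illustration. The signature comparison against a spam template is left out, since the abstract does not say how the signature is built.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Bounding box of one contour (a probable symbol), in pixels."""
    x: int
    y: int
    w: int
    h: int


def filter_by_size(boxes, min_side=3, max_side=40):
    """Drop contours whose width or height falls outside the assumed limits."""
    return [b for b in boxes
            if min_side <= b.w <= max_side and min_side <= b.h <= max_side]


def group_into_lines(boxes):
    """Group boxes whose vertical extents overlap into one text line."""
    lines = []
    for b in sorted(boxes, key=lambda b: (b.y, b.x)):
        for line in lines:
            top = min(o.y for o in line)
            bottom = max(o.y + o.h for o in line)
            if b.y < bottom and b.y + b.h > top:
                line.append(b)
                break
        else:
            # No existing line overlaps this box vertically: start a new line.
            lines.append([b])
    return [sorted(line, key=lambda b: b.x) for line in lines]


def split_into_words(line, max_gap=4):
    """Split a left-to-right line into words at horizontal gaps > max_gap."""
    words = [[line[0]]]
    for prev, cur in zip(line, line[1:]):
        if cur.x - (prev.x + prev.w) > max_gap:
            words.append([cur])
        else:
            words[-1].append(cur)
    return words


def extract_text_lines(boxes, min_word=2, max_word=15, min_line_symbols=4):
    """Apply the size, word-length and line-length filters in order."""
    result = []
    for line in group_into_lines(filter_by_size(boxes)):
        words = [w for w in split_into_words(line)
                 if min_word <= len(w) <= max_word]
        if sum(len(w) for w in words) >= min_line_symbols:
            result.append(words)
    return result


def looks_like_text(pixels_in_contours, pixels_total, threshold=0.5):
    """Ratio test: share of symbol-colored pixels inside the kept contours."""
    if pixels_total == 0:
        return False
    return pixels_in_contours / pixels_total >= threshold
```

In this sketch, `extract_text_lines` returns a list of lines, each a list of words, each a list of `Box` objects. `looks_like_text` takes the two pixel counts as plain integers, so counting the symbol-colored pixels in the grey-scale image is left to the caller.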
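Publication number: US7711192(B1)  Publication date: 2010.05.04
Application number: US20090498196  Filing date: 2009.07.06
Applicant:  Inventor: SMIRNOV EVGENY P.
Classification: G06K9/18  Main classification: G06K9/18
Agency:  Agent:
Main claim:
Address: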