Title Apparatus and method for detecting forgery/falsification of homepage
Abstract An apparatus and method for detecting forgery/falsification of a homepage. The apparatus includes a homepage image shot generation module for generating homepage image shots of an entire screen of an accessed homepage. A character string extraction module extracts character strings from each homepage image shot using an OCR technique. A character string comparison module compares each of the extracted character strings with character strings required for determination of homepage forgery/falsification, thus determining whether the extracted character string is a normal character string or a falsified character string. A homepage falsification determination module determines whether the corresponding homepage has been forged/falsified, based on results of the comparison. A character string learning module learns the character string extracted from the homepage image shot, based on results of the determination, and classifies the character string as the normal character string or the falsified character string.
Publication No. US9323987(B2) Publication Date 2016.04.26
Application No. US201414467677 Filing Date 2014.08.25
Applicant ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE Inventors Lee Taek kyu;Kim Geun Yong;Lee Seok won;Choi Myeong Ryeol;Oh Hyung Geun;Sohn KiWook
Classification G06K9/62;G06K9/00;G06K9/34;G06K9/72 Main Classification G06K9/62
Agency LRK Patent Law Firm Agent LRK Patent Law Firm
Main Claim 1. An apparatus for detecting forgery/falsification of a homepage, comprising: one or more modules being configured and executed by a processor using algorithms associated with at least one non-transitory storage device, the one or more modules comprising, a homepage image shot generation module configured to generate homepage image shots of an entire screen of an accessed homepage; a character string extraction module configured to extract character strings from each homepage image shot using an Optical Character Recognition (OCR) technique; a character string comparison module configured to detect whether the extracted character string is a normal character string or a falsified character string for detecting homepage forgery/falsification by comparing each of the extracted character strings with character strings required for determination of homepage forgery/falsification; a homepage falsification determination module configured to determine whether the corresponding homepage has been forged/falsified according to the comparison; and a character string learning module configured to compare the character strings extracted using the homepage image shot, to determine whether newly detected character strings are normal character strings using normality determination reference character strings based on the comparison, classify the character strings based on the normal character strings or the falsified character strings according to the determination, register the character strings extracted from the corresponding homepage image shot using the OCR technique upon detection of a normal character string, and assign a weight to a character string repeatedly appearing with respect to the character strings extracted from previous image shots.
Address Daejeon KR
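
The comparison, determination, and learning steps named in the abstract and claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the character strings have already been extracted from an image shot by some OCR step, uses exact string matching, treats a page as falsified if any extracted string matches a known falsified string, and uses a simple appearance count as the weight. The names `StringStore`, `compare`, `is_falsified`, and `learn` are hypothetical and do not appear in the patent.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class StringStore:
    """Hypothetical store of reference strings for one homepage."""
    normal: set = field(default_factory=set)       # strings registered as normal
    falsified: set = field(default_factory=set)    # strings that signal falsification
    weights: Counter = field(default_factory=Counter)  # appearance count per string


def compare(strings, store):
    """Label each extracted string as 'normal', 'falsified', or 'unknown'."""
    labels = {}
    for s in strings:
        if s in store.falsified:
            labels[s] = "falsified"
        elif s in store.normal:
            labels[s] = "normal"
        else:
            labels[s] = "unknown"
    return labels


def is_falsified(labels):
    """Assumed rule: one falsified string is enough to flag the page."""
    return any(label == "falsified" for label in labels.values())


def learn(labels, store):
    """Register strings from a page judged normal; weight repeated ones."""
    if is_falsified(labels):
        return  # assumption: learn nothing from a page flagged as falsified
    for s in labels:
        store.normal.add(s)
        store.weights[s] += 1  # one count per image shot the string appears in


# Usage with made-up strings standing in for OCR output
store = StringStore(normal={"Welcome"}, falsified={"Hacked by"})
first = compare(["Welcome", "News"], store)
learn(first, store)
second = compare(["Welcome", "Hacked by"], store)
print(is_falsified(first), is_falsified(second))
```

Real OCR output is noisy, so a practical system would likely need fuzzy matching and a weight threshold before trusting a newly learned string. Both are left out here to keep the sketch short.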