Title of invention: System and method of analyzing web content
Abstract: A system and method are provided for identifying inappropriate content in websites on a network. Unrecognized uniform resource locators (URLs) or other web content are accessed by workstations and are identified as possibly having malicious content. The URLs or web content may be preprocessed within a gateway server module or some other software module to collect additional information related to the URLs. The URLs may be scanned for known attack signatures, and if any are found, they may be tagged as candidate URLs in need of further analysis by a classification module.
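The scan-and-tag step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the signature names, the regular expressions, and the `scan_content` function are all hypothetical, chosen only to show how content matching a known signature might be flagged as a candidate for a later classification stage.

```python
import re

# Hypothetical attack signatures (assumptions for illustration only;
# the patent record does not list any concrete signatures).
ATTACK_SIGNATURES = {
    "hidden_iframe": re.compile(
        r"""<iframe[^>]*(?:width|height)\s*=\s*["']?0""", re.IGNORECASE
    ),
    "eval_unescape": re.compile(r"eval\s*\(\s*unescape\s*\(", re.IGNORECASE),
}


def scan_content(url: str, content: str) -> dict:
    """Scan fetched content for known signatures and tag the URL.

    Returns a record with the names of all matching signatures and a
    `candidate` flag that is True when at least one signature matched,
    meaning the URL should go on to further analysis.
    """
    matched = [
        name
        for name, pattern in ATTACK_SIGNATURES.items()
        if pattern.search(content)
    ]
    return {"url": url, "signatures": matched, "candidate": bool(matched)}
```

A caller would pass each unrecognized URL and its fetched content to `scan_content` and forward only the records whose `candidate` flag is set.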
Publication number: US8978140(B2) Publication date: 2015.03.10
Application number: US201113164688 Filing date: 2011.06.20
Applicant: Websense, Inc. Inventors: Hubbard Dan; Verenini Nicholas Joseph; Baddour Victor Louie
Classification: G06F12/14; G06F17/30 Main classification: G06F12/14
Agency: Knobbe Martens Olson & Bear LLP Agent: Knobbe Martens Olson & Bear LLP
Principal claim: 1. A computer-implemented method of categorizing a uniform resource locator (URL) based on web content associated with the URL, the method comprising: identifying a first URL using a first collection method of a plurality of collection methods, wherein each of the plurality of collection methods is performed using at least one electronic processor; determining, using an electronic processor, whether the first URL contains a malicious data element; categorizing, using an electronic processor, the first URL in response to a determination that the first URL contains a malicious data element; in response to determining the first URL does not contain a malicious data element: assigning, using an electronic processor, a first categorization priority to the first URL based on the first URL being identified using the first collection method, and categorizing, using an electronic processor, the first URL based on the first categorization priority, wherein categorization of a URL comprises assigning a category to the URL based on a classification of at least one of web content or an Internet Protocol (IP) address identified by the URL; identifying a second URL using a second collection method, wherein the first collection method and the second collection method are different and each are one of a web crawler, a Domain Name Server (DNS) database, and a honey client; determining, using an electronic processor, whether the second URL contains a malicious data element; categorizing, using an electronic processor, the second URL in response to a determination that the second URL contains a malicious data element; in response to determining the second URL does not contain a malicious data element: assigning, using an electronic processor, a second categorization priority different than the first categorization priority based on the second URL having been identified using the second collection method, and categorizing, using an electronic processor, the second URL based on the second categorization priority.
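The control flow recited in the claim can be sketched as a small priority queue. This is a hedged illustration under stated assumptions, not the claimed implementation: the priority values in `METHOD_PRIORITY`, the `Categorizer` class, and the `is_malicious` and `classify` callables are hypothetical. The claim only requires that URLs found by different collection methods receive different categorization priorities, and that a URL containing a malicious data element is categorized without waiting on that priority.

```python
import heapq
from itertools import count
from typing import Callable, Dict, List

# Hypothetical priorities per collection method (lower is handled sooner).
# The actual ordering is an assumption; the claim only says they differ.
METHOD_PRIORITY = {"honey_client": 0, "web_crawler": 1, "dns_database": 2}


class Categorizer:
    def __init__(
        self,
        is_malicious: Callable[[str], bool],
        classify: Callable[[str], str],
    ) -> None:
        self._is_malicious = is_malicious  # hypothetical detector
        self._classify = classify          # hypothetical content classifier
        self._queue: List[tuple] = []
        self._order = count()              # tie-breaker keeps FIFO order
        self.categories: Dict[str, str] = {}

    def submit(self, url: str, method: str) -> None:
        """Handle one URL identified by the given collection method."""
        if self._is_malicious(url):
            # Malicious URLs are categorized immediately.
            self.categories[url] = "malicious"
            return
        # Otherwise the priority depends on how the URL was collected.
        priority = METHOD_PRIORITY[method]
        heapq.heappush(self._queue, (priority, next(self._order), url))

    def drain(self) -> List[str]:
        """Categorize queued URLs in priority order; return that order."""
        processed = []
        while self._queue:
            _, _, url = heapq.heappop(self._queue)
            self.categories[url] = self._classify(url)
            processed.append(url)
        return processed
```

In use, each collector (web crawler, DNS database, honey client) would call `submit` with its own method name, and a worker would call `drain` to categorize the non-malicious URLs in priority order.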
Address: San Diego, CA, US