发明名称 CRAWLING COMPUTER-BASED OBJECTS
摘要 Crawling computer-based objects by identifying a dependency between a first portion of a computer-based object set and a second portion of the computer-based object set, where the second portion is data-dependent on the first portion, and responsive to identifying the dependency, effecting a crawling of the first portion and thereafter a crawling of the second portion.
申请公布号 US2016179793(A1) 申请公布日期 2016.06.23
申请号 US201615068799 申请日期 2016.03.14
申请人 International Business Machines Corporation 发明人 Sperling Shahar;Tripp Omer
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A system for crawling computer-based objects, the system comprising: a dependency analyzer configured to identify a dependency between a first portion of a computer-based object set and a second portion of the computer-based object set, wherein the object set comprises a plurality of objects, wherein the first portion and second portion are separate JavaScript methods within an Hypertext Markup Language (HTML) encoded web page and the first and second portion includes executable computer software instructions, wherein the second portion is data-dependent on the first portion and identifying the dependency is based on performing a static analysis and a dependency analysis of the plurality of objects in the computer-based object set, wherein the dependency analyzer is configured to identify an operand of an operation that the second portion is configured to perform, and determine that a value of the operand is dependent upon an operation that the first portion is configured to perform; a crawling manager configured to, responsive to identifying the dependency, effect a crawling of the first portion and a crawling of the second portion, wherein the effecting comprises causes the computer software instruction to be executed, wherein the crawling manager effects the crawling by notifying a separate crawler that the first portion is to be crawled before the second portion is crawled, wherein the separate crawler performs the crawling of the first portion followed by the crawling of the second portion; and wherein the dependency analyzer and the crawling manager are implemented in a computer hardware.
地址 Armonk NY US