发明名称 CONFIGURING WEB CRAWLER TO EXTRACT WEB PAGE INFORMATION
摘要 <p>Web crawling configuration includes: obtaining a webpage comprising a plurality of receiving a user selection of a node in the webpage; presenting a set of web crawling configuration options pertaining to a web crawling action to be performed with respect to the node, the set of web crawling configuration options depending at least in part on a type of an element included in the node and comprising: a first option to perform a first web crawling action in the event that the node include a first type of the element; and a second option to perform a second web crawling action in the event that the node includes a second type of the element; receiving a user input specifying the web crawling configuration option; and storing user specified web crawling configuration option, performing the web crawling action on the node according to the user input, or both.</p>
申请公布号 EP2734934(A1) 申请公布日期 2014.05.28
申请号 EP20120740840 申请日期 2012.07.19
申请人 ALIBABA GROUP HOLDING LIMITED 发明人 SUN, YIMING;QIANG, QI;CAI, BOYANG;JIN, XIAOJUN;WU, ZONGYUAN
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址