发明名称 |
CONFIGURING WEB CRAWLER TO EXTRACT WEB PAGE INFORMATION |
摘要 |
<p>Web crawling configuration includes: obtaining a webpage comprising a plurality of receiving a user selection of a node in the webpage; presenting a set of web crawling configuration options pertaining to a web crawling action to be performed with respect to the node, the set of web crawling configuration options depending at least in part on a type of an element included in the node and comprising: a first option to perform a first web crawling action in the event that the node include a first type of the element; and a second option to perform a second web crawling action in the event that the node includes a second type of the element; receiving a user input specifying the web crawling configuration option; and storing user specified web crawling configuration option, performing the web crawling action on the node according to the user input, or both.</p> |
申请公布号 |
EP2734934(A1) |
申请公布日期 |
2014.05.28 |
申请号 |
EP20120740840 |
申请日期 |
2012.07.19 |
申请人 |
ALIBABA GROUP HOLDING LIMITED |
发明人 |
SUN, YIMING;QIANG, QI;CAI, BOYANG;JIN, XIAOJUN;WU, ZONGYUAN |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|