Title of invention |
IDENTIFYING EQUIVALENT LINKS ON A PAGE |
Abstract |
An illustrative embodiment of a computer-implemented process for identifying equivalent links on a page, responsive to a determination that a crawler has not visited all required universal resource locators (URLs), locates a next URL to be crawled to form a current URL and processes the current URL to identify equivalent URLs. Responsive to a determination that the crawler has not visited the current URL, the process determines whether it is necessary to crawl all identified equivalent URLs and, responsive to a determination that it is necessary to crawl all identified equivalent URLs, adds all equivalent URLs to a list of URLs to be crawled. |
Publication number |
CA2781391(A1) |
Publication date |
2013.12.26 |
Application number |
CA20122781391 |
Filing date |
2012.06.26 |
Applicant |
IBM CANADA LIMITED - IBM CANADA LIMITEE |
Inventors |
ONUT, IOSIF VIOREL;IONESCU, PAUL;AYOUB, KHALIL ANDREW;SMITH, WAYNE DUNCAN |
Classification |
H04L12/16 |
Main classification |
H04L12/16 |
Agency |
|
Agent |
|
Principal claim |
|
Address |
|
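The crawl loop summarized in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the method claimed in the application: the abstract does not say how equivalent URLs are identified, so `equivalence_key` (same host, same path, same query parameter names) is a hypothetical rule, and `links_on_page` and the `crawl_all_equivalents` flag are likewise names introduced here for illustration only.

```python
from collections import deque
from urllib.parse import urlsplit, parse_qsl


def equivalence_key(url):
    """Hypothetical equivalence rule (an assumption, not from the record):
    two URLs are equivalent if host, path, and query parameter names match."""
    parts = urlsplit(url)
    names = tuple(sorted(name for name, _ in
                         parse_qsl(parts.query, keep_blank_values=True)))
    return (parts.netloc.lower(), parts.path, names)


def crawl(seed_urls, links_on_page, crawl_all_equivalents=False):
    """Return the URLs visited, in visiting order.

    links_on_page is a hypothetical callable mapping a URL to the list of
    links found on that page; a real crawler would fetch and parse the page.
    """
    to_crawl = deque(seed_urls)   # list of URLs to be crawled
    visited = []                  # visiting order
    visited_set = set()           # fast membership check
    seen_keys = set()             # equivalence classes already represented

    while to_crawl:                    # not all required URLs visited yet
        current = to_crawl.popleft()   # next URL becomes the current URL
        if current in visited_set:     # skip URLs the crawler already visited
            continue
        visited.append(current)
        visited_set.add(current)
        seen_keys.add(equivalence_key(current))

        # Process the current URL: group its links into equivalence classes.
        groups = {}
        for link in links_on_page(current):
            groups.setdefault(equivalence_key(link), []).append(link)

        for key, members in groups.items():
            if crawl_all_equivalents:
                # Necessary to crawl all equivalents: queue every member.
                to_crawl.extend(members)
            elif key not in seen_keys:
                # Otherwise queue one representative per unseen class.
                seen_keys.add(key)
                to_crawl.append(members[0])
    return visited
```

With `crawl_all_equivalents=False`, links such as `item?id=1` and `item?id=2` fall into one class under the assumed rule and only the first is queued; with `True`, both are added to the list of URLs to be crawled.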