发明名称 |
A METHOD AND SYSTEM FOR EFFICIENT AND EXHAUSTIVE URL CATEGORIZATION |
摘要 |
The present method and system relate to categorizing URLs (Uniform Resource Locators) of web pages accessed by multiple users over an IP (Internet Protocol) based data network. The method and system collect real time data from IP data traffic occurring on the IP based data network, and extract parameters from the collected real time data, the parameters including an URL of a web page. The URL is processed by a rule based categorization engine, to associate a matching category to the URL of the web page. When no matching category is inferred, the URL is transferred to a semantic based categorization engine. A matching category is associated to the transferred URL by the semantic based categorization engine, based on a semantic analysis of the textual content extracted from the web page associated to the URL. |
申请公布号 |
WO2011069255(A1) |
申请公布日期 |
2011.06.16 |
申请号 |
WO2010CA01952 |
申请日期 |
2010.12.08 |
申请人 |
NEURALITIC SYSTEMS;MIRANDETTE, OLIVIER;TREMBLAY, MARC;MELIN, ERIC |
发明人 |
MIRANDETTE, OLIVIER;TREMBLAY, MARC;MELIN, ERIC |
分类号 |
H04L12/26 |
主分类号 |
H04L12/26 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|