发明名称 |
Analyzing uniform resource locators |
摘要 |
Methods for analyzing a Uniform Resource Locator (URL) and apparatus for performing such methods. The methods include parsing the URL into text segments and generating n-grams from the text segments. The methods further include generating annotations, each annotation corresponding to one of the n-grams and comprising a match value for its corresponding n-gram, a description of its match value, and a score. The methods still further include selecting a subset of the annotations. |
申请公布号 |
US9286408(B2) |
申请公布日期 |
2016.03.15 |
申请号 |
US201313754048 |
申请日期 |
2013.01.30 |
申请人 |
Hewlett-Packard Development Company, L.P. |
发明人 |
Koutrika Georgia |
分类号 |
G06F7/00;G06F17/30;G06F17/27;G06F17/24 |
主分类号 |
G06F7/00 |
代理机构 |
Dicke, Billig & Czaja, PLLC |
代理人 |
Dicke, Billig & Czaja, PLLC |
主权项 |
1. A method of analyzing a Uniform Resource Locator (URL), comprising:
obtaining the URL from an Internet browser instantiated and displayed on a computer; and performing, by a processor:
parsing the URL into text segments;generating n-grams from the text segments of the URL;comparing the n-grams generated from the text segments of the URL to at least one knowledge base;generating annotations, each annotation corresponding to one of the n-grams generated from the text segments of the URL and comprising a match value for its corresponding n-gram determined from the comparison to the at least one knowledge base, a description of its match value from the at least one knowledge base, and a score indicating a relative confidence level that an association between the corresponding n-gram and the description of its match value is correct; andselecting a subset of the annotations. |
地址 |
Houston TX US |