发明名称 |
System and method for selecting a sub-domain for a specified domain of the web |
摘要 |
A selection system and method. The selection method comprises receiving, by a computing system, a taxonomy of data related to a specified domain of knowledge on the web. A taxonomy tree is constructed from the taxonomy. A sub tree related to a sub-domain from specified domain is selected from the taxonomy tree. A first list comprising user expected universal resource locators (URLs) related to the sub-domain is received. A second list comprising topic expressions defining each node of the taxonomy sub-tree is generated. A query based on the second list is generated. The query is applied on an index of URLs generated from a web crawling process to generate a third list. A recall value is calculated based on the first list and the third list.
|
申请公布号 |
US2007266016(A1) |
申请公布日期 |
2007.11.15 |
申请号 |
US20060432265 |
申请日期 |
2006.05.11 |
申请人 |
INTERNATIONAL BUSINESS MACHINES CORPORATION |
发明人 |
HOLMES SCOTT R.;MI HONGCHENG;NEGI SUMIT;ZHANG ZENGYAN |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|