发明名称 |
WEB GRAPH COMPRESSION THROUGH SCALABLE PATTERN MINING |
摘要 |
A method and a processing device are provided for compressing a web graph including multiple nodes and links between the multiple nodes. Nodes of the web graph may be clustered into groups including no more than a predetermined number of nodes. A list of links of the clustered nodes may be created and sorted based on a frequency of occurrence of each of the links. A prefix tree may be created based on the sorted list of links. The prefix tree may be walked to find candidate virtual nodes. The candidate virtual nodes may be analyzed according to a selection criteria and a virtual node may be selected. The prefix tree may be adjusted to account for the selection of the virtual node and the virtual node may be added to the web graph.
|
申请公布号 |
US2010332476(A1) |
申请公布日期 |
2010.12.30 |
申请号 |
US20100880747 |
申请日期 |
2010.09.13 |
申请人 |
MICROSOFT CORPORATION |
发明人 |
BUEHRER GREGORY T.;CHELLAPILLA KUMAR HEMACHANDRA |
分类号 |
G06F7/00;G06F17/30 |
主分类号 |
G06F7/00 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|