发明名称 WEB GRAPH COMPRESSION THROUGH SCALABLE PATTERN MINING
摘要 A method and a processing device are provided for compressing a web graph including multiple nodes and links between the multiple nodes. Nodes of the web graph may be clustered into groups including no more than a predetermined number of nodes. A list of links of the clustered nodes may be created and sorted based on a frequency of occurrence of each of the links. A prefix tree may be created based on the sorted list of links. The prefix tree may be walked to find candidate virtual nodes. The candidate virtual nodes may be analyzed according to a selection criteria and a virtual node may be selected. The prefix tree may be adjusted to account for the selection of the virtual node and the virtual node may be added to the web graph.
申请公布号 US2010332476(A1) 申请公布日期 2010.12.30
申请号 US20100880747 申请日期 2010.09.13
申请人 MICROSOFT CORPORATION 发明人 BUEHRER GREGORY T.;CHELLAPILLA KUMAR HEMACHANDRA
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 代理人
主权项
地址