Title of invention: Clustering hypertext with applications to web searching
Abstract: A method and structure of searching a database containing hypertext documents comprising searching the database using a query to produce a set of hypertext documents; and geometrically clustering the set of hypertext documents into various clusters using a toric k-means similarity measure such that documents within each cluster are similar to each other, wherein the clustering has a linear-time complexity in producing the set of hypertext documents, wherein the similarity measure comprises a weighted sum of maximized individual components of the set of hypertext documents, and wherein the clustering is based upon words contained in each hypertext document, out-links from each hypertext document, and in-links to each hypertext document.
Publication number: US6684205(B1)
Publication date: 2004.01.27
Application number: US20000690854
Filing date: 2000.10.18
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: MODHA DHARMENDRA SHANTILAL; SPANGLER WILLIAM SCOTT
Classification: G06F17/30; (IPC1-7): G06F17/30
Main classification: G06F17/30
Agency:
Agent:
Main claim:
Address:
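
The clustering step named in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas, so the following are assumptions made only for illustration: each document is a triple of vectors (word counts, out-link counts, in-link counts), each component is normalized to unit length, the similarity between a document and a cluster centroid is a weighted sum of the three component-wise dot products, and the weights are non-negative. The names `toric_similarity` and `toric_kmeans` are hypothetical and are not taken from the patent.

```python
import math
import random


def normalize(v):
    """Scale a vector to unit Euclidean length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def toric_similarity(doc, centroid, weights):
    """Assumed measure: weighted sum of dot products over the
    (words, out-links, in-links) components."""
    return sum(w * dot(d, c) for w, d, c in zip(weights, doc, centroid))


def toric_kmeans(docs, k, weights, iters=20, seed=0):
    """Cluster documents given as (words, out_links, in_links) triples.

    Each pass compares every document with every centroid once, so the
    cost per pass grows linearly with the number of documents.
    """
    rng = random.Random(seed)
    docs = [tuple(normalize(part) for part in doc) for doc in docs]
    centroids = rng.sample(docs, k)  # assumption: seed centroids from the data
    labels = None
    for _ in range(iters):
        # Assignment step: pick the most similar centroid for each document.
        new_labels = [
            max(range(k), key=lambda j: toric_similarity(doc, centroids[j], weights))
            for doc in docs
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: re-normalize the summed components of each cluster.
        for j in range(k):
            members = [doc for doc, lab in zip(docs, labels) if lab == j]
            if not members:
                continue  # keep the previous centroid for an empty cluster
            centroids[j] = tuple(
                normalize([sum(col) for col in zip(*(m[c] for m in members))])
                for c in range(3)
            )
    return labels, centroids


if __name__ == "__main__":
    # Toy data: two documents about one topic, two about another.
    toy_docs = [
        ([1, 0], [1, 0], [1, 0]),
        ([0.9, 0.1], [1, 0], [1, 0.1]),
        ([0, 1], [0, 1], [0, 1]),
        ([0.1, 0.9], [0, 1], [0.1, 1]),
    ]
    labels, _ = toric_kmeans(toy_docs, k=2, weights=(1 / 3, 1 / 3, 1 / 3))
    print(labels)
```

Normalizing each component separately places every document on a product of three unit spheres, which is one plausible reading of "toric". The weights let a caller emphasize words over link structure or the reverse. How the patent actually chooses the weights and the initial centroids is not stated in this record.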