发明名称 Systems and methods for identifying user types using multi-modal clustering and information scent
摘要 Techniques for determining user types based on multi-modal clustering are provided. The topology, content and usage of a document collection or web site is determined. The user paths are identified using longest repeating subsequence techniques and a multi-modal information need vector is determined for each significant user path. Multi-modal vectors for each document in the significant path, content, uniform resource locators, inlink and outlink multi-modal vectors are determined and combined based on path position and access frequency. Multi-modal clustering is performed based on a multi-modal similarity function and a specified measure of similarity using a type of multi-modal clustering such as K-means or wavefront clustering. The identified clusters may be further analyzed based on changes to the weighting of the corresponding content, url, inlinks and outlinks multi-modal feature vectors.
申请公布号 US7260643(B2) 申请公布日期 2007.08.21
申请号 US20010820988 申请日期 2001.03.30
申请人 XEROX CORPORATION 发明人 CHI ED H.;HEER JEFFERY M;PIROLLI PETER L. T.
分类号 G06F15/173;G06F19/00;G06F17/30 主分类号 G06F15/173
代理机构 代理人
主权项
地址