Title: Systems and methods for identifying user types using multi-modal clustering and information scent
Abstract: Techniques for determining user types based on multi-modal clustering are provided. The topology, content and usage of a document collection or web site are determined. User paths are identified using longest repeating subsequence techniques, and a multi-modal information need vector is determined for each significant user path. For each document in a significant path, content, uniform resource locator (URL), inlink and outlink multi-modal vectors are determined and combined based on path position and access frequency. Multi-modal clustering is then performed based on a multi-modal similarity function and a specified measure of similarity, using a type of multi-modal clustering such as K-means or wavefront clustering. The identified clusters may be further analyzed based on changes to the weighting of the corresponding content, URL, inlink and outlink multi-modal feature vectors.
Publication number: EP1739576(A2)  Publication date: 2007.01.03
Application number: EP20060017595  Filing date: 2002.03.28
Applicant: XEROX CORPORATION  Inventors: CHI, ED H.; HEER, JEFFREY; PIROLLI, PETER L.
Classification: G06F17/30; G06F19/00  Main classification: G06F17/30
Agency:  Agent:
Main claim:
Address:
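
The abstract describes combining per-document multi-modal vectors along a user path and comparing paths with a multi-modal similarity function, but this record does not give the formulas. The following is a minimal sketch of one way these two steps could look, not the patented method itself. It assumes sparse vectors stored as dictionaries, a combination rule that scales each document by a position weight times its access count, and a similarity defined as a weighted average of per-modality cosine similarities. The names `combine_path`, `multimodal_similarity`, `position_weights`, `access_counts` and `modality_weights` are hypothetical.

```python
import math

# Modalities named in the abstract; the dictionary keys are assumptions.
MODALITIES = ("content", "url", "inlinks", "outlinks")


def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty vector is treated as dissimilar to everything
    return dot / (norm_a * norm_b)


def combine_path(doc_vectors, position_weights, access_counts):
    """Merge the multi-modal vectors of the documents on one path.

    Assumed rule: each document contributes its feature values scaled by
    (position weight * access count), summed separately per modality.
    """
    profile = {m: {} for m in MODALITIES}
    for doc, w_pos, count in zip(doc_vectors, position_weights, access_counts):
        for m in MODALITIES:
            for feature, value in doc.get(m, {}).items():
                previous = profile[m].get(feature, 0.0)
                profile[m][feature] = previous + w_pos * count * value
    return profile


def multimodal_similarity(p, q, modality_weights):
    """Weighted average of per-modality cosine similarities (assumed form)."""
    total = sum(modality_weights.values())
    if total == 0:
        return 0.0
    score = sum(
        w * cosine(p.get(m, {}), q.get(m, {}))
        for m, w in modality_weights.items()
    )
    return score / total
```

Under these assumptions, changing `modality_weights` (for example, raising the weight on `"url"` and lowering it on `"content"`) is one way to re-examine clusters as the abstract's last sentence suggests. The resulting similarity can be handed to any clustering routine that accepts a custom similarity function.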