Title of invention: Systems and methods for identifying user types using multi-modal clustering and information scent
Abstract: Techniques for determining user types based on multi-modal clustering are provided. The topology, content and usage of a document collection or web site are determined. User paths are identified using longest repeating subsequence techniques, and a multi-modal information need vector is determined for each significant user path. For each document in a significant path, content, uniform resource locator (URL), inlink and outlink multi-modal vectors are determined and combined based on path position and access frequency. Multi-modal clustering is performed based on a multi-modal similarity function and a specified measure of similarity, using a type of multi-modal clustering such as K-means or wavefront clustering. The identified clusters may be further analyzed based on changes to the weighting of the corresponding content, URL, inlink and outlink multi-modal feature vectors.
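The combination and similarity steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not give the weighting formula or the similarity function, so the position weight `(pos + 1) * access_freq`, the use of cosine similarity per modality, and the names `path_profile`, `multimodal_similarity` and `MODALITIES` are all assumptions made for illustration.

```python
import math

# The four modalities named in the abstract.
MODALITIES = ("content", "url", "inlinks", "outlinks")


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0


def path_profile(path, doc_vectors, access_freq):
    """Combine per-document vectors along a user path, one vector per modality.

    Assumed weighting (not from the source): (position + 1) * access frequency,
    so documents later in the path and visited more often count more.
    Documents missing from access_freq default to a frequency of 1.
    """
    profile = {}
    for m in MODALITIES:
        total = [0.0] * len(doc_vectors[path[0]][m])
        for pos, doc in enumerate(path):
            w = (pos + 1) * access_freq.get(doc, 1)
            for i, x in enumerate(doc_vectors[doc][m]):
                total[i] += w * x
        profile[m] = total
    return profile


def multimodal_similarity(p, q, weights):
    """Assumed similarity: weighted sum of per-modality cosine similarities."""
    return sum(weights[m] * cosine(p[m], q[m]) for m in MODALITIES)
```

A clustering routine such as K-means could then use `multimodal_similarity` to compare path profiles. Changing the entries of `weights` and re-clustering corresponds to the re-weighting analysis mentioned at the end of the abstract.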
Publication number: EP1246082(A2)  Publication date: 2002.10.02
Application number: EP20020007425  Filing date: 2002.03.28
Applicant: XEROX CORPORATION  Inventors: CHI, ED H.; HEER, JEFFREY; PIROLLI, PETER L.
Classification: G06F19/00; G06F17/30; (IPC1-7): G06F17/30  Main classification: G06F19/00
Agency:  Agent:
Main claim:
Address: