Title of invention METHOD AND SYSTEM FOR CHARACTERIZING WEB CONTENT
Abstract An exemplary embodiment of the present invention provides a method of processing Web activity data. The method includes obtaining a database of clickstream data comprising a user identifier corresponding with a user ID and a uniform resource locator (URL) corresponding with a Web page visited from the user ID. The method also includes generating a plurality of features based on the URL. Further, the method includes generating a data structure comprising the user ID and the features. The method also includes generating segment information from the data structure based on the similarity of a URL visitation pattern across different user IDs, wherein each segment in the segment information comprises one or more user IDs and one or more features.
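The steps named in the abstract (clickstream records, URL-derived features, a user-to-feature data structure, and segments of similar users) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the method claimed in the application: the feature scheme (host plus path segments), the function names `url_features`, `build_user_features` and `segment_users`, the Jaccard similarity measure, the 0.3 threshold, the greedy grouping, and the sample URLs are all hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def url_features(url):
    """Derive illustrative features from a URL: its host and each path segment."""
    parts = urlsplit(url)
    features = {f"host:{parts.netloc.lower()}"}
    for segment in parts.path.split("/"):
        if segment:
            features.add(f"path:{segment.lower()}")
    return features


def build_user_features(clickstream):
    """Build the data structure: user ID -> set of features of the URLs visited."""
    table = defaultdict(set)
    for user_id, url in clickstream:
        table[user_id] |= url_features(url)
    return dict(table)


def jaccard(a, b):
    """Jaccard similarity of two feature sets (an assumed similarity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def segment_users(user_features, threshold=0.3):
    """Greedy grouping: a user joins the first segment whose features are similar enough."""
    segments = []
    for user_id in sorted(user_features):
        feats = user_features[user_id]
        for seg in segments:
            if jaccard(feats, seg["features"]) >= threshold:
                seg["user_ids"].append(user_id)
                seg["features"] |= feats
                break
        else:
            # No sufficiently similar segment exists, so start a new one.
            segments.append({"user_ids": [user_id], "features": set(feats)})
    return segments


# Hypothetical clickstream: (user ID, visited URL) pairs.
clickstream = [
    ("u1", "http://news.example.com/sports/football"),
    ("u2", "http://news.example.com/sports/tennis"),
    ("u3", "http://shop.example.com/garden/tools"),
    ("u1", "http://news.example.com/sports/tennis"),
]
segments = segment_users(build_user_features(clickstream))
```

Each resulting segment holds one or more user IDs together with the features they share or contribute, which mirrors the shape of the segment information described in the abstract. A real system would likely use a more robust clustering technique than this greedy pass, whose result depends on the order in which users are processed.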
Publication number US2011029505(A1) Publication date 2011.02.03
Application number US20090533717 Filing date 2009.07.31
Applicant SCHOLZ MARTIN B;RAJARAM SHYAM SUNDAR;LUKOSE RAJAN Inventor SCHOLZ MARTIN B.;RAJARAM SHYAM SUNDAR;LUKOSE RAJAN
Classification G06F17/30 Main classification G06F17/30
Agency Agent
Main claim
Address