发明名称 |
METHOD AND SYSTEM FOR CHARACTERIZING WEB CONTENT |
摘要 |
An exemplary embodiment of the present invention provides a method of processing Web activity data. The method includes obtaining a database of clickstream data comprising a user identifier corresponding with a user ID and a uniform resource locator (URL) corresponding with a Web page visited from the user ID. The method also includes generating a plurality of features based on the URL. Further, the method includes generating a data structure comprising the user ID and the feature. The method also includes generating segment information from the data structure based on the similarity of a URL visitation pattern across different user IDs, wherein each segment in the segment information comprises one or more user IDs and one or more features.
|
申请公布号 |
US2011029505(A1) |
申请公布日期 |
2011.02.03 |
申请号 |
US20090533717 |
申请日期 |
2009.07.31 |
申请人 |
SCHOLZ MARTIN B;RAJARAM SHYAM SUNDAR;LUKOSE RAJAN |
发明人 |
SCHOLZ MARTIN B.;RAJARAM SHYAM SUNDAR;LUKOSE RAJAN |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|