发明名称 System and method for hierarchical segmentation of websites by topic
摘要 An improved system and method is provided for hierarchical segmentation of websites by topic. To do so, an organization of topics may be determined within directories of a website, the hierarchical arrangement of the web pages in the website may be segmented by topic, and the segments representing regions of coherent topics in the website directory may be output. In an embodiment, a website directory may be converted into a binary tree and dynamic programming may be applied to iteratively determine whether to add a node of the tree to a segment representing a topic. A node selection cost may be evaluated to determine whether to add a node of the tree as a segment representing a topic. And a cohesiveness cost may be evaluated to determine how well a web page of the tree may be represented by its closest ancestral node that may be a segmentation point of a segment representing a topic.
申请公布号 US2008046429(A1) 申请公布日期 2008.02.21
申请号 US20060505010 申请日期 2006.08.16
申请人 YAHOO! INC. 发明人 PUNERA KUNAL;RAVIKUMAR SHANMUGASUNDARAM;TOMKINS ANDREW
分类号 G06F7/00 主分类号 G06F7/00
代理机构 代理人
主权项
地址