Title of invention |
Method for identifying related pages in a hyperlinked database |
Abstract |
A method is described for identifying related pages among a plurality of pages in a linked database such as the World Wide Web. An initial page is selected from the plurality of pages. Pages linked to the initial page are represented as a graph in a memory. The pages represented in the graph are scored on content, and the set of pages whose scores exceed a first predetermined threshold is selected. The selected set of pages is then scored on connectivity, and the subset of pages whose scores exceed a second predetermined threshold is selected as the related pages.
|
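The two-stage selection described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented method: the abstract does not say how content or connectivity is scored, so the sketch assumes cosine similarity over word counts for content and an in-link count among the surviving pages for connectivity. The names `related_pages`, `cosine`, `links`, and `text`, and the sample data, are hypothetical.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two word-count vectors (assumed content score)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def related_pages(initial, links, text, content_threshold, connectivity_threshold):
    """Return pages related to `initial` using a content filter, then a connectivity filter.

    links: dict mapping each page to the pages it links to (hypothetical input format).
    text:  dict mapping each page to its text content (hypothetical input format).
    """
    # Step 1: build the neighborhood graph: pages the initial page links to,
    # plus pages that link to the initial page.
    neighbors = set(links.get(initial, ()))
    neighbors |= {p for p, outs in links.items() if initial in outs}
    neighbors.discard(initial)

    # Step 2: score each neighbor on content and keep those above the first threshold.
    ref = Counter(text[initial].lower().split())
    content = {p: cosine(ref, Counter(text.get(p, "").lower().split())) for p in neighbors}
    selected = {p for p, score in content.items() if score > content_threshold}

    # Step 3: score the survivors on connectivity (assumed here: number of in-links
    # from the initial page and the other survivors) and keep those above the
    # second threshold.
    members = selected | {initial}
    connectivity = {
        p: sum(1 for q in members if q != p and p in links.get(q, ()))
        for p in selected
    }
    return sorted(p for p, score in connectivity.items() if score > connectivity_threshold)


# Hypothetical toy data, for illustration only.
links = {"a": ["b", "c", "d"], "b": ["c"], "c": ["b"], "d": [], "e": ["a"]}
text = {
    "a": "web search ranking",
    "b": "web search engines",
    "c": "search ranking methods",
    "d": "cooking recipes",
    "e": "web ranking search",
}
result = related_pages("a", links, text, content_threshold=0.5, connectivity_threshold=1)
print(result)  # ['b', 'c']
```

In this toy run, page `d` is dropped by the content filter, and page `e` passes the content filter but is dropped by the connectivity filter because no surviving page links to it.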
Publication number |
US2004193636(A1) |
Publication date |
2004.09.30 |
Application number |
US20030702116 |
Filing date |
2003.11.03 |
Applicant |
BLACK JEFFREY DEAN;HENZINGER MONIKA R.;BRODER ANDREI Z. |
Inventor |
BLACK JEFFREY DEAN;HENZINGER MONIKA R.;BRODER ANDREI Z. |
Classification |
G06F15/00;G06F17/00;G06F17/30;(IPC1-7):G06F17/00 |
Main classification |
G06F15/00 |
Patent agency |
|
Patent agent |
|
Principal claim |
|
Address |
|