Title of invention: Keyword assignment to a web page
Abstract: A method, system and apparatus for assigning keywords to a web page using keyword data from the web page itself, web pages having links pointing to the web page, and web pages pointed to by a link in the web page, wherein the keyword data from the multiple web pages is processed to provide a relevant set of keyword data for the web page.
Publication number: US8959091(B2)  Publication date: 2015.02.17
Application number: US200912512702  Filing date: 2009.07.30
Applicant: Alcatel Lucent  Inventors: Kodialam Muralidharan Sampath; Mukherjee Sarit; Wang Limin; Ihm Sunghwan
Classification: G06F17/30  Main classification: G06F17/30
Agency: Wall & Tong, LLP  Agent: Wall & Tong, LLP
Principal claim: 1. A method for assigning keywords to a web page to form thereby a set of web page representative keywords, comprising: identifying self keywords associated with the web page to form thereby a set of identified self keywords, the self keywords comprising keyword data from the web page; identifying in-link keywords associated with the web page to form thereby a set of identified in-link keywords, the in-link keywords comprising keyword data from other web pages including a link to the web page; identifying out-link keywords associated with the web page to form thereby a set of identified out-link keywords, the out-link keywords comprising keyword data from other web pages having a link to said other web pages from the web page; extracting, from each of the sets of identified self, in-link and out-link keywords, a plurality of potential keyword phrases, each keyword phrase comprising at least two keywords within a respective set of keywords; evaluating each of the identified keywords in each of the set and extracted keyword phrases according to a reference function to determine thereby valid keywords and keyword phrases; assigning weights to each of the valid self, in-link and out-link keywords and keyword phrases to form a set of weighted keywords and keyword phrases associated with the web page, wherein each of the valid in-link and out-link keywords and keyword phrases is assigned a weight according to a ranking of a respective source web page; generating a rank ordered of the valid keywords using one or more of count, unique count and weighted unique count heuristic functions; and combining, in the rank order, the valid self, in-link and out-link keywords and keyword phrases to form a set of web page representative keywords and keyword phrases associated with the web page separated by first delineators and stored in a memory.
Address: Boulogne-Billancourt FR
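
Illustrative sketch (not part of the patent record): the steps recited in the principal claim can be pictured roughly as below. This is a minimal reading under stated assumptions, not the patented implementation. The function names `extract_phrases` and `assign_keywords`, the `is_valid` callback standing in for the claim's "reference function", the fixed weight of 1.0 for self keywords, the use of adjacent two-word phrases, and the `;` delineator are all hypothetical choices made for illustration. Only the "weighted unique count" heuristic is shown.

```python
from collections import defaultdict


def extract_phrases(keywords, n=2):
    """Return phrases of n adjacent keywords from an ordered keyword list."""
    return [" ".join(keywords[i:i + n]) for i in range(len(keywords) - n + 1)]


def assign_keywords(self_kw, in_link_pages, out_link_pages, is_valid,
                    top_k=5, sep=";"):
    """Combine self, in-link and out-link keywords into one ranked string.

    in_link_pages / out_link_pages: lists of (source_page_weight, keywords).
    is_valid: hypothetical stand-in for the claim's reference function.
    """
    # Assumption: the page's own keywords get a fixed weight of 1.0.
    sources = [(1.0, self_kw)] + list(in_link_pages) + list(out_link_pages)

    scores = defaultdict(float)
    for weight, keywords in sources:
        terms = keywords + extract_phrases(keywords)
        # Weighted unique count: each source contributes a term at most once,
        # scaled by the weight (ranking) of that source page.
        for term in set(terms):
            if is_valid(term):
                scores[term] += weight

    # Rank by descending score; break ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return sep.join(ranked[:top_k])


if __name__ == "__main__":
    vocabulary = {"cheap", "flights", "travel"}  # hypothetical reference data
    valid = lambda term: all(w in vocabulary for w in term.split())
    print(assign_keywords(
        ["cheap", "flights"],
        in_link_pages=[(0.5, ["flights", "deals"]), (0.2, ["travel"])],
        out_link_pages=[(0.3, ["cheap", "flights"])],
        is_valid=valid,
    ))
```

In this sketch a term that appears on several highly ranked linking pages outranks one that appears only on the page itself, which is one plausible way to read the claim's weighting by "a ranking of a respective source web page".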