发明名称 METHOD AND SYSTEM FOR SUBJECT RELEVANT WEB PAGE FILTERING BASED ON NAVIGATION PATHS INFORMATION
摘要 Method and system to utilize the set of navigation paths of web pages as the contextual information for subject relevant web page filtering with high accuracy are provided. The method comprises the steps of: obtaining all web pages in one or more web pages collections; collecting information of the links among the obtained web pages; extracting, based on the collected links, a set of navigation paths of each of the obtained web pages; and filtering the obtained web pages based on the extracted set of navigation paths to obtain desired web pages. In some embodiments, the extraction of the navigation paths is preferably performed on the navigation links of the web pages. Therefore, the method also comprises the process for deleting non-navigation links from all the links of the web pages. Compared with the prior art, the present invention can utilize the contextual information of the web pages for web page filtering in a more sufficient way, thereby improving the accuracy and objectivity of the web page filtering.
申请公布号 US2009083244(A1) 申请公布日期 2009.03.26
申请号 US20080236166 申请日期 2008.09.23
申请人 NEC (CHINA) CO., LTD. 发明人 LI JIANQIANG;ZHAO YU
分类号 G06F7/06;G06F17/00;G06F17/30 主分类号 G06F7/06
代理机构 代理人
主权项
地址