Abstract |
Disclosed is a method for searching, aggregating and presenting Internet information. The method comprises: 1) crawling webpages on the Internet and building indexes for the webpages from their text content; 2) looking up an input query term in an aggregate content library and, if answer content corresponding to the query term exists, returning that answer content as the search result, otherwise proceeding to step 3); 3) using the established indexes to retrieve webpages for the query term, obtaining a candidate result set; 4) comparing the text content of the webpages in the candidate result set for similarity to obtain a series of similar page groups {S1, S2, ..., Sk}; 5) for each similar page group Si, extracting the content its webpages share and the content in which they differ, and combining both into a new page Pi; and 6) returning Si and Pi of each group as the answer content, and saving the answer content in the aggregate content library. The present invention can thereby directly provide a user with aggregated, useful information. |
Applicant |
COMPUTER NETWORK INFORMATION CENTER, CHINESE ACADEMY OF SCIENCES |
Inventors |
LI, XIAODONG;YANG, LIUQING;HONG, BO;CHEN, YONG;GENG, GUANGGANG |
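
Steps 1) to 6) of the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the claimed implementation: the abstract does not say how similarity is measured or how content is split, so the word-set Jaccard similarity, the 0.5 threshold, the greedy grouping, and the sentence-level split are all assumptions. Every function name and the sample pages are hypothetical, and crawling is replaced by an in-memory dictionary of page text.

```python
import re
from collections import defaultdict


def tokens(text):
    """Lower-cased word set of a text (assumed tokenization)."""
    return set(re.findall(r"\w+", text.lower()))


def build_index(pages):
    """Step 1: inverted index mapping each token to the URLs containing it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for tok in tokens(text):
            index[tok].add(url)
    return index


def jaccard(a, b):
    """Assumed similarity measure: |a & b| / |a | b|."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def group_similar(urls, pages, threshold=0.5):
    """Step 4: greedy grouping; a page joins the first group whose
    first page is at least `threshold` similar, else starts a new group."""
    groups = []
    for url in urls:
        for group in groups:
            if jaccard(tokens(pages[url]), tokens(pages[group[0]])) >= threshold:
                group.append(url)
                break
        else:
            groups.append([url])
    return groups


def merge_group(group, pages):
    """Step 5: build the new page Pi from shared and differing sentences."""
    sentence_lists = [
        [s.strip() for s in pages[url].split(".") if s.strip()] for url in group
    ]
    shared = set(sentence_lists[0]).intersection(*sentence_lists[1:])
    common = [s for s in sentence_lists[0] if s in shared]
    different = {
        url: [s for s in sents if s not in shared]
        for url, sents in zip(group, sentence_lists)
    }
    return {"common": common, "different": different}


def search(query, pages, index, library):
    """Steps 2, 3 and 6: consult the library, else retrieve, group, merge, save."""
    if query in library:  # step 2: answer content already aggregated
        return library[query]
    terms = tokens(query)
    # Step 3: candidate result set = pages containing every query token.
    candidates = (
        sorted(set.intersection(*(index.get(t, set()) for t in terms)))
        if terms else []
    )
    answer = [(g, merge_group(g, pages)) for g in group_similar(candidates, pages)]
    library[query] = answer  # step 6: save in the aggregate content library
    return answer


# Hypothetical sample data standing in for crawled pages.
pages = {
    "a.example/1": "Python is a programming language. It was created by Guido van Rossum.",
    "b.example/1": "Python is a programming language. It was created by Guido van Rossum. It is popular.",
    "c.example/1": "A python is a large snake.",
}
index = build_index(pages)
library = {}
result = search("python", pages, index, library)
```

Here the first two sample pages fall into one group and the third forms its own; a repeated call with the same query is answered from `library` without touching the index. Comparing each page only against the first page of a group keeps the sketch short; a real system would more likely use a proper clustering step.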