摘要 |
<p>The present invention is a system and method for building an intelligent index of Internet web pages. A populator retrieves a web page, divides words within the web page into categories, and determines a relevancy rating for the words in each category, the relevancy rating based on the number of appearances of the word in the corresponding category. The populator then weights each relevancy rating by a weighting factor corresponding to the category, and sums the weighted relevancy ratings to determine a web page relevancy rating for each unique word. The categories include a header, hidden words, non-sentences, repetitive words, non-nouns, and nouns. Each category is further subdivided into subcategories of commonly used words and uncommonly used words. A relevancy rating is determined for each subcategory.</p> |