摘要 |
This disclosure describes systems and methods for categorizing web pages. Web pages and terms selected from those web pages are organized in a matrix. The number of terms in the matrix are filtered using a Laplacian score algorithm. A linear regression algorithm or some other algorithm may use the filtered set of terms to fit the web pages into pre-defined categories.
|