摘要 |
A method for categorising input data objects e.g. web pages using a model comprising a plurality of patterns e.g. words associated with at least one weighting for a category, includes the steps of: identifying patterns within the input data object that correspond to at least some of the patterns within the model; for each identified pattern, determining a weighting for at least one category from the model; calculating a score for the input data object for at least one category based at least in part on the weightings of the identified patterns and the frequency of the identified patterns within the input data object; and categorising the input data object based at least in part on the calculated score. Also disclosed is a method for generating a categorisation model. |