摘要 |
<p>The present disclosure provides a technique of text categorization to simplify and optimize the classification. In one aspect, a method parses a given text into one or more words; determines a word vector in a spherical space model for one of the one or more words, a number of dimensions of the spherical space being equal to a number of categories, each category corresponding to a spherical space category vector; for each category, determines a distance between a sum of word vectors of the one or more words and the respective category vector; and classifies the text into one or more categories with the shortest distance. The present disclosure also provides an apparatus used to implement the method.</p> |