摘要 |
Some embodiments of the present invention provide a system that selects links while updating a probabilistic generative model for textual documents. During operation, the system receives a current model, which contains terminal nodes representing words and cluster nodes representing clusters of conceptually related words, wherein nodes in the current model are coupled together by weighted links, wherein if a node fires, a link from the node to another node is activated and causes the other node to fire with a probability proportionate to the weight of the link. Next, the system applies a set of training documents containing words to the current model to produce a new model. While doing so, the system: determines expected counts for activations of links and prospective links; determines link-ratings for the links and the prospective links based on the expected counts, and selects links to be included in the new model based on the determined link-ratings. Finally, the system makes the new model the current model. |