发明名称 Method and apparatus for selecting links to include in a probabilistic generative model for text
摘要 Some embodiments of the present invention provide a system that selects links while updating a probabilistic generative model for textual documents. During operation, the system receives a current model, which contains terminal nodes representing words and cluster nodes representing clusters of conceptually related words, wherein nodes in the current model are coupled together by weighted links, wherein if a node fires, a link from the node to another node is activated and causes the other node to fire with a probability proportionate to the weight of the link. Next, the system applies a set of training documents containing words to the current model to produce a new model. While doing so, the system: determines expected counts for activations of links and prospective links; determines link-ratings for the links and the prospective links based on the expected counts, and selects links to be included in the new model based on the determined link-ratings. Finally, the system makes the new model the current model.
申请公布号 US8180725(B1) 申请公布日期 2012.05.15
申请号 US20080176621 申请日期 2008.07.21
申请人 LERNER URI N.;JAHR MICHAEL;GOOGLE INC. 发明人 LERNER URI N.;JAHR MICHAEL
分类号 G06F17/00 主分类号 G06F17/00
代理机构 代理人
主权项
地址