Title of Invention |
System, method and apparatus for increasing speed of hierarchical latent Dirichlet allocation model |
Abstract |
Embodiments of the present invention disclose a data processing method including: sending global initial statistical information to each slave node; merging the local statistical information received from each slave node, to obtain new global statistical information; if Gibbs sampling performed by a slave node has ended, calculating a probability distribution between a document and a topic and a probability distribution between the topic and a word according to the new global statistical information; establishing a likelihood function of a text set according to the probability distributions obtained through calculation, and maximizing the likelihood function, to obtain a new hLDA hyper-parameter; and if iteration of solving for an hLDA hyper-parameter has converged, calculating and outputting, according to the new hLDA hyper-parameter, the probability distribution between the document and the topic and the probability distribution between the topic and the word.
|
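The merge and distribution steps named in the abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes the local statistical information is a table of topic-word counts per slave node, that merging is a plain sum, and that the topic-word distribution is a smoothed count ratio with a symmetric smoothing parameter `eta`. The names `merge_local_counts`, `topic_word_distribution`, `eta`, and the sample data are all hypothetical.

```python
from typing import Dict, List

# Assumed shape of the statistics: topic -> word -> count.
Counts = Dict[str, Dict[str, int]]


def merge_local_counts(local_stats: List[Counts]) -> Counts:
    """Sum the topic-word counts reported by each slave node (assumed merge rule)."""
    merged: Counts = {}
    for stats in local_stats:
        for topic, word_counts in stats.items():
            row = merged.setdefault(topic, {})
            for word, n in word_counts.items():
                row[word] = row.get(word, 0) + n
    return merged


def topic_word_distribution(
    counts: Counts, vocab: List[str], eta: float
) -> Dict[str, Dict[str, float]]:
    """Smoothed estimate p(word | topic) = (n_tw + eta) / (n_t + V * eta)."""
    dist: Dict[str, Dict[str, float]] = {}
    for topic, word_counts in counts.items():
        total = sum(word_counts.values()) + eta * len(vocab)
        dist[topic] = {w: (word_counts.get(w, 0) + eta) / total for w in vocab}
    return dist


# Hypothetical local statistics from two slave nodes.
slave_a: Counts = {"root": {"model": 3, "topic": 1}}
slave_b: Counts = {"root": {"model": 1}, "leaf": {"word": 2}}

merged = merge_local_counts([slave_a, slave_b])
phi = topic_word_distribution(merged, ["model", "topic", "word"], eta=0.5)
```

Summing counts keeps the merge order-independent, so the master node could combine slave results as they arrive. The document-topic distribution could be estimated the same way from per-document topic counts.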
Publication Number |
US8527448(B2) |
Publication Date |
2013.09.03 |
Application Number |
US201213722078 |
Filing Date |
2012.12.20 |
Applicant |
HUAWEI TECHNOLOGIES CO., LTD. |
Inventors |
VLADISLAV KOPYLOV;WEN LIUFEI;SHI GUANGYU |
Classification Codes |
G06F9/44;G06F7/00;G06F15/18;G06F17/30;G06N7/02;G06N7/06 |
Main Classification Code |
G06F9/44 |
Patent Agency |
|
Patent Agent |
|
Principal Claim |
|
Address |
|