发明名称 System and method for providing robust topic identification in social indexes
摘要 A computer-implemented method for providing robust topic identification in social indexes is described. Electronically-stored articles and one or more indexes are maintained. Each index includes topics that each relate to one or more of the articles. A random sampling and a selective sampling of the articles are both selected. For each topic, characteristic words included in the articles in each of the random sampling and the selective sampling are identified. Frequencies of occurrence of the characteristic words in each of the random sampling and the selective sampling are determined. A ratio of the frequencies of occurrence for the characteristic words included in the random sampling and the selective sampling is identified. Finally, for each topic, a coarse-grained topic model is built, which includes the characteristic words included in the articles relating to the topic and scores assigned to those characteristic words.
申请公布号 EP2192500(A3) 申请公布日期 2010.09.29
申请号 EP20090175873 申请日期 2009.11.13
申请人 PALO ALTO RESEARCH CENTER INCORPORATED 发明人 STEFIK, MARK J.;GOOD, LANCE E.;MITTAL, SANJAY
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项
地址