发明名称 |
AUTOMATIC TOPIC DISCOVERY IN STREAMS OF UNSTRUCTURED DATA |
摘要 |
A method is provided for automatically discovering topics in electronic posts, such as social media posts. The method includes receiving a corpus that includes a plurality of electronic posts. The method further includes identifying a plurality of candidate terms within the corpus and selecting, as a trimmed lexicon, a subset of the plurality of candidate terms using predefined criteria. The method further includes clustering at least a subset of the plurality of electronic posts according to a plurality of clusters using the lexicon to produce a plurality of statistical topic models. The method further includes storing information corresponding to the statistical topic models. |
申请公布号 |
WO2015161129(A8) |
申请公布日期 |
2016.04.28 |
申请号 |
WO2015US26252 |
申请日期 |
2015.04.16 |
申请人 |
UDA, LLC |
发明人 |
WEISSINGER, STEVE;STEVENS, LUIS;SCHIAVONE, VINCENT |
分类号 |
G06F17/28;G06F17/27 |
主分类号 |
G06F17/28 |
代理机构 |
|
代理人 |
|
主权项 |
|
地址 |
|