Title of invention: PREDICTING THE BUSINESS IMPACT OF TWEET CONVERSATIONS
Abstract: A system and methods are provided for identifying conversations in tweet streams. A method includes grouping tweet messages in the tweet streams into tweet groups, responsive to hashtags therefor and time intervals in which the tweet messages were sent. The method further includes splitting the tweet groups into subgroups responsive to secondary hashtags and a time separation between the tweet messages. The method also includes clustering any of the subgroups into a respective same conversation responsive to word occurrences, word frequencies, and account holders. The method additionally includes merging any of the subgroups having different hashtags into the respective same conversation responsive to overlapping glossary and account lists. Each of the tweet groups and each of the subgroups corresponds to a respective different one of the conversations when unable to be split, clustered, or merged.
Publication number: US2016019659(A1) Publication date: 2016.01.21
Application number: US201514748507 Filing date: 2015.06.24
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION Inventors: DOGANATA YURDAER N.; LIN CHING-YUNG; LUNA DAVID CORBALAN; MESTRE JORDI C.; PAGES XAVIER NOGUERA; TOPKARA MERCAN; WEN ZHEN; YEH DANNY L.
Classification: G06Q50/00; G06F17/30 Main classification: G06Q50/00
Agency: Agent:
Principal claim: 1. A method for identifying conversations in tweet streams, comprising: grouping tweet messages in the tweet streams into tweet groups, responsive to hashtags therefor and time intervals in which the tweet messages were sent; splitting the tweet groups into subgroups responsive to secondary hashtags and a time separation between the tweet messages; clustering any of the subgroups into a respective same conversation responsive to word occurrences, word frequencies, and account holders; and merging any of the subgroups having different hashtags into the respective same conversation responsive to overlapping glossary and account lists, wherein each of the tweet groups and each of the subgroups corresponds to a respective different one of the conversations when unable to be split, clustered, or merged.
Address: Armonk NY US
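
Illustrative sketch (not part of the patent record): the grouping, splitting, and merging steps recited in the principal claim could be approximated in Python as shown below. Everything specific here is an assumption made for illustration and is not taken from the patent: the `Tweet` record, treating the first hashtag as the primary one, the one-hour `interval`, the ten-minute `max_gap`, and the use of Jaccard overlap with a 0.5 threshold as the merge test. The clustering step based on word occurrences and word frequencies is not covered.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Tweet:
    """Hypothetical tweet record; the field names are assumptions."""
    account: str
    timestamp: float   # seconds since some reference time
    hashtags: tuple    # assumed order: primary hashtag first, then secondary
    text: str


def group_by_hashtag_and_interval(tweets, interval=3600.0):
    """Group tweets by (primary hashtag, index of the time interval)."""
    groups = defaultdict(list)
    for t in tweets:
        if not t.hashtags:
            continue  # this sketch ignores tweets without hashtags
        key = (t.hashtags[0].lower(), int(t.timestamp // interval))
        groups[key].append(t)
    return dict(groups)


def split_group(group, max_gap=600.0):
    """Split one group by its secondary hashtags, then by time separation."""
    by_secondary = defaultdict(list)
    for t in group:
        secondary = frozenset(h.lower() for h in t.hashtags[1:])
        by_secondary[secondary].append(t)

    subgroups = []
    for members in by_secondary.values():
        members.sort(key=lambda t: t.timestamp)
        current = [members[0]]
        for prev, nxt in zip(members, members[1:]):
            # A gap longer than max_gap starts a new subgroup.
            if nxt.timestamp - prev.timestamp > max_gap:
                subgroups.append(current)
                current = []
            current.append(nxt)
        subgroups.append(current)
    return subgroups


def jaccard(a, b):
    """Jaccard overlap of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def glossary(subgroup):
    """Set of lowercased non-hashtag words used in a subgroup."""
    return {w.lower() for t in subgroup for w in t.text.split()
            if not w.startswith("#")}


def should_merge(sub_a, sub_b, threshold=0.5):
    """Assumed merge test: glossary and account lists must both overlap."""
    accounts_a = {t.account for t in sub_a}
    accounts_b = {t.account for t in sub_b}
    return (jaccard(glossary(sub_a), glossary(sub_b)) >= threshold
            and jaccard(accounts_a, accounts_b) >= threshold)
```

In this sketch, a group that `split_group` returns as a single subgroup, and that `should_merge` does not join with any other subgroup, would stand as its own conversation, which mirrors the final "wherein" clause of the claim.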