Title: Overlapping community detection in weighted graphs
Abstract: The disclosure includes a system and method for detecting communities in a weighted graph. A community detection module includes a tagset data aggregator, a counts statistics engine, a weighted graph generator, a coherence engine, a community detector and a tag recommendation engine. The tagset data aggregator receives tagset data. The counts statistics engine determines counts statistics for the tagset data. The weighted graph generator generates and denoises a weighted tag co-occurrence graph based on the counts statistics. The coherence engine determines importance scores for all tags and coherence scores for all tagsets in the tagset data. The community detector determines maximally coherent communities in the weighted tag co-occurrence graph. The tag recommendation engine recommends tags in real time using the maximally coherent communities.
Publication number: US9418142(B2) Publication date: 2016.08.16
Application number: US201414286297 Filing date: 2014.05.23
Applicant: Google Inc. Inventor: Kumar Shailesh
Classification: G06F17/00; G06F17/30; G06K9/62 Main classification: G06F17/00
Agency: Patent Law Works LLP Agent: Patent Law Works LLP
Main claim: 1. A computer-implemented method comprising: identifying, using one or more computing devices, a context; determining, using the one or more computing devices, a plurality of tagsets each including one or more tags describing an entity and a vocabulary of unique tags defined by the identified context; generating, using the one or more computing devices, counts statistics using the plurality of tagsets and the vocabulary of unique tags; determining a measure of co-occurrence consistent for a pair of tags in the vocabulary of unique tags based on the counts statistics, the measure of co-occurrence consistent indicating a likelihood of the pair of tags co-occurring in a tagset from the plurality of tagsets relative to random; generating, using the one or more computing devices, a weighted tag co-occurrence graph including the pair of tags in the vocabulary of unique tags based on the measure of co-occurrence consistent; denoising, using the one or more computing devices, the weighted tag co-occurrence graph; and responsive to removing the noise, identifying, using the one or more computing devices, at least one community in the weighted tag co-occurrence graph.
Address: Mountain View CA US
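Illustrative sketch: the steps recited in the main claim (counts statistics, a co-occurrence measure relative to random, a weighted graph, denoising, community identification) can be pictured with a small Python example. This is not the patented method. Every concrete choice here is an assumption made for illustration: pointwise mutual information stands in for the co-occurrence measure, a simple weight threshold (`min_weight`) stands in for denoising, and connected components stand in for community identification, so the groups found here do not overlap, unlike the communities named in the title. The function names `cooccurrence_graph` and `communities` are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def cooccurrence_graph(tagsets, min_weight=0.0):
    """Build a weighted tag co-occurrence graph and prune weak edges.

    Assumption: edge weight is pointwise mutual information (PMI),
    log(P(a, b) / (P(a) * P(b))), which is positive when two tags
    co-occur more often than chance. Edges with weight <= min_weight
    are dropped as a crude stand-in for denoising.
    """
    n = len(tagsets)
    tag_counts = Counter()
    pair_counts = Counter()
    for tagset in tagsets:
        tags = sorted(set(tagset))  # ignore duplicate tags within a tagset
        tag_counts.update(tags)
        pair_counts.update(combinations(tags, 2))

    graph = {}
    for (a, b), n_ab in pair_counts.items():
        pmi = math.log(n_ab * n / (tag_counts[a] * tag_counts[b]))
        if pmi > min_weight:
            graph.setdefault(a, {})[b] = pmi
            graph.setdefault(b, {})[a] = pmi
    return graph


def communities(graph):
    """Return connected components of the pruned graph as sorted tag lists.

    Assumption: connected components are a simple, non-overlapping
    stand-in for the community identification step.
    """
    seen = set()
    result = []
    for start in graph:
        if start in seen:
            continue
        seen.add(start)
        stack, component = [start], []
        while stack:
            tag = stack.pop()
            component.append(tag)
            for neighbour in graph[tag]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        result.append(sorted(component))
    return sorted(result)


if __name__ == "__main__":
    example_tagsets = [
        ["beach", "sea", "sun"],
        ["beach", "sea"],
        ["python", "code"],
        ["python", "code", "bug"],
        ["sea", "code"],  # a chance pairing that the threshold removes
    ]
    print(communities(cooccurrence_graph(example_tagsets)))
```

In this toy input the pair ("code", "sea") co-occurs less often than chance would predict, so its PMI is negative and the edge is pruned, leaving two separate groups of tags.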