发明名称 Ranking expertise
摘要 Methods, systems and apparatus, including computer program products, for ranking expertise. In some implementations a method is provided that includes identifying a plurality of identities, and identifying a plurality of topics using one or more documents in a repository. For a document in a corpus of documents identifying one or more occurrences of any identity in the plurality of identities and one or more occurrences of any topic in the plurality of topics, determining an association between the identities occurring in the document and the document including deriving an identity score for each unique identity occurring in the document, determining an association between the topics occurring in the document and the document including deriving a topic score for each unique topic occurring in the document, and using the determined associations to derive a score of the document with respect to identities and topics occurring in the document.
申请公布号 US8892549(B1) 申请公布日期 2014.11.18
申请号 US200711772022 申请日期 2007.06.29
申请人 Google Inc. 发明人 Thakur Shashidhar A.
分类号 G06F17/30;G06F7/00 主分类号 G06F17/30
代理机构 Fish & Richardson P.C. 代理人 Fish & Richardson P.C.
主权项 1. A computer-implemented method comprising: identifying a plurality of identities stored in a repository of identities, each identity corresponding to an expert of one or more topics; identifying a plurality of topics stored in a repository of information, each topic i) describing information included by a document of a corpus of documents and ii) distinguishing the document from the remaining documents of the corpus of documents, the plurality of topics including the one or more topics; and processing each document in the corpus of documents, the processing including: identifying one or more identities of the plurality of identities that occur within the document,identify one or more topics of the plurality of topics that occur within the document,for each identity that occurs within the document, determining, using one or more processors, an identity score for the identity with respect to the document, the identity score indicating a degree of relevance between the associated identity and the document,for each topic that occurs within the document, determining a topic score for the topic with respect to the document, the topic score indicating a degree of relevance between the associated topic and the document,identifying one or more combinations of i) the one or more identities that occur within the document, and ii) the one or more topics that occur within the document,for each identified combination, determining an aggregate score for the document based on the identity score associated with the combination for the document and the topic score associated with the combination for the document; and aggregating, for each identified combination, the aggregate score of each document of the corpus of documents for the identified combination to define a composite score of the identified combination across the corpus of documents.
地址 Mountain View CA US