发明名称 MAILBOX SEARCH ENGINE USING QUERY MULTI-MODAL EXPANSION AND COMMUNITY-BASED SMOOTHING
摘要 A retrieval method on a database of documents including text and names of participants associated with the documents includes: receiving a text query facet of keywords and a persons query facet of participant names; computing an enriched text query as an aggregation of the text query facet, a monomodal expansion of the text query facet based on the keywords, a cross-modal expansion of the text query facet based on the participant names, and a topic expansion of the text query facet based on a topic model associating words and topics; computing an enriched persons query as an aggregation of the persons query facet, a monomodal expansion of the persons query facet based on the participant names, a cross-modal expansion of the persons query facet based on the keywords, and a community expansion of the persons query facet based on a community model associating persons and communities.
申请公布号 US2014280207(A1) 申请公布日期 2014.09.18
申请号 US201313832463 申请日期 2013.03.15
申请人 XEROX CORPORATION 发明人 Renders Jean-Michel;Mantrach Amin
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A non-transitory computer-readable storage medium storing instructions executable by a computer to perform a retrieval method on a database of documents including text and names of participants associated with the documents by operations including: receiving a multi-faceted retrieval query having a text query facet comprising one or more keywords and a persons query facet comprising one or more participant names; computing an enriched text query as an aggregation of the text query facet, a monomodal expansion of the text query facet based on the one or more keywords, a cross-modal expansion of the text query facet based on the one or more participant names, and a topic expansion of the text query facet based on a topic model associating words and topics; computing an enriched persons query as an aggregation of the persons query facet, a mono-modal expansion of the persons query facet based on the one or more participant names, a cross-modal expansion of the persons query facet based on the one or more keywords, and a community expansion of the persons query facet based on a community model associating persons and communities; and performing ranking including at least one of: (1) generating a ranking of documents by sorting similarities between the enriched text query and documents of the database, and(2) generating a ranking of persons by sorting the enriched persons query.
地址 Norwalk CT US