发明名称 |
MAILBOX SEARCH ENGINE USING QUERY MULTI-MODAL EXPANSION AND COMMUNITY-BASED SMOOTHING |
摘要 |
A retrieval method on a database of documents including text and names of participants associated with the documents includes: receiving a text query facet of keywords and a persons query facet of participant names; computing an enriched text query as an aggregation of the text query facet, a monomodal expansion of the text query facet based on the keywords, a cross-modal expansion of the text query facet based on the participant names, and a topic expansion of the text query facet based on a topic model associating words and topics; computing an enriched persons query as an aggregation of the persons query facet, a monomodal expansion of the persons query facet based on the participant names, a cross-modal expansion of the persons query facet based on the keywords, and a community expansion of the persons query facet based on a community model associating persons and communities. |
申请公布号 |
US2014280207(A1) |
申请公布日期 |
2014.09.18 |
申请号 |
US201313832463 |
申请日期 |
2013.03.15 |
申请人 |
XEROX CORPORATION |
发明人 |
Renders Jean-Michel;Mantrach Amin |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A non-transitory computer-readable storage medium storing instructions executable by a computer to perform a retrieval method on a database of documents including text and names of participants associated with the documents by operations including:
receiving a multi-faceted retrieval query having a text query facet comprising one or more keywords and a persons query facet comprising one or more participant names; computing an enriched text query as an aggregation of the text query facet, a monomodal expansion of the text query facet based on the one or more keywords, a cross-modal expansion of the text query facet based on the one or more participant names, and a topic expansion of the text query facet based on a topic model associating words and topics; computing an enriched persons query as an aggregation of the persons query facet, a mono-modal expansion of the persons query facet based on the one or more participant names, a cross-modal expansion of the persons query facet based on the one or more keywords, and a community expansion of the persons query facet based on a community model associating persons and communities; and performing ranking including at least one of:
(1) generating a ranking of documents by sorting similarities between the enriched text query and documents of the database, and(2) generating a ranking of persons by sorting the enriched persons query. |
地址 |
Norwalk CT US |