发明名称 Weighting Search Criteria Based on Similarities to an Ingested Corpus in a Question and Answer (QA) System
摘要 A mechanism is provided, in a data processing system comprising a processor and a memory configured to implement a question and answer (QA) system, for weighting search criteria based on similarities to an ingested corpus in the QA system. A set of question characteristics found in a received input question are compared to a set of data characteristics respectively describing data in each corpus of a corpora. For each question characteristic in the set of found question characteristics, a first weight is assigned to the corpus within which data associated with the data characteristic resides in response to the question characteristic being more related to a data characteristic; otherwise a second weight is assigned, where the first weight is greater than the second weight. A selective search is then performed for an answer to the received input question in one or more corpora with a higher weighting.
申请公布号 US2015356089(A1) 申请公布日期 2015.12.10
申请号 US201414300456 申请日期 2014.06.10
申请人 International Business Machines Corporation 发明人 Jamrog Daniel M.;LaVoie Jason D.;Orrick Nicholas W.;Witherspoon Kristin A.
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method, in a data processing system comprising a processor and a memory configured to implement a question and answer (QA) system, for weighting search criteria based on similarities to an ingested corpus in the QA system, the method comprising: parsing a received input question having a set of question characteristics; comparing the set of question characteristics found in the received input question to a set of data characteristics respectively describing data in each corpus of a corpora; for each question characteristic in the set of found question characteristics: responsive to the question characteristic being more related to a data characteristic in the set of data characteristics, assigning a first weight to the corpus within which data associated with the data characteristic resides; andresponsive to the question characteristic being less related to the data characteristic in the set of data characteristics, assigning a second weight to the corpus within which the data associated with the data characteristic resides, wherein the first weight is greater than the second weight; and selectively searching for an answer to the received input question in one or more corpora with a higher weighting preferentially to one or more corpora with a lower weighting.
地址 Armonk NY US