发明名称 UNSTRUCTURED DATA GUIDED QUERY MODIFICATION
摘要 A method, system, and computer program product for unstructured data guided query modification are provided in the illustrative embodiments. A set of parameters is identified in a structured database query. Using a Natural language processing (NLP) engine, a set of tokens is identified in an unstructured data. Using the NLP engine, corresponding to a subset of the set of parameters, sets of variations are obtained. A fit is found between a first token from the set of tokens and a first variant of a first parameter, the first variant of the first parameter being a member of a first set of variations corresponding to the first parameter. The first parameter in the structured database query is substituted with the first variant to produce a substituted query, wherein the substituted query produces a result set that is related to the unstructured data.
申请公布号 US2016063095(A1) 申请公布日期 2016.03.03
申请号 US201414469705 申请日期 2014.08.27
申请人 International Business Machines Corporation 发明人 NASSAR AHMED M.A.;Omar Eman;Rosengarten Evelyn M.;Trim Craig M.
分类号 G06F17/30;G06F17/27 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method for unstructured data guided query modification, the method comprising: identifying, using a processor and a memory, a set of parameters in a structured database query; identifying, using a Natural language processing (NLP) engine, a set of tokens in an unstructured data; obtaining, using the NLP engine, corresponding to a subset of the set of parameters, sets of variations, wherein a particular set of variations corresponds to a particular parameter in the subset of parameters; finding a fit between a first token from the set of tokens and a first variant of a first parameter, the first variant of the first parameter being a member of a first set of variations corresponding to the first parameter; and substituting the first parameter in the structured database query with the first variant to produce a substituted query, wherein the substituted query produces a result set that is related to the unstructured data.
地址 Armonk NY US