Title of invention |
UNSTRUCTURED DATA GUIDED QUERY MODIFICATION |
Abstract |
A method, system, and computer program product for unstructured data guided query modification are provided in the illustrative embodiments. A set of parameters is identified in a structured database query. Using a natural language processing (NLP) engine, a set of tokens is identified in an unstructured data. Using the NLP engine, sets of variations are obtained corresponding to a subset of the set of parameters. A fit is found between a first token from the set of tokens and a first variant of a first parameter, the first variant of the first parameter being a member of a first set of variations corresponding to the first parameter. The first parameter in the structured database query is substituted with the first variant to produce a substituted query, wherein the substituted query produces a result set that is related to the unstructured data. |
Application publication number |
US2016063095(A1) |
Application publication date |
2016.03.03 |
Application number |
US201414469705 |
Filing date |
2014.08.27 |
Applicant |
International Business Machines Corporation |
Inventors |
Nassar Ahmed M.A.; Omar Eman; Rosengarten Evelyn M.; Trim Craig M. |
Classification numbers |
G06F17/30;G06F17/27 |
Main classification number |
G06F17/30 |
Agency |
|
Agent |
|
Main claim |
1. A method for unstructured data guided query modification, the method comprising:
identifying, using a processor and a memory, a set of parameters in a structured database query;
identifying, using a Natural language processing (NLP) engine, a set of tokens in an unstructured data;
obtaining, using the NLP engine, corresponding to a subset of the set of parameters, sets of variations, wherein a particular set of variations corresponds to a particular parameter in the subset of parameters;
finding a fit between a first token from the set of tokens and a first variant of a first parameter, the first variant of the first parameter being a member of a first set of variations corresponding to the first parameter; and
substituting the first parameter in the structured database query with the first variant to produce a substituted query, wherein the substituted query produces a result set that is related to the unstructured data. |
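The steps of the claim can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the `VARIATIONS` table is a hand-written stand-in for the sets of variations an NLP engine would supply, "parameters" are taken to be the single-quoted literals in the query, tokenization is a simple regular expression, and a "fit" is reduced to an exact case-insensitive match. The function names and the example query are hypothetical.

```python
import re

# Hypothetical variation sets; a real system would obtain these from an NLP engine.
VARIATIONS = {
    "car": ["automobile", "vehicle", "sedan"],
    "red": ["crimson", "scarlet"],
}

def identify_parameters(query):
    """Return the single-quoted string literals in the query, in order."""
    return re.findall(r"'([^']*)'", query)

def identify_tokens(text):
    """Return lowercased word tokens (a stand-in for NLP tokenization)."""
    return set(re.findall(r"[a-z]+", text.lower()))

def modify_query(query, text, variations=VARIATIONS):
    """Substitute each parameter with the first of its variants found in the text."""
    tokens = identify_tokens(text)
    for param in identify_parameters(query):
        for variant in variations.get(param.lower(), []):
            if variant.lower() in tokens:  # "fit" is an exact match in this sketch
                query = query.replace(f"'{param}'", f"'{variant}'")
                break
    return query

query = "SELECT * FROM listings WHERE type = 'car' AND color = 'red'"
text = "Looking for a crimson sedan with low mileage."
print(modify_query(query, text))
```

Parameters with no fitting variant are left unchanged, so a text that shares no vocabulary with the variation sets returns the original query. Plain string replacement is used only to keep the sketch short; it is not a safe way to rewrite real SQL.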
Address |
Armonk NY US |