发明名称 |
NORMALIZING VALUES IN DATA TABLES |
摘要 |
A computer-implemented method for normalizing data tables, where a lexical values and the structure of data within a data table are identified and interpreted. The data table is transformed into a tree form, representing a hierarchical relationship among the lexical values. Information to be normalized is identified. A normalization dictionary with aggregated statistical information of lexical values and corresponding word-senses is generated. And information to be normalized is normalized based on the normalization dictionary. |
申请公布号 |
US2017052985(A1) |
申请公布日期 |
2017.02.23 |
申请号 |
US201615195233 |
申请日期 |
2016.06.28 |
申请人 |
International Business Machines Corporation |
发明人 |
Guggilla Chinnappa;Mustafi Joy |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A computer-implemented method for normalizing data tables, comprising:
identifying and interpreting a plurality of lexical values within a data table; identifying and interpreting a structure of the data table; transforming the data table into a tree form, wherein the tree form simulates a hierarchical tree structure representing a relationship among the plurality of lexical values as a set of linked nodes; identifying information to be normalized, wherein the information to be normalized is not defined within the data table; generating a normalization dictionary comprising one or more arrays of aggregated statistical information of lexical values and corresponding word-senses; and normalizing the information to be normalized using the normalization dictionary. |
地址 |
Armonk NY US |