发明名称 NORMALIZING VALUES IN DATA TABLES
摘要 A computer-implemented method for normalizing data tables, where a lexical values and the structure of data within a data table are identified and interpreted. The data table is transformed into a tree form, representing a hierarchical relationship among the lexical values. Information to be normalized is identified. A normalization dictionary with aggregated statistical information of lexical values and corresponding word-senses is generated. And information to be normalized is normalized based on the normalization dictionary.
申请公布号 US2017052985(A1) 申请公布日期 2017.02.23
申请号 US201615195233 申请日期 2016.06.28
申请人 International Business Machines Corporation 发明人 Guggilla Chinnappa;Mustafi Joy
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer-implemented method for normalizing data tables, comprising: identifying and interpreting a plurality of lexical values within a data table; identifying and interpreting a structure of the data table; transforming the data table into a tree form, wherein the tree form simulates a hierarchical tree structure representing a relationship among the plurality of lexical values as a set of linked nodes; identifying information to be normalized, wherein the information to be normalized is not defined within the data table; generating a normalization dictionary comprising one or more arrays of aggregated statistical information of lexical values and corresponding word-senses; and normalizing the information to be normalized using the normalization dictionary.
地址 Armonk NY US