发明名称 CORRELATION OF DATA SETS USING DETERMINED DATA TYPES
摘要 A computer receives a data set and determines the data type of the column data within. The computer identifies a second data set with columns of the same data type. The computer compares the contents of the columns and the formatting of the contents to determine a score representative of the relevancy of the data sets to one another. Responsive to the score exceeding a threshold, the computer suggests the second data set to a user.
申请公布号 US2015032609(A1) 申请公布日期 2015.01.29
申请号 US201313952714 申请日期 2013.07.29
申请人 International Business Machines Corporation 发明人 Abuelsaad Tamer E.;Boss Gregory J.;Trim Craig M.
分类号 G06F17/30;G06Q20/10 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer-implemented method for correlating data sets, the method comprising: determining, by a computer system, for a first data set comprising one or more columns, each column comprising column data, a data type of the column data of a first column of the first data set; identifying, by the computer system, a second column of a second data set associated with the data type; comparing, by the computer system, the column data of the first and second columns and, in response, determining, by the computer system, a score representing a degree of relevance between the first and second data columns; and determining, by the computer system, whether the score exceeds a threshold and, if so, suggesting, by the computer system, the second data set to a user.
地址 Armonk NY US