发明名称 Data profiling
摘要 There is provided a method and system for processing data. The method includes: identifying (1706) a plurality of subsets of fields of data records of a data source, at least one of the subsets of fields being a subset of a first field and a second field; determining co-occurrence statistics for each of the plurality of subsets, including: partitioning (1714) the data records into parts; for each part, forming (1718) data elements, each data element identifying the first field and the second field and identifying a pair of values occurring in the first and second fields in one of the data records, and; identifying (1728) one or more of the plurality of subsets as having a functional relationship among the fields of the identified subset.
申请公布号 EP2261820(A3) 申请公布日期 2010.12.29
申请号 EP20100009155 申请日期 2004.09.15
申请人 AB INITIO TECHNOLOGY LLC 发明人 GOULD JOEL;FEYNMAN CARL;BAY PAUL
分类号 G06F17/30;G06F17/00 主分类号 G06F17/30
代理机构 代理人
主权项
地址