发明名称 Method and Apparatus for Determining the Schema of a Graph Dataset
摘要 A schema for a dataset is identified by identifying a dataset comprising data and relationships between data pairs. An original schema is identified for the dataset. This original schema comprises an organizational structure. An initial fit between the dataset and the original schema is determined. The initial fit quantifying a conformity of the data in the dataset to the organizational structure of the original schema. A plurality of additional schemas are identified. Each additional schema is a distinct organizational schema. The dataset is partitioned into a plurality of subsets. Each subset comprises a modified fit quantifying a modified conformity of subset data in each subset to one of the original schema and the additional schemas. The modified fit is greater than the original fit.
申请公布号 US2015193478(A1) 申请公布日期 2015.07.09
申请号 US201414151768 申请日期 2014.01.09
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Arenas Marcelo;Diaz Gonzalo;Fokoue-Nkoutche Achille Belly;Kementsietsidis Anastasios;Srinivas Kavitha
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A method for determining a schema for a dataset, the method comprising: identifying a dataset comprising data and relationships between data pairs; identifying an original schema for the dataset, the original schema comprising an organizational structure; determining an initial fit between the dataset and the original schema, the initial fit quantifying a conformity of the data in the dataset to the organizational structure of the original schema; identifying a plurality of additional schemas, each additional schema comprising a distinct organizational schema; and partitioning the dataset into a plurality of subsets, each subset comprising a modified fit quantifying a modified conformity of subset data in each subset to one of the original schema and the additional schemas, the modified fit greater than the original fit.
地址 Armonk NY US