发明名称 Parallel processing of semantically grouped data in data warehouse environments
摘要 A system and method for parallel processing of semantically grouped data in data warehouse environments is disclosed. A datastore object having a number of records is generated in a data warehouse application. A hash value is added to each record. The hash value has an integer domain, and is uniformly distributed over the integer domain across the datastore object. A selection table is generated to create a number of tasks based on discrete ranges of the hash value. Then, a transformation routine is executed on each of the number of tasks in parallel to generate an infocube of data that corresponds to each range of the discrete ranges of the hash value.
申请公布号 US8892502(B2) 申请公布日期 2014.11.18
申请号 US201113314076 申请日期 2011.12.07
申请人 SAP SE 发明人 Hermann Alexander;Jakschitsch Hannes
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Mintz Levin Cohn Ferris Glovsky and Popeo, P.C. 代理人 Mintz Levin Cohn Ferris Glovsky and Popeo, P.C.
主权项 1. A computer-implemented method comprising: generating a datastore object by one or more processors of a data warehouse application, the datastore object having a number of records; adding, by the one or more processors, a hash value to each record, the hash value having an integer domain, the one or more processors uniformly distributing the hash value over the integer domain across the datastore object; generating, by the one or more processors, a selection table to create a number of tasks based on discrete ranges of the hash value; and executing, by the one or more processors, a transformation routine on each of the number of tasks in parallel to generate an infocube of data of a plurality of infocubes of data, each infocube corresponding to a respective range of the discrete ranges of the hash value.
地址 Walldorf DE