Title of Invention Optimization model for processing hierarchical data in stream systems
Abstract Provided are techniques for optimizing the processing of hierarchical data. A linear processing graph is received, wherein the linear processing graph includes a plurality of operators, wherein each operator in the plurality is connected to at least one other operator by an arc, wherein hierarchical data flows on the arcs, wherein the operators read and replace identified subregions within the hierarchical data flowing into the operators on the arcs, and wherein the operators do not modify the hierarchical data outside of these identified subregions. For each operator in the linear processing graph, a minimal set of dependent upstream operators on which that operator depends is found by examining how the identified subregions are created in the linear processing graph through obtaining a set of operators on which that operator depends, by analyzing dependencies carried by a set of vector nodes of the hierarchical data in an input schema of the operator, and, for each of the vector nodes, by analyzing an associated set of scalar nodes, wherein finding the minimal set of operators includes taking into consideration data preservation characteristics of the plurality of operators and taking into consideration structural-order preservation characteristics of the plurality of operators. The linear processing graph is rewritten to create a new graph that expresses dependencies based on the minimal set of dependent upstream operators for each operator.
Publication No. US7860863(B2) Publication Date 2010.12.28
Application No. US20070850588 Filing Date 2007.09.05
Applicant INTERNATIONAL BUSINESS MACHINES CORPORATION Inventor BAR-OR AMIR; BECKERLE MICHAEL JAMES
Classification G06F7/00 Main Classification G06F7/00
Agency Agent
Principal Claim
Address
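
The dependency analysis described in the abstract can be illustrated with a much-simplified sketch. It is not the patented method: it assumes each subregion is identified by a flat, non-nested path string, and it ignores the vector-node and scalar-node analysis as well as the data-preservation and structural-order-preservation characteristics the abstract mentions. The names `Operator`, `minimal_dependencies`, and `rewrite_graph` are hypothetical and do not come from the record.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Operator:
    """Hypothetical operator: reads some subregions, replaces (writes) others."""
    name: str
    reads: frozenset
    writes: frozenset


def minimal_dependencies(pipeline):
    """For each operator in a linear pipeline, find the upstream operators
    it actually depends on.

    Assumption: subregions are atomic path strings, so the dependency for a
    read is the nearest upstream operator that wrote exactly that path.
    """
    deps = {}
    for i, op in enumerate(pipeline):
        needed = set()
        for path in op.reads:
            # Walk upstream, nearest first, and stop at the first writer.
            for upstream in reversed(pipeline[:i]):
                if path in upstream.writes:
                    needed.add(upstream.name)
                    break
        deps[op.name] = needed
    return deps


def rewrite_graph(pipeline):
    """Return the arcs (upstream, downstream) of the rewritten graph."""
    deps = minimal_dependencies(pipeline)
    return sorted((up, name) for name, ups in deps.items() for up in ups)


# Illustrative linear pipeline: source -> A -> B -> C
pipeline = [
    Operator("source", frozenset(), frozenset({"customer", "orders"})),
    Operator("A", frozenset({"customer"}), frozenset({"customer_clean"})),
    Operator("B", frozenset({"orders"}), frozenset({"order_total"})),
    Operator("C", frozenset({"customer_clean", "order_total"}), frozenset({"report"})),
]

deps = minimal_dependencies(pipeline)
edges = rewrite_graph(pipeline)
```

In the linear graph, B sits downstream of A. Under the stated assumptions the sketch finds that B depends only on `source`, so the rewritten graph places A and B on independent branches that both feed C.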