Title of invention: GRAPHIC REPRESENTATIONS OF DATA RELATIONSHIPS
Abstract: Presenting a diagram indicating relationships among data items stored in a data management system includes: receiving a request that identifies a first data item stored in the data management system from a user interface; retrieving stored configuration information that includes a plurality of selection specifications for selecting data items in the data management system that are related to a given data item of a predetermined type, where each selection specification is associated with a different respective predetermined type; querying the data management system to identify a set of one or more data items according to a selection specification from the configuration information that is associated with a type of the first data item; for each of multiple returned data items in the identified set, querying the data management system to determine whether additional data items are identified according to a selection specification from the configuration information that is associated with a type of the returned data item; generating a diagram indicating relationships among data items identified using the configuration information; and presenting the generated diagram over the user interface.
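The type-keyed selection specifications described in the abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and does not come from the patent: the `Item` class, the in-memory `STORE` standing in for the data management system, the relation names, and the choice to model a selection specification as a list of relation names to follow.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    """A data item identified by an id and a predetermined type (hypothetical)."""
    item_id: str
    item_type: str


# Hypothetical stand-in for the data management system:
# maps (item id, relation name) to the related items.
STORE = {
    ("ds_orders", "read_by"): [Item("job_load", "program")],
    ("job_load", "writes"): [Item("ds_warehouse", "dataset")],
}

# Stored configuration information: one selection specification per item type.
# In this sketch a specification is just the relation names to follow.
SELECTION_SPECS = {
    "dataset": ["read_by"],
    "program": ["writes"],
}


def select_related(item):
    """Query the store with the specification associated with the item's type."""
    related = []
    for relation in SELECTION_SPECS.get(item.item_type, []):
        related.extend(STORE.get((item.item_id, relation), []))
    return related
```

Calling `select_related(Item("ds_orders", "dataset"))` applies the `dataset` specification, while the returned `program` item would in turn be queried with the `program` specification. Keeping the specifications in configuration rather than in code is what lets the same traversal handle different item types.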
Publication number: US2017091345(A1); Publication date: 2017.03.30
Application number: US201615378322; Filing date: 2016.12.14
Applicant: Ab Initio Technology LLC; Inventors: Brainerd Jeffrey; Morse Alan
Classification: G06F17/30; Main classification: G06F17/30
Agency: (not listed); Agent: (not listed)
Principal claim: 1. A method for presenting a data lineage diagram indicating relationships among metadata items stored in a metadata management system accessible to a computing system, the method including: receiving a request that identifies a first metadata item, stored in the metadata management system, from a user interface, the first metadata item including metadata describing first stored data that corresponds to at least a portion of a dataset or an executable program and is stored in a first data source different from the metadata management system; providing a plurality of selection specifications for executing queries from the computing system to select metadata items in the metadata management system that are related to a given metadata item; querying the metadata management system using one or more of the plurality of selection specifications to identify a set of one or more metadata items that are related to the first metadata item, where a first selection specification from the plurality of selection specifications is associated with the first metadata item; for each of multiple returned metadata items in the identified set, querying the metadata management system to identify additional metadata items according to a selection specification from the plurality of selection specifications that is associated with the returned metadata item, including performing multiple iterations of queries, where each iteration includes identifying another set of one or more metadata items each related to at least one metadata item of a previous set of one or more metadata items from a previous iteration, and where iterations of queries are performed until no more metadata items are found that are related to any of the previous sets of one or more metadata items; and generating a data lineage diagram indicating data lineage relationships among programs and data represented by the metadata items identified using the querying, where the data lineage diagram includes nodes that correspond to the programs and data represented by the metadata items and edges that represent the data lineage relationships among the programs and data represented by the metadata items, and the edges include two or more of: an edge representing a data lineage relationship between a node representing the first stored data and a node representing stored data that corresponds to at least a portion of a dataset or an executable program stored in a second data source different from the first data source and the metadata management system; an edge representing a downstream data lineage relationship between the node representing the first stored data and a node representing stored data that corresponds to at least a portion of a dataset or an executable program affected by the first stored data; and an edge representing an upstream data lineage relationship between the node representing the first stored data and a node representing stored data that corresponds to at least a portion of a dataset or an executable program from which the first stored data is derived.
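The iterative querying in the claim, where each iteration finds items related to the previous set and iteration stops once nothing new is found, can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the claimed implementation: the `FEEDS` list is hypothetical metadata, the function name `build_lineage` is invented here, and edges are modeled as `(source, target, direction)` tuples rather than as a rendered diagram.

```python
from collections import defaultdict

# Hypothetical metadata: each pair means "source feeds target".
FEEDS = [
    ("raw_orders", "clean_job"),
    ("clean_job", "clean_orders"),
    ("clean_orders", "report_job"),
    ("report_job", "sales_report"),
]


def build_lineage(first_item, feeds=FEEDS):
    """Return (nodes, edges) reachable upstream and downstream of first_item."""
    downstream = defaultdict(list)
    upstream = defaultdict(list)
    for src, dst in feeds:
        downstream[src].append(dst)
        upstream[dst].append(src)

    nodes = {first_item}
    edges = set()
    # Walk each direction separately so unrelated siblings are not pulled in.
    for direction, index in (("downstream", downstream), ("upstream", upstream)):
        seen = {first_item}
        previous = {first_item}
        # One loop pass per iteration of queries; stop when nothing new is found.
        while previous:
            current = set()
            for item in previous:
                for related in index.get(item, []):
                    if direction == "downstream":
                        edges.add((item, related, direction))
                    else:
                        edges.add((related, item, direction))
                    if related not in seen:
                        seen.add(related)
                        current.add(related)
            previous = current
        nodes |= seen
    return nodes, edges
```

For example, `build_lineage("clean_orders")` collects all five items and four labeled edges from the sample data. The `seen` set is what guarantees termination even if the metadata contains cycles.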
Address: Lexington MA US