发明名称 System and method for storage, querying, and analysis service for time series data
摘要 A service for storing time series data provides a data pipe for receiving time series data, a query pipe for making requests to the service, and a result pipe for receiving output from the service. Data sent to the query pipe is processed by an ingester that prepares metadata indices associated with blocks of incoming time series data and stores the blocks of data in a time series database and the indices in a separate index database. A query layer receives queries from the query pipe and uses the index database to determine which data blocks are needed to process the query, and then requests only those data blocks from the time series database. Processing of the query is performed within the time series database only on those data nodes that contain relevant data, and partial results are passed to an output layer for formation into a final query result which is sent out by the results pipe.
申请公布号 US9087098(B2) 申请公布日期 2015.07.21
申请号 US201213716567 申请日期 2012.12.17
申请人 General Electric Company 发明人 Lin Jerry;Aggour Kareem Sherif;Courtney Brian Scott;Interrante John Alan;LaComb Christina Ann;Mathur Sunil;McConnell Christopher Thomas;Snell Quinn
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Schwegman Lundberg & Woessner, P.A. 代理人 Schwegman Lundberg & Woessner, P.A.
主权项 1. A method of offering a service for storage and querying of real-time asset operational data, comprising: providing a data pipe, comprising a connection through which data flows from a data generator to an ingester, the ingester configured to perform operations comprising: receiving time stamped data from the data generator;reading the received time stamped data;creating a data block and an index associated with the time stamped data;storing the index in an index database; andstoring the data block in a time series database in which the time series database stores the data blocks across a plurality of computing devices; providing a query pipe, comprising a connection through which a query flows from a requestor to a query layer, the query layer configured to perform operations comprising: receiving a query that specifies criteria defining a set of data retrieved from the service;requesting from the index database the indices associated with the data blocks stored in the time series database that are needed to evaluate the query;preparing a sub-query that produces appropriate data matching the criteria, the sub-query comprising the criteria and a logical operation performed on data matching the criteria; andsending the sub-query to an evaluator running on each of the computing devices that corresponds to the data blocks identified in the requesting operation, above, the evaluator configured to perform operations comprising: receiving the sub-query from the query layer;evaluating the criteria specified in the sub-query with respect to the data blocks stored on the same computing devices as the evaluator in order to select a subset of data;performing the logical operation specified in the sub-query on the subset of data to produce a sub-result; andreturning the sub-result to an output handler;receive the sub-result from the evaluators specified in the sending operation, above; providing a result pipe, comprising a connection through which a result flows from the output handler to the requestor, the output handler configured to perform operations comprising: receiving the sub-results produced in response to each sub-query;combining the sub-results into a query result; andreturning the result to the requestor.
地址 Schenectady NY US