发明名称 System for storage, querying, and analysis of time series data
摘要 A system for storing time series data includes an ingester that prepares metadata indices associated with blocks of incoming time series data and stores the blocks of data in a time series database and the indices in a separate index database. The time series database distributes storage of the data blocks among multiple data nodes. A query layer receives queries and uses the index database to determine which data blocks are needed to process the query, and then requests only those data blocks from the time series database. Processing of the query is performed within the time series database only on those data nodes that contain relevant data, and partial results are passed to an output layer for formation into a final query result.
申请公布号 US9589031(B2) 申请公布日期 2017.03.07
申请号 US201514826807 申请日期 2015.08.14
申请人 General Electric Company 发明人 Lin Jerry;Aggour Kareem Sherif;Courtney Brian Scott;Interrante John Alan;LaComb Christina Ann;Mathur Sunil;McConnell Christopher Thomas;Snell Quinn
分类号 G06F17/30 主分类号 G06F17/30
代理机构 Schwegman Lundberg & Woessner, P.A. 代理人 Schwegman Lundberg & Woessner, P.A.
主权项 1. A method of offering a data service for storage and querying of real-time asset operational data, comprising: accessing, from a data source, time stamped data; creating a plurality of data blocks, each data block comprising a portion of the time stamped data with at least one common metadata attribute and a plurality of indices, each index associated with a corresponding data block, each index comprising metadata associated with the corresponding data block; storing the plurality of indices in an index database; storing the plurality of data blocks in a time series database comprising a plurality of computing devices, each of which stores a portion of the plurality of data blocks; receiving, at a query layer from a requestor, a query that specifies criteria defining a set of data retrieved from the service; requesting from the index database the indices associated with data blocks stored in the time series database comprising time stamped data accessed to evaluate the query; preparing a sub-query that produces appropriate data matching the criteria, the sub-query comprising the criteria and a logical operation performed on data matching the criteria; sending the sub-query to an evaluator resident on each of the computing devices that corresponds to the data blocks identified in the requesting operation; receiving at the evaluator the sub-query from the query layer; evaluating the criteria specified in the sub-query with respect to the data blocks stored on the same computing devices as the evaluator in order to select a subset of data; performing the logical operation specified in the sub-query on the subset of data to produce a sub-result; returning the sub-result to an output handler; receiving, at an output handler, the sub-results produced in response to each sub-query; combining the sub-results into a query result; and returning the query result to the requestor.
地址 Schenectady NY US