发明名称 Computer-implemented systems and methods for efficient structuring of time series data
摘要 Systems and methods are provided for analyzing through one-pass of unstructured time stamped data of a physical process. A distribution of time-stamped unstructured data is analyzed to identify a plurality of potential hierarchical structures for the unstructured data. A hierarchical analysis of the potential hierarchical structures is performed to determine an optimal frequency and a data sufficiency metric for the potential hierarchical structures. One of the potential hierarchical structures is selected as a selected hierarchical structure based on the data sufficiency metrics. The unstructured data is structured according to the selected hierarchical structure and the optimal frequency associated with the selected hierarchical structure, where said structuring of the unstructured data is performed via a single pass though the unstructured data. The identified statistical analysis of the physical process is performed using the structured data.
申请公布号 US9244887(B2) 申请公布日期 2016.01.26
申请号 US201213548282 申请日期 2012.07.13
申请人 SAS Institute Inc. 发明人 Leonard Michael James;Crowe Keith Eugene;Christian Stacey M.;Beeman Jennifer Leigh Sloan;Elsheimer David Bruce;Blair Edward Tilden
分类号 G06F17/18;G06F7/00;G06F17/30;G06Q10/04;G06F11/34 主分类号 G06F17/18
代理机构 Kilpatrick Townsend & Stockton LLP 代理人 Kilpatrick Townsend & Stockton LLP
主权项 1. A computer-implemented method comprising: analyzing, using a time series engine, a distribution of unstructured time-stamped data to identify a plurality of potential time series data hierarchies for structuring the unstructured time-stamped data, wherein a potential time series data hierarchy is a framework for structuring the data using of multiple time series, andwherein the time series engine is at a server layer of a time series computing system; performing, using the time series engine, an analysis of the plurality of potential time series data hierarchies, wherein performing the analysis of the plurality of potential time series data hierarchies includes determining an optimal time series frequency and a data sufficiency metric for each of the plurality of potential time series data hierarchies; comparing data sufficiency metrics for the plurality of potential time series data hierarchies; selecting a hierarchy of the plurality of potential time series data hierarchies based on the comparison of the data sufficiency metrics; structuring the unstructured time-stamped data into structured time-stamped data according to the hierarchy and the optimal time series frequency, wherein structuring the transformed time-stamped data into the structured time-stamped data is performed using a single pass of the unstructured time-stamped data through the time series engine; computing a plurality of transformations of the structured time-stamped data using the single pass of the structured time-stamped data through the time series engine; transforming the structured time-stamped data into transformed time-stamped data according to the plurality of transformations; and providing, using an application programming interface, the transformed time-stamped data for visual presentation.
地址 Cary NC US