发明名称 SYSTEMS AND/OR METHODS FOR DYNAMIC ANOMALY DETECTION IN MACHINE SENSOR DATA
摘要 Certain example embodiments relate to techniques for detecting anomalies in streaming data. More particularly, certain example embodiments use an approach that combines both unsupervised and supervised machine learning techniques to create a shared anomaly detection model in connection with a modified k-means clustering algorithm and advantageously also enables concept drift to be taken into account. The number of clusters k need not be known in advance, and it may vary over time. Models are continually trainable as a result of the dynamic reception of data over an unknown and potentially indefinite time period, and clusters can be built incrementally and in connection with an updatable distance threshold that indicates when a new cluster is to be created. Distance thresholds also are dynamic and adjustable over time.
申请公布号 US2016342903(A1) 申请公布日期 2016.11.24
申请号 US201514718277 申请日期 2015.05.21
申请人 Software AG USA, Inc. 发明人 SHUMPERT James Michael
分类号 G06N99/00;G06F11/07;G06N5/02 主分类号 G06N99/00
代理机构 代理人
主权项 1. A system for detecting anomalies in data dynamically received from a plurality of sensors associated with one or more machines, the system comprising: a knowledgebase; a model store; one or more interfaces configured to receive data from the plurality of sensors; and processing resources including at least one processor and a memory, the processing resources being configured, for each instance of data received via the one or more interfaces, to at least: classify, using a model retrieved from the model store, the respective instance as being one of a normal instance type and an anomalous instance type, the retrieved model being selected from the model store as being appropriate for the machine that produced the data in the respective instance if such a model exists in the model store;in response to a classification of the respective instance being a normal instance type, use the data in the respective instance to train the retrieved model;in response to a classification of the respective instance being an anomalous instance type that is not new, determine from the knowledgebase an action to be taken and take the determined action; andin response to a classification of the respective instance being an anomalous instance type that is new, seek confirmation from an authorized user as to whether the respective instance should be designated as a confirmed new anomalous instance type, and: responsive to confirmation from the authorized user that the respective instance is a new anomalous instance type, update the knowledgebase with information about the respective instance and/or an action to be taken should the new anomalous instance type be detected again; anduse the data in the respective instance to train the retrieved model; wherein each model in the model store is implemented using a k-means cluster algorithm modified so as to (a) be continually trainable as a result of the dynamic reception of data over an unknown and potentially indefinite time period, and (b) build clusters incrementally and in connection with an updatable distance threshold that indicates when a new cluster is to be created; and wherein each said model has a respective total number of clusters that is dynamic and learned over time.
地址 Reston VA US