Title of invention: Systems, methods, and media for outputting data based upon anomaly detection
Abstract: Systems, methods, and media for outputting data based on anomaly detection are provided. In some embodiments, a method for outputting data based on anomaly detection is provided, the method comprising: receiving, using a hardware processor, an input dataset; identifying grams in the input dataset that substantially include distinct byte values; creating an input subset by removing the identified grams from the input dataset; determining whether the input dataset is likely to be anomalous based on the identified grams, and determining whether the input dataset is likely to be anomalous by applying the input subset to a binary anomaly detection model to check for an n-gram in the input subset; and outputting the input dataset based on the likelihood that the input dataset is anomalous.
Publication number: US9003523(B2) Publication date: 2015.04.07
Application number: US201313891031 Filing date: 2013.05.09
Applicant: The Trustees of Columbia University in the City of New York Inventors: Stolfo Salvatore J; Wang Ke; Parekh Janak
Classification: G06F21/00; H04L29/06 Main classification: G06F21/00
Agency: Byrne Poh LLP Agent: Byrne Poh LLP
Main claim: 1. A method for outputting data based on anomaly detection, comprising: receiving, using a hardware processor, an input dataset; identifying commonly occurring grams in the input dataset that substantially include distinct byte values; creating an input subset by removing the identified grams from the input dataset; determining whether the input dataset is likely to be anomalous based on the identified grams, and determining whether the input dataset is likely to be anomalous by applying the input subset to a binary anomaly detection model to check for an n-gram in the input subset, wherein the binary detection model is represented using a Bloom filter; and outputting the input dataset based on the likelihood that the input dataset is anomalous.
Address: New York, NY, US
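Illustrative note: the claim above refers to a binary anomaly detection model represented as a Bloom filter and checked for n-grams. The following is a minimal sketch of that one idea only, not the patented implementation. The class and function names (BloomFilter, ngrams, train, anomaly_score), the filter size, the number of hashes, the use of salted SHA-256, and the scoring rule (fraction of unseen n-grams) are all assumptions made for illustration. The step of identifying and removing grams with distinct byte values is not modeled here, because this record does not specify how it is performed.

```python
import hashlib


class BloomFilter:
    """Small fixed-size Bloom filter over byte strings (hypothetical sizing)."""

    def __init__(self, num_bits=1 << 16, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8 + 1)

    def _positions(self, item):
        # Derive num_hashes bit positions by salting SHA-256 with an index byte.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(bytes([i]) + item).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        # True if every bit for this item is set (false positives are possible,
        # false negatives are not).
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


def ngrams(data, n):
    """Return all overlapping n-byte windows of data (empty if data is too short)."""
    return [data[i:i + n] for i in range(len(data) - n + 1)]


def train(model, normal_data, n):
    """Record every n-gram of known-normal data in the binary model."""
    for gram in ngrams(normal_data, n):
        model.add(gram)


def anomaly_score(model, data, n):
    """Assumed scoring rule: fraction of n-grams in data not seen during training."""
    grams = ngrams(data, n)
    if not grams:
        return 0.0
    unseen = sum(1 for gram in grams if gram not in model)
    return unseen / len(grams)


if __name__ == "__main__":
    model = BloomFilter()
    train(model, b"GET /index.html HTTP/1.1", n=3)
    print(anomaly_score(model, b"GET /index.html HTTP/1.1", n=3))
    print(anomaly_score(model, b"\x90\x90\x90\x90\xeb\x1f\x5e\x89", n=3))
```

Under these assumptions, data identical to the training input scores 0.0, since a Bloom filter never reports a stored item as absent, while input made of previously unseen byte sequences scores close to 1. A caller could then output or withhold the dataset by comparing the score to a threshold of its own choosing.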