Title of invention |
SYSTEMS, METHODS, AND MEDIA FOR OUTPUTTING A DATASET BASED UPON ANOMALY DETECTION |
Abstract |
Systems, methods, and media for outputting a dataset based upon anomaly detection are provided. In some embodiments, methods for outputting a dataset based upon anomaly detection: receive a training dataset having a plurality of n-grams, which plurality includes a first plurality of distinct training n-grams each being a first size; compute a first plurality of appearance frequencies, each for a corresponding one of the first plurality of distinct training n-grams; receive an input dataset including first input n-grams each being the first size; define a first window in the input dataset; identify as being first matching n-grams, the first input n-grams in the first window that correspond to the first plurality of distinct training n-grams; compute a first anomaly detection score for the input dataset using the first matching n-grams and the first plurality of appearance frequencies; and output the input dataset based on the first anomaly detection score. |
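
The steps in the abstract can be sketched roughly as follows. This is a minimal illustration, not the claimed method: the function names (`ngrams`, `train_frequencies`, `window_score`, `output_if_normal`), the byte-level n-grams, the scoring formula (one minus the mean normalised training frequency of the matching n-grams in the window), and the threshold rule are all assumptions made for the example.

```python
from collections import Counter


def ngrams(data: bytes, n: int) -> list:
    """Return every contiguous n-gram (byte slice of length n) in data."""
    return [data[i:i + n] for i in range(len(data) - n + 1)]


def train_frequencies(training: bytes, n: int) -> dict:
    """Appearance frequency of each distinct training n-gram of size n."""
    counts = Counter(ngrams(training, n))
    total = sum(counts.values())
    return {gram: count / total for gram, count in counts.items()}


def window_score(data: bytes, freqs: dict, n: int, start: int, width: int) -> float:
    """Anomaly score for one window of the input (assumed formula).

    0.0 means every n-gram in the window is the most common training
    n-gram; 1.0 means no n-gram in the window was seen in training.
    """
    window = data[start:start + width]
    grams = ngrams(window, n)
    if not grams:
        return 0.0  # window too short to contain any n-gram
    if not freqs:
        return 1.0  # nothing was learned, so nothing can match
    max_freq = max(freqs.values())
    # Matching n-grams: those in the window that also occur in training.
    matching = [g for g in grams if g in freqs]
    support = sum(freqs[g] / max_freq for g in matching) / len(grams)
    return 1.0 - support


def output_if_normal(data: bytes, freqs: dict, n: int,
                     start: int, width: int, threshold: float):
    """Output the input dataset only if its score is within the threshold."""
    score = window_score(data, freqs, n, start, width)
    return data if score <= threshold else None


freqs = train_frequencies(b"abababab", 2)
print(window_score(b"abab", freqs, 2, 0, 4))
print(window_score(b"zzzz", freqs, 2, 0, 4))
```

Here an input resembling the training data scores close to 0 and is passed through, while an input made of unseen n-grams scores 1.0 and is withheld. A real system would likely choose the window position, n-gram size, and threshold from the data rather than fixing them by hand.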
Publication number |
WO2007100916(A2) |
Publication date |
2007.09.07 |
Application number |
WO2007US05408 |
Filing date |
2007.02.28 |
Applicant |
THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK;STOLFO, SALVATORE, J.;WANG, KE;PAREKH, JANAK |
Inventor |
STOLFO, SALVATORE, J.;WANG, KE;PAREKH, JANAK |
Classification number |
|
Main classification number |
|
Agency |
|
Agent |
|
Principal claim |
|
Address |
|