Title of Invention |
SYSTEM AND METHODS FOR ANALYSIS OF DATA |
Abstract |
Data processing that includes a universal metric to quantify and estimate the similarity and dissimilarity between data sets. Data streams are perfectly annihilated by a correct realization of their anti-streams. Any deviation of the collision product from a baseline, for example flat white noise, quantifies statistical dissimilarity. The invention relates generally to data mining and, more specifically, to the analysis of data using a universal metric to quantify and estimate the similarity and dissimilarity between sets of data. |
Publication Number |
US2015242469(A1) |
Publication Date |
2015.08.27 |
Application Number |
US201314431131 |
Filing Date |
2013.09.27 |
Applicant |
CORNELL UNIVERSITY |
Inventors |
Chattopadhyay Ishanu; Lipson Hod |
Classification Number |
G06F17/30 |
Main Classification Number |
G06F17/30 |
Agency |
|
Agent |
|
Principal Claim |
1. A computer method for analyzing data, comprising the steps of:
encoding a first data set to obtain a first encoded data set;
encoding a second data set to obtain a second encoded data set;
inverting the second encoded data set to obtain an inverted data set;
performing summation of the first encoded data set and the inverted data set to generate a summed data set;
encoding a baseline data set to obtain a baseline encoded data set;
comparing the summed data set to the baseline encoded data set; and
identifying one or more dissimilarities between the first data set and the second data set. |
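The sequence of steps in claim 1 can be illustrated with a minimal Python sketch. The record does not specify how encoding, inversion, summation, or comparison are realized, so every function below is an assumption chosen for illustration: a binary threshold encoder, inversion by symbol flipping, summation that keeps only the positions where both streams agree, and comparison by total variation distance between symbol frequencies. The function names and the 0.5 threshold are hypothetical, and the two input data sets are assumed to be independent recordings.

```python
import random
from collections import Counter


def encode(values, threshold=0.5):
    """Assumed encoder: map each real value to a binary symbol."""
    return [1 if v >= threshold else 0 for v in values]


def invert(stream):
    """Assumed inversion for a binary alphabet: flip every symbol."""
    return [1 - s for s in stream]


def summation(a, b):
    """Assumed summation: keep a symbol only where both streams agree."""
    return [x for x, y in zip(a, b) if x == y]


def symbol_freqs(stream, alphabet=(0, 1)):
    """Relative frequency of each symbol in a non-empty stream."""
    if not stream:
        raise ValueError("empty stream: nothing to compare")
    counts = Counter(stream)
    return {s: counts[s] / len(stream) for s in alphabet}


def deviation(summed, baseline):
    """Total variation distance between the two symbol distributions."""
    fs, fb = symbol_freqs(summed), symbol_freqs(baseline)
    return 0.5 * sum(abs(fs[s] - fb[s]) for s in fs)


def dissimilarity(first, second, baseline_data):
    """Run the claimed steps in order and return a deviation score."""
    first_enc = encode(first)                 # encode first data set
    second_enc = encode(second)               # encode second data set
    inverted = invert(second_enc)             # invert second encoded set
    summed = summation(first_enc, inverted)   # sum first with inverted
    baseline_enc = encode(baseline_data)      # encode baseline data set
    return deviation(summed, baseline_enc)    # compare to baseline


if __name__ == "__main__":
    rng = random.Random(0)
    n = 20000
    # Two independent samples from the same skewed source, one from a
    # differently skewed source, and uniform noise as the baseline.
    high_a = [rng.random() ** 0.5 for _ in range(n)]
    high_b = [rng.random() ** 0.5 for _ in range(n)]
    low = [rng.random() ** 2 for _ in range(n)]
    noise = [rng.random() for _ in range(n)]
    print("same source:     ", dissimilarity(high_a, high_b, noise))
    print("different source:", dissimilarity(high_a, low, noise))
```

Under these assumptions, two independent samples from the same source produce a summed stream whose symbol frequencies sit close to the flat baseline, while samples from different sources produce a visibly skewed one. The sketch compares single-symbol frequencies only, so it cannot detect differences that appear solely in temporal structure, and summing a stream with its own inversion yields an empty stream, which `symbol_freqs` rejects.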
Address |
Ithaca NY US |