发明名称 SYSTEM AND METHODS FOR ANALYSIS OF DATA
摘要 Data processing including a universal metric to quantify and estimate the similarity and dissimilarity between data sets. Data streams are perfectly annihilated by a correct realization of their anti-streams. Any deviation of the collision product from a baseline, for example flat white noise, quantifies statistical dissimilarity. The invention relates generally to data mining. More specifically, the invention relates to the analysis of data using a universal metric to quantify and estimate the similarity and dissimilarity between sets of data.
申请公布号 US2015242469(A1) 申请公布日期 2015.08.27
申请号 US201314431131 申请日期 2013.09.27
申请人 CORNELL UNIVERSITY 发明人 Chattopadhyay Ishanu;Lipson Hod
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A computer method for analyzing data, comprising the steps of: encoding a first data set to obtain a first encoded data set; encoding a second data set to obtain a second encoded data set; inverting the second encoded data set to obtain an inverted data set; performing summation of the first encoded data set and the inverted data set to generate a summed data set; encoding a baseline data set to obtain a baseline encoded data set; comparing the summed data set to the baseline encoded data set; and identifying one or more dissimilarities between the first data set and the second data set.
地址 Ithaca NY US