发明名称 High dimensional data mining and visualization via gaussianization
摘要 A method is provided for providing high dimensional data. The high dimensional data is linearly transformed into less dependent coordinates, by applying a linear transform of n rows by n columns to the high dimensional data. Each of the coordinates are marginally Gaussianized, the Gaussianization being characterized by univariate Gaussian means, priors, and variances. The transforming and Gaussianizing steps are iteratively repeated until the coordinates converge to a standard Gaussian distribution. The coordinates of all iterations are arranged hierarchically to facilitate data mining. The arranged coordinates are then mined. According to an embodiment of the invention, the transform step includes applying an iterative maximum likelihood expectation maximization (EM) method to the high dimensional data.
申请公布号 US6591235(B1) 申请公布日期 2003.07.08
申请号 US20000565365 申请日期 2000.05.05
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 CHEN SCOTT SHAOBING;GOPINATH RAMESH AMBAT
分类号 G06K9/62;(IPC1-7):G10L11/00 主分类号 G06K9/62
代理机构 代理人
主权项
地址