发明名称 Local Causal and Markov Blanket Induction Method for Causal Discovery and Feature Selection from Data
摘要 In many areas, recent developments have generated very large datasets from which it is desired to extract meaningful relationships between the dataset elements. However, to date, the finding of such relationships using prior art methods has proved extremely difficult especially in the biomedical arts. Methods for local causal learning and Markov blanket discovery are important recent developments in pattern recognition and applied statistics, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. The present invention provides a generative method for learning local causal structure around target variables of interest in the form of direct causes/effects and Markov blankets applicable to very large datasets and relatively small samples. The method is readily applicable to real-world data, and the selected feature sets can be used for causal discovery and classification. The generative method GLL-PC can be instantiated in many ways, giving rise to novel method variants. In general, the inventive method transforms a dataset with many variables into either a minimal reduced dataset where all variables are needed for optimal prediction of the response variable or a dataset where all variables are direct causes and direct effects of the response variable. The power of the invention and significant advantages over the prior art were empirically demonstrated with datasets from a diversity of application domains (biology, medicine, economics, ecology, digit recognition, text categorization, and computational biology) and data generated by Bayesian networks.
申请公布号 US2011307437(A1) 申请公布日期 2011.12.15
申请号 US20100700689 申请日期 2010.02.04
申请人 ALIFERIS KONSTANTINOS (CONSTANTIN) F.;STATNIKOV ALEXANDER 发明人 ALIFERIS KONSTANTINOS (CONSTANTIN) F.;STATNIKOV ALEXANDER
分类号 G06N5/02 主分类号 G06N5/02
代理机构 代理人
主权项
地址