发明名称 BINARY PREDICTION TREE MODELING WITH MANY PREDICTORS
摘要 The statistical analysis of the invention is a predictive statistical tree model that overcomes several problems observed in prior statistical models and regression analyses, while ensuring greater accuracy and predictive capabilities. The claimed model can be used for a variety of applications including the prediction of disease states, susceptibility of disease states or any other biological state of interest, as well as other applicable non-biological states of interest. The model as applied to genetic applications generates a statistically significant number of cluster-derived singular factors called metagenes, that characterize multiple patterns of expression of the genes across samples. Formal predictive analysis then uses the metagenes in a Bayesian classification tree analysis which generates multiple recursive partitions of the sample into subgroups (the "leaves" of the classification tree), and associates Bayesian predictive probabilities of outcomes with each subgroup. Overall predictions for an individual sample are then generated by averaging predictions, with appropriate weights, across many such tree models. The model includes the use of iterative out-of-sample, cross-validation predictions leaving each sample out of the data set one at a time, refitting the model from the remaining samples and using it to predict the hold-out case. This rigorously tests the predictive value of a model and mirrors the real-world prognostic context where prediction of new cases as they arise is the major goal.
申请公布号 EP1565877(A1) 申请公布日期 2005.08.24
申请号 EP20020795699 申请日期 2002.11.12
申请人 DUKE UNIVERSITY 发明人 WEST, MICHAEL
分类号 G06G7/48;G06G7/58;G06N3/00;G06N5/02;G06N7/00 主分类号 G06G7/48
代理机构 代理人
主权项
地址