摘要 |
<p>Multidimensional data is reduced by measuring n observations out of a first data set including discrete independent variables, such as genetic or environmental factors. A number of observations in which a dependent variable has a first value is determined, such as a disease-indicative value, and a number of observations in which the dependent variable has a second value, such as a disease-free value, is determined. Combinations are formed of the independent variables to produce a second data set. In each combination of independent variables, a ratio of the number of observations with the dependent variable having the first value to the number of observations with the dependent variable having the second value is determined. Each ratio is compared to one or more thresholds or ranges of thresholds to determine which combination of independent variables optimally discriminates observations with the dependent variable having the first value and observations with the dependent variable having the second value. Those combinations having ratios at the approximately the same threshold or within the same range of thresholds are pooled together to produce a third data set having a smaller number of independent variable combinations than the second data set.</p> |