摘要 |
A discretization method for a database attribute containing a population of individuals, said attribute known as the source attribute, capable of assuming several modalities, the method characterized by an initial stage in which said source attribute modalities are regrouped into elementary groups, and a source and a target attribute contingency table is used to determine from among a set of elementary group pairs in a second stage the pair of elementary groups whose merger most extensively decreases the probability of independence of the source and the target attribute, and in a third stage the pair of elementary groups thus determined is merged, said second and third stages being iterative inasmuch as there is a pair of elementary groups allowing for said probability of independence to be decreased.
|