摘要 |
A system and method are provided for discovering significant patterns from a list of records in a dataset. Each record includes a set of items, and each significant pattern includes a subset of items such that a significance of the pattern exceeds a significance level. A significance is computed for each item in the list of records to determine significant items. The records are randomly sampled to select a sample portion of the records. Ambiguous patterns are identified against the sample portion of the records and verified against the entire list of records in the dataset. |