主权项 |
1. A method for detecting quasi-identifiers in a dataset using a set of computing tasks, the dataset having a plurality of records and further having a set of attributes, each record having an attribute value for each attribute in the set of attributes, the method comprising:
generating a first index for the dataset, the first index having an index indicator for each attribute value of each record, each index indicator specifying a set of records, the specified set of records including each record in the plurality of records having the same attribute value for the associated attribute as the associated record; assigning an attribute combination to each task in the set of computing tasks, the attribute combination for each task including one or more attributes of the set of attributes; assigning a subset of the plurality of records to each task in the set of computing tasks; detecting at least one quasi-identifier by passing each task to at least one thread for execution on computing resources, the execution of each task comprising inspecting the index indicator for each attribute value in the assigned attribute combination of at least a portion of the assigned subset of the plurality of records to produce a result, the result of at least one task identifying a unique record for the associated attribute combination, the attribute values in the attribute combination for the unique record different from the attribute values in the attribute combination for all other records in the plurality of records, the at least one quasi-identifier being the attribute combination assigned to the at least one task identifying a unique record. |