发明名称 Method and system for preserving privacy of a dataset
摘要 A method and a system for preserving privacy of a dataset are provided. In the method, a k-anonymity value with respect to a sensitive data field is determined according to at least one first quasi-identifier. Data entries in each group have the same value in the one or more fields of the first quasi-identifier and data entries in different groups have different values in the one or more fields of the first quasi-identifier. A first group and a second group among the plurality of groups are determined according to the reference number Kr, where the first group and the second group are merged into a merging group. The number of data entries in the merging group is not less than a reference number Kr. One or more fields of at least one first quasi-identifier is masked for the merging group.
申请公布号 US8812524(B2) 申请公布日期 2014.08.19
申请号 US201213586891 申请日期 2012.08.16
申请人 Industrial Technology Research Institute;International Business Machines Corporation 发明人 Chen Ya-Ling;Lan Ci-Wei;Grandison Tyrone W;Hsiao Jen-Hao;Tseng Li-Feng;Chen Yi-Hui
分类号 G06F17/30;G06F21/62 主分类号 G06F17/30
代理机构 Jianq Chyun IP Office 代理人 Jianq Chyun IP Office
主权项 1. A method for preserving privacy of a dataset, where the dataset has at least a sensitive data field and one or more fields of at least one first quasi-identifier, the method comprising: determining a k-anonymity value K with respect to the sensitive data field according to the at least one first quasi-identifier; determining to adopt the at least one first quasi-identifier to categorize the dataset into a plurality of groups, if the k-anonymity value K is less than a reference number Kr, wherein data entries in each of the plurality of groups have the same value in the one or more fields of at least one first quasi-identifier and data entries in different groups of the plurality of groups have different values in the one or more fields of at least one first quasi-identifier; determining the number of data entries in each of the plurality of groups; determining a first group among the plurality of groups, wherein the number of data entries, N1, in the first group is less than the reference number Kr; determining a second group among the plurality of groups, whereby when the first group and the second group are merged into a merging group, the number of data entries, Nm, in the merging group is not less than the reference number Kr; and masking the one or more fields of at least one first quasi-identifier for the merging group.
地址 Hsinchu TW