An Enhanced K-Anonymity Model against Homogeneity Attack
Abstract
k-anonymity is an important model in the field of privacy protection and it is an effective method to prevent privacy disclosure in micro-data release. However, it is ineffective for the attribute disclosure by the homogeneity attack. The existing models based on k-anonymity have solved this problem to a certain extent, but they did not distinguish the different values of the sensitive attribute, processed a series of unnecessary generalization and expanded the information loss when they protect the sensitive attribute. Based on k-anonymity, this paper proposed a model based on average leakage probability and probability difference of sensitive attribute value. It is not only an effective method to deal with the problem of attributes disclosure that k-anonymity cannot deal, but also to realize different levels of protection to the various sensitive attribute values. It has reduced the generalization to the data in the most possibility during the procedure and ensures the most effectiveness of quasi-identifier attributes. Greedy generalization algorithm based on the generalization information loss is also proposed in this paper. To choose the generalization attributes, the information loss is considered and the importance of generalization attribute to sensitive attribute is accounted as well. Comparison experiment and performance experiment are made to the proposed model. The experiment results show that the model is feasible.
Keywords
References
[1] Dan Zhu, Xiao-Bai Li, Shuning Wu. “Identity disclosure protection: A data reconstruction approach for privacy-preserving data mining”. Decision Support Systems. December 2009, Volume 48, Pages 133-140
http://dx.doi.org/10.1016/j.dss.2009.07.003
[2] Sweeney L. “k-anonymity: a model for protecting privacy”. International Journal on Uncertainty, Fuzziness and Kno-wledge-based Systems, 2002, 10(5): 557-570.
http://dx.doi.org/10.1142/S0218488502001648
[3] Sweeney L. “Achieving k-anonymity privacy protection using generalization and suppression”. International Journal on Uncertainty, Fuzziness and Knowledge-based Systems, 2002, 10(5): 571-588
http://dx.doi.org/10.1142/S021848850200165X
[4] T.M.Truta and B.Vinay. “Privacy protection: p-sensitive k-anonymity property”. In Proceedings of the 22nd Internati-onal Conference on Data Engineering Workshops, the Second Internation Work on Privccy Data Management (PDM’06), 2006, 94
[5] X.xun, S.Hua and W.Jiuyong Li. “(p+, α)-sensitive k-anonymity: A new enhanced privacy protection model”. In Pr-oceedings 2008 IEEE 8th International Conference on Computer and Information Technology, 2008, 59-64
[6] A.Machanavajjhala, J.Gehrke, and D.Kifer. “l-diversity: Privacy beyond k-anonymity”. Proc.22nd Intnl. Conf.DataEngg. (ICDE), 2006, 24
[7] Xiao sun, Min Li, Hua Wang. “A family of enhanced models for privacy preserving data publishing”. Future Generation Computer Systems. 2011, Pages 348-356
[8] Qian Wang, Xiangling Shi. “(a, d)-Diversity: Privacy Protection Based on l-Diversity”. 2009 WRI World Congress on Software Engineering, WCSE (2009)
[9] W.Chiwing, L.Jiuyong and W.Ke. “(a, k)-Anonymity: an enhanced k-anonymity model for privacy preserving data publishing”. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Da-ta mining, 2006, 754-759
[10] Yan Zhao, Jian Wang, Yongcheng Luo. “(α, β, k)-anonymity: An effective privacy preserving model for databases”. Test and Measurement, 2009. ICTM '09. International Conference on. Page(s): 412 – 415
[11] J. Li, Y. Tao, and X. Xiao. “Preservation of proximity privacy in publishing numeric sensitive data”. In SIGMOD, 2008, 473–486
[12] Tiancheng Li, Ninghui Li, jian Zhang. “Modeling and integrating background knowledge in data anonymization”. Proceedings - International Conference on Data Engineering, 2009, 6-17, Proceedings - 25th IEEE International Conference on Data Engineering, ICDE (2009)
[13] Yingjie Wu, Zhihui Sun, Xiaodong Wang. “Privacy Preserving k-anonymity for Re-publication of Incremental Datasets”. Computer Science and Information Engineering, 2009 WRI World Congress on. March 31 2009-April 2 2009. page(s): 53 – 60
[14] X.Xiao and Y. Tao. “Personalized privacy preservation”.[C].New York:ACM Press, 2006, 229-240.
[15] S.Hettich, C.L.Blake, and C.J.Merz, UCI repository of machine learning databases, Available at http://archive.ics.uci.edu/ml/datasets/Adult, University of California, Irvine, 2008
Full Text: PDF


