###
计算机系统应用英文版:2022,31(10):225-235
本文二维码信息
码上扫一扫!
基于属性分割的差分隐私异构多属性数据发布
(1.南京航空航天大学 计算机科学与技术学院, 南京 211106;2.南京航空航天大学 高安全系统的软件开发与验证技术工业和信息化部重点实验室, 南京 211106;3.南京大学 软件新技术与产业化协同创新中心, 南京 210093)
Differentially Private Heterogeneous Multi-attribute Data Publication via Attribute Segmentation
(1.College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;2.Key Laboratory of Safety-critical Software, Ministry of Industry and Information Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;3.Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing University, Nanjing 210093, China)
摘要
图/表
参考文献
相似文献
本文已被:浏览 467次   下载 1073
Received:January 06, 2022    Revised:February 17, 2022
中文摘要: 针对现有多属性数据隐私发布方法无法兼顾属性的敏感性差异和计算效率低的问题, 提出了一种基于属性分割的差分隐私异构多属性数据发布方法HMPrivBayes. 首先, 设计了满足差分隐私的谱聚类算法分割原始数据集, 其中相似矩阵的生成借助于属性最大信息系数. 其次, 借助属性信息, 该方法使用满足差分隐私的改进贝叶斯网络构建算法分别为每个数据子集构建贝叶斯网络. 最后, 以属性归一化风险熵为权重分配隐私预算, 对贝叶斯网络提取的属性联合分布添加异构噪声扰动, 实现了异构多属性数据保护. 实验结果表明, HMPrivBayes可以在减少注入合成数据集中噪声量的同时, 提高合成数据计算效率.
Abstract:Multi-attribute data privacy publication fails to balance the difference in attribute sensitivity and computational efficiency. For this reason, HMPrivBayes, a heterogeneous multi-attribute data publishing method with differential privacy based on attribute segmentation, is proposed. Firstly, the spectral clustering algorithm satisfying differential privacy is designed to segment the original data set, in which the similarity matrix is generated by the attribute maximum information coefficient. Secondly, with the help of attribute information, this method uses an improved Bayesian network construction algorithm to build Bayesian networks for each data subset. Finally, HMPrivBayes adds heterogeneous noise disturbance to the attribute joint distribution extracted from the Bayesian network to realize the protection of heterogeneous multi-attribute data, in which privacy budget is allocated based on the normalized risk entropy of attribute. The experimental results show that HMPrivBayes not only reduces the added noise but also improves the computational efficiency of synthetic data.
文章编号:     中图分类号:    文献标志码:
基金项目:国家自然科学基金(61772270)
引用文本:
张小玉,沈国华,杨阳.基于属性分割的差分隐私异构多属性数据发布.计算机系统应用,2022,31(10):225-235
ZHANG Xiao-Yu,SHEN Guo-Hua,YANG Yang.Differentially Private Heterogeneous Multi-attribute Data Publication via Attribute Segmentation.COMPUTER SYSTEMS APPLICATIONS,2022,31(10):225-235