期刊文献+

一种隐私保护关联规则挖掘的混合算法 被引量:12

Hybrid algorithm for privacy preserving association rules mining
下载PDF
导出
摘要 针对现有的隐私保护关联规则挖掘算法无法满足效率与精度之间较好的折中问题,提出了一种基于安全多方计算与随机干扰相结合的混合算法。算法基于半诚实模型,首先使用项集随机干扰矩阵对各个分布站点的数据进行变换和隐藏,然后提出一种方法恢复项集的全局支持数。由于采用的是对项集进行干扰,克服了传统方法由于独立地干扰每个项而破坏项之间相关性,导致恢复精度下降的缺陷。将小于阈值的项集进行剪枝,再使用安全多方计算在剪枝后的空间中精确找出全局频繁项集,进而生成全局关联规则。实验表明,该算法在保持隐私度的情况下,能够获得精度和效率之间较好的折中。 To solve the problem that the existing privacy preserving association rule mining algorithm can not meet better trade-off between efficiency and accuracy, thia paper proposed a new hybrid algorithm based on secure multi-party computation combined with randomization approach. The algorithm based on semi-honest model and used the itemset random perturbation matrix to disturb and hide the data of distribution sites firstly, then it proposed a method to estimate the global support of candidate itemsets. Due to it disturbed the itemsets, overcoming the defect of traditional algorithms which disturbed each item independently and made the result inaccuracy. Then it pruned to reduce the search space of candidate frequent itemsets. Secondly, it used the secure multi-party computation to find the precise global frequent itemsets in the reduced search space. The experimental results show that the proposed algorithm can obtain better trade-off between accuracy and efficiency while maintain the privacy.
出处 《计算机应用研究》 CSCD 北大核心 2012年第3期1107-1110,共4页 Application Research of Computers
基金 省科技型企业创新资金资助项目(BC2010172) 高校博士点基金资助项目(20093227110005) 江苏大学高级人才启动基金资助项目(09JDG041)
关键词 隐私保护 关联规则挖掘 安全多方计算 项集随机干扰矩阵 privacy preserving association rules mining SMC itemsets random perturbation matrix
  • 相关文献

参考文献9

  • 1AGRAWAL R, SRIKANT R. Privacy-preserving data mining [ C ]// Proc of ACM SIGMOD Conference on Management of Data. 2000:439- 450. 被引量:1
  • 2周水庚,李丰,陶宇飞,肖小奎.面向数据库应用的隐私保护研究综述[J].计算机学报,2009,32(5):847-861. 被引量:220
  • 3KANTARCIOGLU M, CLIFTION C. Privacy-preserving distributed mining of association rules on horizontally partitioned data [ J]. IEEE Trans on Knowledge and Data Engineering,2004,16 (9) :1026- 1037. 被引量:1
  • 4HUSSEIN M, EL-SISI A, ISMAIL N. Fast eryptographic privacy pre- serving association rules mining on distributed homogenous data base [ C]//LNCS, vol 5178. 2008:607-616. 被引量:1
  • 5KUNO S Y, DOI K, YAMAMOTO A. Frequent closed itemset mining with privacy for distributed databases [ C ]//Proc of IEEE International Conference on Data Mining Workshops. 2010:483-490. 被引量:1
  • 6张鹏,童云海,唐世渭,杨冬青,马秀莉.一种有效的隐私保护关联规则挖掘方法[J].软件学报,2006,17(8):1764-1774. 被引量:53
  • 7TENG Zhou-xuan, DU Wen-liang. A hybrid muhi-group approach for privacy-preserving data mining [ J]. Knowledge and Information Systems ,2009,19 ( 2 ) : 133-157. 被引量:1
  • 8SHASKANKA M. A privacy preserving framework for Gaussian mix- ture models [ C ]//Proc of IEEE International Conference on Data Mining Workshops. Washington DC : IEEE Computer Society, 2010 : 499-506. 被引量:1
  • 9HANJia-wei KAMBERM.数据挖掘概念与技术[M].北京:机械工业出版社,2001.1 51-161. 被引量:36

二级参考文献87

共引文献297

同被引文献160

引证文献12

二级引证文献75

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部