期刊文献+

针对k-匿名数据的判定树构造算法

An Algorithm of Decision Tree for k-Anonymity Data
原文传递
导出
摘要 数据挖掘问题是提高k-匿名隐私保护模型下数据可用性问题之一.通过分析发现,k-匿名表中准标识符属性值与利用精确表生成的判定树的部分非叶结点的属性值均是通过泛化产生的,根据这一对应关系,本文提出了一种基于k-匿名表的判定树生成算法.该算法直接以k-匿名表作为输入,避免了经典ID3算法运行前的数据准备工作.实验表明,该算法节省了建立概化层次树的时间,并且行之有效. Data mining is one of problems for the utility of anonymized data under the k-anonymity privacy protection model.Through analysis,we find that both the quasi-identifier attribute values in the k-anonymity table and the node except leaf of the decision tree in the private table are needed to generalize.According to this correspondence,we propose a decision tree algorithm based on k-anonymity.The algorithm accepts the k-anonymity table as input to avoid the ID3algorithm data preparation work before running.Experimental results show that the algorithm saves the time which is used to build generalize tree and it is efficient for k-anonymity data table.
出处 《武汉大学学报(理学版)》 CAS CSCD 北大核心 2011年第6期494-498,共5页 Journal of Wuhan University:Natural Science Edition
基金 国家自然科学基金资助项目(61070032)
关键词 K-匿名 判定树 ID3 不确定 数据挖掘 k-anonymity decision tree ID3 uncertain data mining
  • 相关文献

参考文献13

  • 1Samarati P,Sweeney L. Protecting Privacy When Dis- closing Information: k-anonymity and Its Enforcement through Generalization and Suppression [R/OL]. [2010-12-12]. http://citeseerx, ist. psu. edu/viewdoc/ summary? doi= lO. 1.1.37. 5829. 被引量:1
  • 2Sweeney L. k-Anonymity:A model for protecting pri- vacy [DB/OL]. [2010-12-201. http://citeseerx, ist. psu. edu/ viewdoc / download. 被引量:1
  • 3Cormode G, Srivastava D. Anonymized data: Genera- tion, models, usage [ DB/OL]. [ 2010-12-25 ]. hltp ..// dimacs, rulgers, edu/~ graham/pubs/papers/anon- tutl O. pd f . 被引量:1
  • 4Chui C K, Kao B, Hung E. Mining frequent itemsets from uncertain data[DB/OL]. [2010-12-23]. http:// www. philippe- fournier-viger, com/spmf /uapriori. pdf. 被引量:1
  • 5Li Y F, Han J W, Yang J. Clustering moving objects [DB/OL]. [2010-12-20]. http://www, cs. uiuc. edu/ ~hanj/ pd f /kddO4_clusmovobj. pd f . 被引量:1
  • 6Ngai W,Kao B,Chui C K,et al. Efficient clustering of uncertain data [DB/OL]. [ 2010-12-21 ]. http ://www. fbe. hku. hk /- mchau / papers / E f f icientClusterin gO- fUncertainData, pd f . 被引量:1
  • 7Cormode G, McGregor A. Approximation Algorithms for Clustering Uncertain Data[DB/OL]. [2010-12-23]. http://www, research, att. com/people/Cormode_ Graham/ library/ publications/ CormodeMcGregor08. pdf. 被引量:1
  • 8Guha S,Munagala K. Exceeding expectations and clus- tering uncertain data[DB/OL]. [2010-12-23]. http,// www. cs. duke. edu/~kamesh/ podsO9-guha, pd f . 被引量:1
  • 9韩佳炜.数据挖掘:概念与技术[M].北京:机械工业出版社,2007:162-172. 被引量:1
  • 10Newman D J, Blake C L, Merz C J. UCI repository of machine learning database[DB/OL]. [ 2010-12-21 ]. http://www, ics. uci. edu/-mlearn/. 被引量:1

二级参考文献19

共引文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部