Abstract
As ever larger volumes of data are collected, privacy leakage in support vector machines (SVMs) has become an increasingly serious problem. Building on an analysis of existing privacy-preserving SVMs, this paper proposes a privacy-preserving learning machine for large-scale datasets (PPLM). The method first uses a core vector machine (CVM) to sample the large-scale dataset; it then chooses two points from different classes in the core set and takes the hyperplane orthogonal to the line connecting them as the optimal separating hyperplane. Experiments on synthetic and standard datasets indicate that PPLM handles large-scale classification effectively and achieves competitive classification performance.
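The two-point construction described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation, and several details are assumptions made only for illustration: a uniform random subsample stands in for the CVM core set, the hyperplane is assumed to pass through the midpoint of the two chosen points, and the pair is selected by exhaustive search for the best accuracy on the subsample. All function names (`bisector`, `predict`, `fit_two_point`) are hypothetical.

```python
import random
from itertools import product


def dot(a, b):
    """Inner product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def bisector(p, q):
    """Hyperplane w.x + b = 0 orthogonal to the segment pq.

    Assumption: the plane passes through the midpoint of p and q;
    the abstract only states that it is orthogonal to the connecting line.
    """
    w = [pi - qi for pi, qi in zip(p, q)]
    mid = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return w, -dot(w, mid)


def predict(w, b, x):
    """Return +1 on the side of the plane containing p, otherwise -1."""
    return 1 if dot(w, x) + b >= 0 else -1


def fit_two_point(X, y, sample_size=50, seed=0):
    """Pick one point per class and return the separating plane (w, b).

    Assumption: a uniform random subsample replaces the CVM core set,
    and the pair is chosen by brute-force search on that subsample.
    """
    rng = random.Random(seed)
    sub = rng.sample(range(len(X)), min(sample_size, len(X)))
    pos = [X[i] for i in sub if y[i] == 1]
    neg = [X[i] for i in sub if y[i] == -1]
    if not pos or not neg:
        raise ValueError("subsample must contain both classes")
    best = None
    for p, q in product(pos, neg):
        w, b = bisector(p, q)
        acc = sum(predict(w, b, X[i]) == y[i] for i in sub) / len(sub)
        if best is None or acc > best[0]:
            best = (acc, w, b)
    return best[1], best[2]
```

Under these assumptions, the resulting classifier is fully described by the two selected points, so only those two samples need to be retained after training.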
Source
Journal of University of Electronic Science and Technology of China (《电子科技大学学报》)
EI
CAS
CSCD
Peking University Core Journals (北大核心)
2013, No. 2, pp. 272-276 (5 pages)
Funding
National 863 Program of China (2007AA1Z158, 2006AA10Z313)
National Natural Science Foundation of China (60773206, 60704047)
Keywords
large scale datasets
pattern classification
privacy preserving
support vector machine