One-class classification problem has become a popular problem in many fields, with a wide range of applications in anomaly detection, fault diagnosis, and face recognition. We investigate the one-class classification ...One-class classification problem has become a popular problem in many fields, with a wide range of applications in anomaly detection, fault diagnosis, and face recognition. We investigate the one-class classification problem for second-order tensor data. Traditional vector-based one-class classification methods such as one-class support vector machine (OCSVM) and least squares one-class support vector machine (LSOCSVM) have limitations when tensor is used as input data, so we propose a new tensor one-class classification method, LSOCSTM, which directly uses tensor as input data. On one hand, using tensor as input data not only enables to classify tensor data, but also for vector data, classifying it after high dimensionalizing it into tensor still improves the classification accuracy and overcomes the over-fitting problem. On the other hand, different from one-class support tensor machine (OCSTM), we use squared loss instead of the original loss function so that we solve a series of linear equations instead of quadratic programming problems. Therefore, we use the distance to the hyperplane as a metric for classification, and the proposed method is more accurate and faster compared to existing methods. The experimental results show the high efficiency of the proposed method compared with several state-of-the-art methods.展开更多
在解决单分类问题的支持向量数据描述算法的基础上提出了适用于两类不平衡问题的I-SVDD(imbalance-support vector date description)算法.该算法通过增加样本的分布信息,对带野值的SVDD算法中的C值重新进行了定义.采用该算法对UC I数...在解决单分类问题的支持向量数据描述算法的基础上提出了适用于两类不平衡问题的I-SVDD(imbalance-support vector date description)算法.该算法通过增加样本的分布信息,对带野值的SVDD算法中的C值重新进行了定义.采用该算法对UC I数据集和人工样本集进行实验表明,改进后的I-SVDD算法比带野值的SVDD算法的AUC值平均提高12%以上;比AdaBoost算法在正类查全率上平均提高35%,精确度也提高了2%以上.I-SVDD算法在保证少数类样本高分类精度前提下,还有效提高了全样本的分类精度,更符合现实不平衡问题中对少数类样本的处理要求.展开更多
异常检测是模式识别领域的经典研究,然而在极端类别不平衡场景下,异常样本匮乏,训练数据仅包含正常样本,传统异常检测方法难以适用。因此,单分类算法逐渐受到关注,它只使用目标类样本构建决策边界,实现对非目标类样本的识别。目前单分...异常检测是模式识别领域的经典研究,然而在极端类别不平衡场景下,异常样本匮乏,训练数据仅包含正常样本,传统异常检测方法难以适用。因此,单分类算法逐渐受到关注,它只使用目标类样本构建决策边界,实现对非目标类样本的识别。目前单分类算法已经取得了显著进展,然而也存在一些局限性:(1)原始特征空间容易受噪声特征干扰;(2)单模型的单分类算法难以从多个特征空间学习更全面的决策边界;(3)缺少对先前模型的欠拟合样本进行针对性学习。为了解决这些问题,本文提出了基于宽度自编码网络的单分类集成算法(Ensemble One-class Classification Based on BLS-Autoencoder,EOC-BLSAE)。首先,本文设计了一种单分类宽度自编码网络模型(One-class BLS-Autoencoder,OC-BLSAE),它能高效学习原始特征空间到重构特征空间的非线性映射关系,利用重构误差构建决策边界;接着,本文设计了单分类Boosting策略,通过最小化全局重构损失,迭代学习欠拟合样本,从而多角度构建OC-BLSAE模型,并自适应评估模型的可靠性;最终,加权集成多个OC-BLSAE模型,有效提升整体算法准确性和鲁棒性。在实验中,本文在16个不同规模的单分类任务上进行参数实验、对比实验和消融实验,结果表明所提算法参数选择较为灵活,算法各模块能够相互协同,有效提升单分类任务的准确性和鲁棒性,整体性能超过前沿单分类方法。展开更多
文摘One-class classification problem has become a popular problem in many fields, with a wide range of applications in anomaly detection, fault diagnosis, and face recognition. We investigate the one-class classification problem for second-order tensor data. Traditional vector-based one-class classification methods such as one-class support vector machine (OCSVM) and least squares one-class support vector machine (LSOCSVM) have limitations when tensor is used as input data, so we propose a new tensor one-class classification method, LSOCSTM, which directly uses tensor as input data. On one hand, using tensor as input data not only enables to classify tensor data, but also for vector data, classifying it after high dimensionalizing it into tensor still improves the classification accuracy and overcomes the over-fitting problem. On the other hand, different from one-class support tensor machine (OCSTM), we use squared loss instead of the original loss function so that we solve a series of linear equations instead of quadratic programming problems. Therefore, we use the distance to the hyperplane as a metric for classification, and the proposed method is more accurate and faster compared to existing methods. The experimental results show the high efficiency of the proposed method compared with several state-of-the-art methods.
文摘在解决单分类问题的支持向量数据描述算法的基础上提出了适用于两类不平衡问题的I-SVDD(imbalance-support vector date description)算法.该算法通过增加样本的分布信息,对带野值的SVDD算法中的C值重新进行了定义.采用该算法对UC I数据集和人工样本集进行实验表明,改进后的I-SVDD算法比带野值的SVDD算法的AUC值平均提高12%以上;比AdaBoost算法在正类查全率上平均提高35%,精确度也提高了2%以上.I-SVDD算法在保证少数类样本高分类精度前提下,还有效提高了全样本的分类精度,更符合现实不平衡问题中对少数类样本的处理要求.
文摘异常检测是模式识别领域的经典研究,然而在极端类别不平衡场景下,异常样本匮乏,训练数据仅包含正常样本,传统异常检测方法难以适用。因此,单分类算法逐渐受到关注,它只使用目标类样本构建决策边界,实现对非目标类样本的识别。目前单分类算法已经取得了显著进展,然而也存在一些局限性:(1)原始特征空间容易受噪声特征干扰;(2)单模型的单分类算法难以从多个特征空间学习更全面的决策边界;(3)缺少对先前模型的欠拟合样本进行针对性学习。为了解决这些问题,本文提出了基于宽度自编码网络的单分类集成算法(Ensemble One-class Classification Based on BLS-Autoencoder,EOC-BLSAE)。首先,本文设计了一种单分类宽度自编码网络模型(One-class BLS-Autoencoder,OC-BLSAE),它能高效学习原始特征空间到重构特征空间的非线性映射关系,利用重构误差构建决策边界;接着,本文设计了单分类Boosting策略,通过最小化全局重构损失,迭代学习欠拟合样本,从而多角度构建OC-BLSAE模型,并自适应评估模型的可靠性;最终,加权集成多个OC-BLSAE模型,有效提升整体算法准确性和鲁棒性。在实验中,本文在16个不同规模的单分类任务上进行参数实验、对比实验和消融实验,结果表明所提算法参数选择较为灵活,算法各模块能够相互协同,有效提升单分类任务的准确性和鲁棒性,整体性能超过前沿单分类方法。