Abstract
In many real-world applications, data are collected as a feature-evolvable stream. For example, when limited-lifespan sensors are replaced, the features gathered by the old sensors vanish and the features gathered by the new sensors emerge. Online passive-aggressive algorithms have proven effective for learning linear classifiers from data with a fixed feature space and from data with a trapezoidal feature space. Building on this, we propose a feature-evolvable learning algorithm based on the passive-aggressive update strategy, called passive-aggressive learning with feature evolvable streams (PAFE), which uses the margin to modify the current classifier. The algorithm learns two models with passive-aggressive updates: one on the current feature space and one on the recovered vanished feature space. Specifically, during the overlapping period, in which old and new features are both available, it uses the new features to recover the vanished feature space and uses the old feature space to simulate the new one, which provides a reasonable initialization for the model on the new feature space. On top of these two models, we propose two ensemble methods to improve overall performance: combining the predictions of the two models, and dynamically selecting the current best single prediction. Experiments on synthetic and real datasets validate the effectiveness of the proposed algorithm.
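The ingredients named in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the PAFE implementation: the update is the standard PA-I rule for binary labels in {-1, +1}, the recovery map is an ordinary least-squares fit on the overlapping period, and the ensemble weights decay exponentially with cumulative loss. The function names and the parameters C and eta are hypothetical, and the paper's exact formulas may differ.

```python
import numpy as np

def pa_update(w, x, y, C=1.0):
    """One PA-I step (assumed variant) for a label y in {-1, +1}."""
    loss = max(0.0, 1.0 - y * float(w @ x))  # hinge loss on the margin
    sq_norm = float(x @ x)
    if loss == 0.0 or sq_norm == 0.0:
        return w, loss                       # passive: margin already satisfied
    tau = min(C, loss / sq_norm)             # aggressive step size, capped by C
    return w + tau * y * x, loss

def fit_recovery_map(X_new, X_old):
    """Least-squares M with X_new @ M ~ X_old, fitted on the overlap period.

    Assumption: a linear map is enough to recover old features from new ones.
    """
    M, *_ = np.linalg.lstsq(X_new, X_old, rcond=None)
    return M

def combine(scores, cum_losses, eta=1.0):
    """Combined prediction: weight each model by exp(-eta * cumulative loss)."""
    weights = np.exp(-eta * (cum_losses - cum_losses.min()))
    weights /= weights.sum()
    return float(weights @ scores)

def current_best(scores, cum_losses):
    """Current-best prediction: use the model with the lowest cumulative loss."""
    return float(scores[np.argmin(cum_losses)])
```

In a stream, one would call pa_update once per arriving example for each of the two models (the current-feature model on x, the recovered-feature model on x @ M), accumulate each model's loss, and pass the two scores to combine or current_best to get the final prediction.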
Authors
Liu Yanfang (刘艳芳)
Li Wenbin (李文斌)
Gao Yang (高阳)
Affiliations: State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023; College of Mathematics and Information Engineering, Longyan University, Longyan, Fujian 364012
Source
Journal of Computer Research and Development (《计算机研究与发展》)
Indexed in: EI, CSCD, Peking University Core Journals (北大核心)
2021, Issue 8, pp. 1575-1585 (11 pages)
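Funding
National Key Research and Development Program of China (2018AAA0100905)
Education and Scientific Research Project for Young and Middle-aged Teachers of Fujian Province (Science and Technology) (JAT190743)
Longyan Science and Technology Plan Project (2019LYF13002, 2019LYF12010)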
Keywords
online learning
passive-aggressive strategy
supervised learning
ensemble learning
evolvable features