期刊文献+

基于强化学习的特征选择方法及材料学应用 被引量:3

Feature selection based on reinforcement learning and its application in material informatics
下载PDF
导出
摘要 随着大数据、人工智能以及高性能计算的快速发展,数据驱动的新材料研发成为研究热点.在对材料数据进行数据挖掘的过程中,需要对特征集合进行预处理,通过减少无关冗余特征,不仅可以避免模型过拟合,还能提高模型的可解释性.基于此,提出了一种基于强化学习的特征选择(feature selection based on reinforcement learning,FSRL)算法,将封装式特征选择抽象成机器学习模型和“环境”互动的过程,并根据利益最大化准则将对应特征加入特征子集中.同时,为了提高模型的预测精度,还提出一种基于符号变换的特征构造方法来生成新的特征.最后,将所提出方法应用到非晶合金材料的分类预测任务和铝基复合材料的回归任务中.实验结果表明,FSRL算法的分类准确率最高提升了2.8%,而在回归任务中,基于特征构造的FSRL算法使得预测精度最高提升了22.9%. Owing the rapid development of big data,artificial intelligence,and highperformance computing,the research and development of data-driven materials has intensified.During data mining and the machine learning of material data,the feature set must be preprocessed by reducing redundant and irrelevant features,which can not only avoid model overfitting,but also improve the model interpretability.Herein,a feature selection method based on reinforcement learning,known as FSRL,is proposed.By abstracting the encapsulated feature selection method into the interaction between the machine learning model and environment,the corresponding features are selected based on the maximum reward and then incorporated to the feature subset.In addition,we propose a feature construction method based on symbolic transformation to generate new high-order features to improve the prediction accuracy of the model.Subsequently,we apply the abovementioned method to the classification task of amorphous alloy materials and the regression task of aluminum matrix composite materials.Experiments show that our proposed method not only successfully achieve feature transformation in the FSRL,but also afford a 2.8%prediction improvement in the classification task and a 22.9%prediction improvement in the regression task respectively.
作者 张鹏 张瑞 ZHANG Peng;ZHANG Rui(School of Computer Engineering and Science,Shanghai University,Shanghai 200444,China;Center of Materials Informatics and Data Science,Materials Genome Institute,Shanghai University,Shanghai 200444,China;Zhejiang Laboratory,Hangzhou 311100,Zhejiang,China)
出处 《上海大学学报(自然科学版)》 CAS CSCD 北大核心 2022年第3期463-475,共13页 Journal of Shanghai University:Natural Science Edition
基金 国家重点研发计划资助项目(2018YFB0704400) 云南省重大科技专项资助项目(202102AB080019-3,202002AB080001-2) 之江实验室科研攻关资助项目(2021PE0AC02) 上海张江国家自主创新示范区专项发展资金重大资助项目(ZJ2021-ZD-006)。
关键词 特征选择 强化学习 特征构造方法 feature selection reinforcement learning feature construction method
  • 相关文献

参考文献13

二级参考文献203

共引文献1091

同被引文献27

引证文献3

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部