期刊文献+

机器学习相关技术在以黄酮为特征的黄芪分类中的应用研究 被引量:1

Research on the application of machine learning related techniques in the classification of Astragali Radix characterized by flavonoids
原文传递
导出
摘要 目的:建立以黄酮类成分为特征的栽培黄芪、半野生黄芪和野生黄芪的三分类模型,并且对自动机器学习技术和数据增强技术在药物分析领域中的应用进行探索和评价。方法:首先,对黄芪的黄酮类成分含量数据进行相关性分析、主成分分析,建立决策树和逻辑回归模型,根据模型分析黄酮类成分的重要性程度;然后,使用TVAE表格数据生成算法,根据真实数据生成600批虚拟数据,使用自动学习框架AutoGluon,num_bag_folds设为5,分别对64批真实数据和600批虚拟数据进行学习,得到2组共30个模型,依据准确率进行评估。结果:对机器学习模型的分析可知,芒柄花素、毛蕊异黄酮葡萄糖苷和刺芒柄花苷这3种黄酮类成分对于黄芪质量,尤其是来源等级的控制具有重要意义;2组共30个模型预测准确率表明,基于NeuralNet的模型和基于树模型的机器学习算法对于黄酮成分数据表征的黄芪而言分类效果最好;数据增强技术生成的虚拟数据与真实数据在所训练得到的模型准确率趋势方面基本一致。结论:机器学习相关技术在以黄酮为特征的黄芪分类中具有较好的应用价值。 Objective:To establish a three classification model for cultivated,semi-wild,and wild Astragali Radix characterized by flavonoids,and explore and evaluate the application of techniques of automated machine learning and data augmentation in the field of drug analysis.Methods:Firstly,correlation analysis and principal component analysis were conducted on the flavonoid content data of Astragali Radix,and models of decision tree and logistic regression were established to analyze the importance of flavonoid components based on the models.Then,using the AutoGluon framework with 5 as num_bag_folds,2 sets of 30 models respectively through 64 batches of real data and 600 batches of virtual data generated based on real data with the TVAE table data generation algorithm for training were obtained,and these models were evaluated by accuracy.Results:The analysis of machine learning models,indicated that formononetin,campanulin and onospin played the important roles in the quality control of Astragali Radix,especially for the source grade control.The accuracy of model prediction showed that the models based on Neural Net and tree-model always had the best classification effect for Astragali Radix.The virtual data generated by data augmentation technique is basically consistent with the actual data in terms of the accuracy trend of the model training process.Conclusion:Related techniques of machine learning have good application value in the classification of Astragali Radix characterized by flavonoids.
作者 石岩 李宁 魏锋 马双成 SHI Yan;LI Ning;WEI Feng;MA Shuang-cheng(National Institutes for Food and Drug Control,Beijing 102629,China;Beijing Institute for Drug Control,Beijing 102206,China)
出处 《药物分析杂志》 CAS CSCD 北大核心 2024年第5期866-873,共8页 Chinese Journal of Pharmaceutical Analysis
关键词 黄芪 黄酮 毛蕊异黄酮葡萄糖苷 刺芒柄花苷 毛蕊异黄酮 山柰酚 异鼠李素 芒柄花素 机器学习 人工智能 数据增强 Astragali Radix flavonoids campanulin onospin calycosin kaempferol isorhamnetin formonone-tin machine learning artificial intelligence data augmentation
  • 相关文献

参考文献7

二级参考文献99

共引文献107

同被引文献33

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部