摘要
糖尿病是一种可防可控的慢性疾病,会产生很多并发症,对人体危害很大,因此早期诊断糖尿病并干预生活方式对预防糖尿病慢性并发症十分必要。利用健康档案中数据来预测空腹血糖水平,因为空腹血糖水平的高低是早期诊断和干预的一个重要依据,但是健康档案中数据存在维度广、噪声多、强耦合、非线性等特点,为此提出了基于KPCA和LSSVM结合的方法进行建模,并将LSSVM、PCA-LSSVM、KPCA-LSSVM这3种模型进行比较,结果表明KPCA-LSSVM准确性比LSSVM、PCA-LSSVM大幅提高,ROC曲线的积分面积也接近于1,说明KPCA-LSSVM能够运用于空腹血糖的预测,也为医疗数据挖掘提供一种新的参考办法。
Diabetes, a preventable and controllable chronic disease, will cause a lot of complications, which is harmful to people's health. Therefore, the early diagnosis of diabetes and the intervention of lifestyle are very necessary to the prevention of chronic diabetic complications. This paper makes use of the data of electronic health records to predict the level of fasting blood glucose, which is an important basis of early diagnosis and intervention. However, the data of health records have such features as multidimensional, noise interference, strong coupling and nonlinear. Therefore, this paper proposes a method based on the combination of KPCA and LSSVM, and makes a comparison of the three models which are LSSVM, PCA-LSSVM and KPCA-LSSVM. The results show that the accuracy is significantly improved through KPCA-LSSVM, the integral area of ROC curve is also close to 1, which proves the KPCA-LSSVM to be an appropriate method to the prediction of the level of fasting blood glucose. More importantly, it provides a new reference method for medical data mining.
作者
江燕
帅仁俊
张姝
查代奉
JIANG Yan;SHUAI Renjun;ZHANG Shu;ZHA Daifeng(College of Computer Science and Technology,Nanjing Technology University,Nanjing 211816,China;College of Electrical Engineering and Control Science,Nanjing Technology University,Nanjing 211816,China;College of Science,Jiujiang University,Jiujiang,Jiangxi 332000,China)
出处
《计算机工程与应用》
CSCD
北大核心
2018年第13期241-245,共5页
Computer Engineering and Applications
基金
国家自然科学基金(No.61261046)