Abstract
With the rapid development of Internet finance, financing websites pose an increasingly serious threat to personal property safety. Traditional malicious website identification techniques work only for websites with prominent features, so they perform poorly on financing websites. This paper selects features from multiple dimensions and groups them into five categories: domain name features, search engine indexing features, tag features, image features, and text features. These categories capture the essential differences between financing websites and other types of websites. A financing website recognition model is then built on a deep neural network. To verify the model, a comparison experiment is designed against the decision tree, support vector machine, and K-nearest neighbor algorithms. The experiments show that the proposed model improves recognition accuracy, reaching an accuracy of 95.9% and a precision of 98.7%, and that it outperforms the traditional machine learning algorithms on all evaluation metrics. The results show that the proposed method can effectively identify financing websites.
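The abstract reports two evaluation metrics, accuracy and precision. As a minimal sketch of how these are commonly computed for a binary task (financing website = 1, other website = 0), the snippet below uses the standard confusion-matrix definitions. The function names and the toy labels are illustrative assumptions, not the authors' code or data.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Return (TP, FP, TN, FN) for a binary classification task."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            if truth == positive:
                tp += 1  # predicted positive, actually positive
            else:
                fp += 1  # predicted positive, actually negative
        else:
            if truth == positive:
                fn += 1  # predicted negative, actually positive
            else:
                tn += 1  # predicted negative, actually negative
    return tp, fp, tn, fn


def accuracy(y_true, y_pred):
    """Fraction of all samples classified correctly: (TP + TN) / total."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    total = tp + fp + tn + fn
    return (tp + tn) / total if total else 0.0


def precision(y_true, y_pred):
    """Fraction of predicted positives that are truly positive: TP / (TP + FP)."""
    tp, fp, _, _ = confusion_counts(y_true, y_pred)
    return tp / (tp + fp) if (tp + fp) else 0.0


# Toy labels for illustration only (not taken from the paper).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
print(accuracy(y_true, y_pred), precision(y_true, y_pred))
```

A high precision means that few ordinary websites are wrongly flagged as financing websites, while accuracy measures correctness over both classes.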
Authors
何颖
杨频
王丛双
汤娟
HE Ying; YANG Pin; WANG Cong-Shuang; TANG Juan (School of Cybersecurity, Sichuan University, Chengdu 610207, China)
Source
《四川大学学报(自然科学版)》
CAS
CSCD
Peking University Core Journals
2021, No. 3, pp. 91-97 (7 pages)
Journal of Sichuan University (Natural Science Edition)
Funding
Sichuan Science and Technology Program (2020YFG0076).
Keywords
Financing website
Website identification
Deep neural network
Feature engineering