自注意力信用评估模型被引量：2

Self-Attention Credit Scoring Model

下载PDF

导出

摘要在信用评估问题中,用户信息中既包含类别数据,也包含数值数据。传统的基于人工智能的信用评估模型通常对类别数据进行one-hot变换后,再与数值数据进行拼接作为判别器的输入。与之不同,借鉴了自然语言处理中的词嵌入技术来提取类别数据的词向量;将输入的词向量集合类比为“句子”,并基于自注意力机制从“句子”中提取出用户特征;最后采用多层感知机来预测用户违约的概率。新模型可以使用反向传播算法实现端到端的训练。在三个不同的数据集上将新模型和六种基准算法进行了比较,结果表明该模型能够比基准算法取得更好的性能。 In the credit scoring problem, the user information contains both category data and numerical data. Traditional artificial intelligence-based credit scoring algorithms usually transform the category data into one-hot vectors and joints them with numerical data, as the input of the discriminator. In contrast, this paper extracts vectors of category data based on the word embedding techniques which are popularly used in the natural language processing problem. After that, the set of the word vectors is analogized to a“sentence”, and the input feature is extracted from the“sentence”based on the self-attention mechanism. Finally, a Multi-Layer Perception(MLP)neural network is used to predict the probability of default. The new model is trained end- to-end by the back propagation method. Experimental results show the proposed new model achieves better performance than six other baselines on three well-known benchmark datasets.

作者刘欣阳曲彦文周琪云 LIU Xinyang;QU Yanwen;ZHOU Qiyun(School of Computer Information and Engineering,Jiangxi Normal University,Nanchang 330022,China)

机构地区江西师范大学计算机信息工程学院

出处《计算机工程与应用》 CSCD 北大核心 2019年第13期36-41,共6页 Computer Engineering and Applications

基金国家自然科学基金(No.61562041,No.61866018)

关键词信用评估自注意力机制词嵌入特征提取深度神经网络 credit scoring self-attention mechanism word embedding feature extraction deep neural network

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1杨嘉树,梅天灿,仲思东.顾及局部特性的CNN在遥感影像分类的应用[J].计算机工程与应用,2018,54(7):188-195. 被引量：11
2张绮琦,张树群,雷兆宜.基于改进的卷积神经网络的中文情感分类[J].计算机工程与应用,2017,53(22):111-115. 被引量：17
3赵冬梅,刘海峰,张军鹏.基于模糊神经网络的信息安全风险评估模型[J].计算机工程与应用,2009,45(17):116-118. 被引量：11

二级参考文献9

1GB/T20984-2007信息安全技术信息系统的风险评估规范[S].北京:国家标准出版社,2007. 被引量：1
2Rainer R K.Risk analysis for information technology[J].Journal of Management Information Systems, 1991,8( 1 ) : 129-147. 被引量：1
3Tregear J.Risk assessment,Information Security Technical Report[R]. 2001-09. 被引量：1
4Patrick P.Minimization method for training feed forward neural network[J].Neural Network, 1994,7 : 145-163. 被引量：1
5I Hansen L K.Neural network ensembles[J].IEEE Transactions on Pattern Analysis and Machine Intelligence.1990,12(10) :993-1001. 被引量：1
6张青贵.人工神经网络[M].北京:中国水利水电出版社,2004. 被引量：1
7Zhao Dong-mei,Wang Jing-hong,Ma Jian-Feng.Fuzzy risk assessment of network security[C]//Proceedings of the 4th International Conference on Machine Learning and Cybemetics,Dalian,13-16 August 2006: 4400-4405. 被引量：1
8唐慧丰,谭松波,程学旗.基于监督学习的中文情感分类技术比较研究[J].中文信息学报,2007,21(6):88-94. 被引量：136
9孙志军,薛磊,许阳明,王正.深度学习研究综述[J].计算机应用研究,2012,29(8):2806-2810. 被引量：628