期刊文献+

电商异构网络中基于多层信息融合的用户社区划分算法 被引量:1

User Community Partition Based on Multi-layer Information Fusion in E-commerce Heterogeneous Network
原文传递
导出
摘要 【目的】当前用户社区划分算法大多因缺乏对电商网络异构性的考量,导致社区划分准确度不高。为此,本文提出一种电商异构网络中基于多层信息融合的用户社区划分算法。【方法】根据不同关系类型对电商异构网络进行分层处理,构造基于不同关系类型的用户节点嵌入;通过表征融合将不同层的用户嵌入合并,获得电商异构网络中的用户融合嵌入表征;使用目标函数优化用户节点的相关参数;最后,通过改进的K-means算法形成用户聚类,得到合理的用户社区划分结果。【结果】本文所提算法与基于DeepWalk、Node2Vec、GCN等主流用户社区划分算法中的次优算法相比,在NMI和Sim@5指标上分别提升6.4%和1.7%,在有效表征用户节点及精确划分用户社区方面都有良好的表现。【局限】未考虑电商异构网络中所包含的时间信息,同时忽略了网络中噪声点所产生的影响。【结论】本文算法切实有效,在电商领域有助于提升好友预测、群组推荐等核心应用的性能。 [Objective] This paper proposes a new algorithm based on multi-layer information fusion in an e-commerce heterogeneous network, aiming to improve the accuracy of user community division. [Methods] First,we conducted hierarchical processing of the e-commerce heterogeneous networks and constructed user node embeddings based on different relationship types. Then, we merged users of different layers and obtained their embedding characterization in e-commerce heterogeneous networks. Third, we used the objective function to optimize the relevant parameters of the user nodes. Finally, we clustered these users with an improved K-means algorithm, and created the reasonable community division. [Results] The NMI and Sim@5 indicators of the proposed algorithm were 6.4% and 1.7% higher than the existing algorithms based on DeepWalk, Node2Vec, and GCN. The model effectively characterized user nodes and accurately divided their communities. [Limitations] We did not examine the time information and noise points from the heterogeneous network. [Conclusions] The proposed algorithm could improve the performance of friend prediction, group recommendation and other applications.
作者 冯勇 徐文韬 王嵘冰 徐红艳 张永刚 Feng Yong;Xu Wentao;Wang Rongbing;Xu Hongyan;Zhang Yonggang(College of Information,Liaoning University,Shenyang 110036,China;Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education,Jilin University,Changchun 130012,China)
出处 《数据分析与知识发现》 CSSCI CSCD 北大核心 2022年第5期89-98,共10页 Data Analysis and Knowledge Discovery
基金 吉林大学教育部符号计算与知识工程重点实验室资助项目(项目编号:93K172018K01) 辽宁省教育厅科学研究基金面上项目(项目编号:LJKZ0085)的研究成果之一。
关键词 异构网络 电子商务 表征学习 社区划分 Heterogeneous Network E-commerce Representation Learning Community Division
  • 相关文献

参考文献8

二级参考文献42

  • 1CALINSKI R,HARABASZ J.A dendrite method for cluster analysis[J].Communications in Statistics,1974,3(1):1 -27. 被引量:1
  • 2DAVIES D L,BOULDIN D W.A cluster separation measure[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1979,1(2):224-227. 被引量:1
  • 3DUDOIT S,FRIDLYAND J.A prediction-based resampling method for estimating the number of clusters in a dataset[J].Genome Biology,2002,3(7):1-21. 被引量:1
  • 4DIMITRIADOU E,DOLNICAR S,WEINGESSEL A.An examination of indexes for determining the number of cluster in binary data sets[J].Psychometrika,2002,67(1):137-160. 被引量:1
  • 5KAPP A V,TIBSHIRANI R.Are clusters found in one dataset present in another dataset?[J].Biostatistics,2007,8(1):9-31. 被引量:1
  • 6ROUSSEEUW P J.Silhouettes:a graphical aid to the interpretation and validation of cluster analysis[J].Journal of Computational and Applied Mathematics,1987,20(1):53 -65. 被引量:1
  • 7DEMB(E)L(E) D,KASTNER P.Fuzzy C-means method for clustering microarray data[J].Bioinformatics,2003,19(8):973-980. 被引量:1
  • 8孙吉贵,刘杰,赵连宇.聚类算法研究[J].软件学报,2008(1):48-61. 被引量:1070
  • 9付剑锋,刘宗田,刘炜,周文.基于层叠条件随机场的事件因果关系抽取[J].模式识别与人工智能,2011,24(4):567-573. 被引量:20
  • 10武志昊,林友芳,Steve Gregory,万怀宇,田盛丰.Balanced Multi-Label Propagation for Overlapping Community Detection in Social Networks[J].Journal of Computer Science & Technology,2012,27(3):468-479. 被引量:41

共引文献281

同被引文献9

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部