Abstract
To address scattered initial labels and the high randomness of label propagation, an overlapping community detection algorithm that combines label preprocessing with node influence is proposed. First, the influence of each node is calculated, and the nodes with the largest influence values are selected one by one as central nodes. Second, the label of each central node is used to preprocess the labels of its homogeneous neighbor nodes, which reduces the number of initial labels, lowers the randomness of subsequent label propagation, and preliminarily identifies overlapping nodes. Third, overlapping nodes are identified by the label belonging coefficient, and the labels of non-overlapping nodes are selected by node influence value, which improves the stability and accuracy of the algorithm. Finally, communities with weak cohesion are merged with the goal of maximizing the increment of the adaptive function, which improves community quality. Simulation results show that the proposed algorithm achieves the largest extended modularity value on 50% of the six real networks, and the best Normalized Mutual Information (NMI) on artificial benchmark networks with different mixing degrees, node overlapping degrees, and maximum numbers of communities to which a node belongs. In conclusion, the algorithm adapts well to various types of networks and has nearly linear time complexity.
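The label preprocessing step can be illustrated with a minimal sketch. The abstract does not state how node influence or neighbor homogeneity is measured, so the code below substitutes two stand-ins that are assumptions, not the authors' definitions: node degree as the influence value, and Jaccard similarity of closed neighborhoods (with a hypothetical `threshold` parameter) as the homogeneity test. The function name `preprocess_labels` is likewise illustrative.

```python
from itertools import count


def jaccard(graph, u, v):
    """Jaccard similarity of the closed neighbourhoods of u and v."""
    nu = graph[u] | {u}
    nv = graph[v] | {v}
    return len(nu & nv) / len(nu | nv)


def preprocess_labels(graph, threshold=0.5):
    """Give each central node a fresh label and copy it to similar neighbours."""
    # ASSUMPTION: influence is approximated by degree, not the paper's measure.
    influence = {v: len(nbrs) for v, nbrs in graph.items()}
    labels = {v: set() for v in graph}
    fresh = count()
    # Visit nodes from most to least influential; ties are broken by node id.
    for center in sorted(graph, key=lambda v: (-influence[v], v)):
        if labels[center]:
            continue  # already labelled by an earlier centre
        lab = next(fresh)
        labels[center].add(lab)
        for nbr in graph[center]:
            # ASSUMPTION: "homogeneous" means Jaccard similarity >= threshold.
            if jaccard(graph, center, nbr) >= threshold:
                labels[nbr].add(lab)
    return labels


# Toy network: two 4-cliques (0-3 and 4-7) bridged by node 8.
edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3),
         (4, 5), (4, 6), (4, 7), (5, 6), (5, 7), (6, 7),
         (8, 0), (8, 1), (8, 4), (8, 5)]
graph = {}
for u, v in edges:
    graph.setdefault(u, set()).add(v)
    graph.setdefault(v, set()).add(u)

labels = preprocess_labels(graph, threshold=0.4)
print({v: sorted(labels[v]) for v in sorted(labels)})
```

On this toy network, nine initial labels (one per node) shrink to two, and node 8 ends up holding both, which makes it a preliminary overlapping-node candidate. The later stages described in the abstract (label belonging coefficients and community merging) are not covered by this sketch.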
Authors
WU Qingshou
CHEN Rongwang
YU Wensen
LIU Genggeng
(School of Mathematics and Computer Science, Wuyi University, Wuyishan, Fujian 354300, China; Key Laboratory of Cognitive Computing and Intelligent Information Processing of Fujian Education Institutions (Wuyi University), Wuyishan, Fujian 354300, China; College of Mathematics and Computer Science, Fuzhou University, Fuzhou, Fujian 350116, China)
Source
Journal of Computer Applications (《计算机应用》)
CSCD
Peking University Core Journal (北大核心)
2020, No. 12, pp. 3578-3585 (8 pages)
Funding
National Natural Science Foundation of China (61877010)
Natural Science Foundation of Fujian Province (2019J01835)
Keywords
overlapping community
central node
label propagation
node influence
label belonging coefficient