面向校园网的IP地址逐步优化层次聚类算法被引量：3

Campus-oriented stepwise-optimal hierarchical clustering algorithm of IP address

下载PDF

导出

摘要对校园网主干数据流中IP地址进行聚类,可以得到网络用户访问地址的分布概况从而了解用户行为特征。已有聚类算法大都将IP地址作为普通数字考虑,忽略了其特征属性以致聚类结果不合理。为此提出一种改进算法:首先基于最长前缀匹配和改进的最近邻规则算法得到初始聚类,然后运用逐步优化层次聚类的思想进一步聚合最靠近子类,最终得到基于IP地址特征属性的聚类。实验结果表明该算法与以往算法相比,提高了聚类效果,具有较好的准确性和可行性。 The cluster analysis of IP addresses can reveal useful knowledge for profiling of traffic flows and user behavior. However, the popular clustering algorithms were not applicable directly to IP addresses of the campus network traffic flows. The clusters which were generated by generic algorithms were inconsistent with the IP addresses partition and difficult to interpret. To overcome the shortcoming of the current algorithms which neglect the characteristics of IP addresses, a new algorithm which could effectively improve IP addresses clustering was proposed. Firstly, the initial clusters were got by adopting the longest prefix algorithm and the nearest neighbor clustering algorithm. Then the thought of stepwise-optimal hierarchical clustering was applied to merge the nearest groups of initial clusters. The similarity between initial clusters was determined by the longest prefix of IP addresses contained in these clusters. Finally, the algorithm automatically and meaningfully yielded clusters which were in accord with the characteristics of IP addresses on traffic flows. The results show that the proposed algorithm is accurate and effective in clustering IP addresses and robust to the input sequence of data.

作者楼若岩许晓东朱士瑞

机构地区江苏大学计算机科学与通信工程学院

出处《计算机应用》 CSCD 北大核心 2007年第8期1862-1864,1867,共4页 journal of Computer Applications

基金江苏省教育厅高校科学研究基金资助项目(03KJD520073)

关键词 IP地址聚类最近邻规则最长前缀匹配逐步优化的层次聚类 IP address clustering nearest neighbor nile Longest Prefix Match （LPM） stepwise-optimal hierarchical clustering

分类号 TP312 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献9

1DUNHAM M H.Data mining:introductory and advanced topics[M].Beijing:Tsinghua University Press,2003. 被引量：1
2FREITAS A A.Data mining and knowledge discovery with evolutionary algorithms[M].Berlin:Springer-Verlag,2002. 被引量：1
3KRISHMA K,MURTY M N.Genetic k means algorithm[J].IEEE Transactions on Systems,Man and Cybernetics,Part B,1999,29(3):433-439. 被引量：1
4ESTAN C,SAVAGE S,VARGHESE G.Automatically inferring patterns of resource consumption in network traffic[C]// Proceedings of the ACM SIGCOMM 2003.[S.l.]:ACM Press,2003:137-148. 被引量：1
5VAARANDI R.A data clustering algorithm for mining patterns from event logs[C]// Proceedings of the 2003 IEEE Workshop on IP Operations and Management.[S.l.]:IEEE Press,2003:119-126. 被引量：1
6WALDVOG M.Fast longest prefix matching:algorithms analysis and applications[D].Zurich:ETH,Department of Electrical Engineering,2002. 被引量：1
7DUDA R O,HART P E,STORK D G.Pattern classification[M].2nd ed.Beijing:China Machine Press,2004. 被引量：1
8KARIM A,AHMAD I,JAMI S I.Cluster analysis of traffic flows on a campus network[C]// Proceedings of the 24th IASTED International Multi-Conference,Artificial Intelligence and Applications.Austria:ACTA Press,2006:416-421. 被引量：1
9KRISHNAMURTHY B,WANG J.On network aware clustering of web clients[C]// Proceedings of the ACM SIGCOMM 2000.[S.l.]:ACM Press,2000. 被引量：1

同被引文献18

1赵金生.P2P业务对我国互联网业务的发展影响[J].电信网技术,2006(2):71-73. 被引量：4
2潘莹,梁京章,黎慧娟.基于K-means算法的校园网用户行为聚类分析[J].计算技术与自动化,2007,26(1):66-69. 被引量：10
3SHULZE H, MOCHALSKI K. Internet study 2008/2009[ EB/OL]. [2010 - 06 - 25]. http://www. ipoque, com/study/ipoque-Internet-Study438-09. pdf. 被引量：1
4PeerApp white paper: Comparing P2P solutions[EB/OL]. [2010 - 06 - 25]. http://www. peerapp. com/. 被引量：1
5SALEH O, HEFEEDA M. Modeling and caching of peer-to-peer traffic[ C]// Proceedings of the 2006 14th IEEE International Conference on Network Protocols. Washington, DC: IEEE Computer Society, 2006:249 -258. 被引量：1
6YE MINJIANG, WU JIANPING, XU KE. Cashing the P2P traffic in ISP network[ C]// IEEE International Congerence on Communications. Washington, DC: IEEE Computer Society, 2008:5876 -5880. 被引量：1
7XIE H Y, RICHARD Y, KRISHNAMURTHY A, et al. P4P: Provider portal for applications [ C]// Proceedings of the ACM SIG- COMM 2008 Conference on Data Communication. New York: ACM, 2008:351-362. 被引量：1
8CHOFFNES D R, BUSTAMANTE F E. Taming the torrent: A practical approach to reducing cross-ISP traffic in peer-to-peer systems [J]. ACM SIGCOMM Computer Communication Review, 2008, 38 (4) : 363 -374. 被引量：1
9BINDAL R, CAO P, CHAN W, et al. Improving traffic locality in BitTorrent via biased neighbor selection[ C]// IEEE International Conference on Distributed Computing Systems. Washington, DC: IEEE Computer Society, 2006:1063 -6927. 被引量：1
10Sniffer Portable [ EB/OL]. [ 2010 - 06 - 25]. http://www, sniffer, net. cn/product/product_line/sniffer - portable - professional/. 被引量：1

引证文献3

1唐红,张云龙.BitTorrent流量控制方案[J].计算机应用,2011,31(2):304-307. 被引量：2
2李常先.大学校园用户网络行为分析系统研究[J].统计与管理,2013(4):144-145. 被引量：3
3郭玉彬,吴宇航,薄傲峰,郑淑敏,张晓鹏.基于认证数据的学生上网时间特征分析[J].计算机应用与软件,2019,36(11):101-106. 被引量：4

二级引证文献9

1赵培勇.大数据时代数学教学在农机信息化技术中的应用[J].农机化研究,2020,42(9):233-237. 被引量：1
2王城,陈兴蜀,杨邓奇,刘莉伟.P2P网络中自适应节点选择策略[J].计算机工程与设计,2012,33(6):2107-2111. 被引量：1
3肖承伟,王珂,范红.优化EPON对本地P2P业务承载能力的研究[J].南京邮电大学学报（自然科学版）,2013,33(5):39-44.
4郑羽.基于应用识别技术的网络行为分析系统UAAE[J].安徽电子信息职业技术学院学报,2017,16(1):17-20.
5位晓晓,李常先,徐德光.无线网络环境下大学生网络行为模型构建及防护对策研究[J].微型电脑应用,2019,35(5):18-21. 被引量：2
6蒋德文.初中信息技术考试管理系统设计与实现[J].信息与电脑,2021,33(12):82-84.
7郭绍永,白东玲.平安校园下高校上网认证系统优化设计与实现[J].电脑编程技巧与维护,2022(5):57-60.
8唐鹭,李智,蒋方茂.基于K-means算法的高校学生上网行为研究分析[J].现代信息科技,2022,6(6):38-40.
9叶倩,高明,田亮亮,韦雨萌,刘翼.基于时间戳间距的用户在线时长聚类方法[J].现代电子技术,2024,47(16):47-50.

1傅晓阳,郭晨.改进型遗传神经网络在模式分类中的应用[J].大连海事大学学报,2009,35(1):85-88. 被引量：1
2郝红卫,蒋蓉蓉.基于最近邻规则的神经网络训练样本选择方法[J].自动化学报,2007,33(12):1247-1251. 被引量：37
3胡航博,翟丹润.基于群组的个性化搜索算法研究[J].河南理工大学学报（自然科学版）,2010,29(1):131-134.
4梁喜涛,顾磊.基于最近邻的主动学习分词方法[J].计算机科学,2015,42(6):228-232. 被引量：1
5张洪,段海新,吴建平.基于IP地址聚类的反垃圾邮件信誉系统[J].清华大学学报（自然科学版）,2010,50(10):1723-1727. 被引量：4
6胡杭琴,赵霁.一种MIS对象和数据访问控制的解决方案[J].工业控制计算机,2006,19(4):56-58. 被引量：3
7马志强,季振洲,胡铭曾.一种基于最远块对的低静态功耗指令Cache方案[J].高技术通讯,2007,17(8):771-777. 被引量：1
8沈亢伟.一种基于指纹线方向的指纹图像增强算法[J].杭州电子科技大学学报（自然科学版）,2005,25(5):72-75.
9潘章明.半监督的自动聚类[J].计算机应用,2010,30(10):2614-2617. 被引量：2
10赵理,王磊,徐庆征.人工内分泌机制在最近邻规则约减中的应用[J].应用科学学报,2012,30(4):397-407. 被引量：2

计算机应用

2007年第8期

浏览历史

内容加载中请稍等...

面向校园网的IP地址逐步优化层次聚类算法被引量：3

参考文献9

同被引文献18

引证文献3

二级引证文献9

相关作者

相关机构

相关主题

浏览历史

面向校园网的IP地址逐步优化层次聚类算法 被引量：3

参考文献9

同被引文献18

引证文献3

二级引证文献9

相关作者

相关机构

相关主题

浏览历史

面向校园网的IP地址逐步优化层次聚类算法被引量：3