
Clustering-Based Web Link Extraction (cited by: 1)

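Abstract: The Internet consists of web pages connected by hyperlinks and offers a rich information resource for everyday and business use. Link structure analysis plays an increasingly important role in many areas of World Wide Web research. However, many of the links on a page are unrelated to its topic, which causes topic drift. This paper analyzes the characteristics of links themselves and presents a clustering-based method for automatic Web link extraction that does not depend on a site's template. Experimental results indicate that the algorithm is of practical value.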
Source: 《网络安全技术与应用》 (Network Security Technology & Application), 2009, No. 3, pp. 75-77 (3 pages)
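
The abstract does not describe the algorithm itself, so the following is only an illustrative sketch of what template-independent, clustering-style link extraction can look like, not the paper's method. It assumes that links sharing the same DOM tag path belong together (the crudest form of clustering: exact match on a single feature) and that clusters with longer anchor text are more likely to be topical than navigational. The names `LinkCollector`, `cluster_links`, and `rank_clusters` are hypothetical.

```python
from collections import defaultdict
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the DOM tag path, href, and anchor text of every <a href>."""

    # Void elements never get a closing tag, so they are not pushed on the stack.
    VOID = {"br", "img", "hr", "input", "meta", "link"}

    def __init__(self):
        super().__init__()
        self.stack = []       # tags that are currently open
        self.links = []       # finished link records
        self._current = None  # link whose anchor text is being read

    def handle_starttag(self, tag, attrs):
        if tag in self.VOID:
            return
        self.stack.append(tag)
        href = dict(attrs).get("href")
        if tag == "a" and href:
            self._current = {"path": "/".join(self.stack), "href": href, "text": ""}

    def handle_data(self, data):
        if self._current is not None:
            self._current["text"] += data.strip()

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            self.links.append(self._current)
            self._current = None
        # Pop back to the matching open tag, tolerating unclosed tags.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass


def cluster_links(html):
    """Group links by DOM tag path (assumed feature, not taken from the paper)."""
    parser = LinkCollector()
    parser.feed(html)
    clusters = defaultdict(list)
    for link in parser.links:
        clusters[link["path"]].append(link)
    return dict(clusters)


def rank_clusters(clusters):
    """Order clusters by mean anchor-text length, longest first (a heuristic)."""
    def mean_len(links):
        return sum(len(link["text"]) for link in links) / len(links)

    return sorted(clusters.items(), key=lambda kv: mean_len(kv[1]), reverse=True)
```

Calling `rank_clusters(cluster_links(page_html))` returns `(path, links)` pairs, so a caller could keep the top-ranked clusters and discard the rest. A realistic system would likely combine several features (URL structure, position, anchor text) with a proper distance-based clustering algorithm; the single-feature grouping here is only meant to make the idea concrete.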
