
An Algorithm of Topic Search Based on Hyperlink and Anchor (一种基于超链和锚文本分析的主题发现算法)

Cited by: 1
Abstract: To address the "topic drift" problem that tends to arise in the HITS algorithm, this paper proposes a new topic discovery algorithm. Links are first divided into two classes according to how much they contribute to the authority score. Links whose source and target nodes both belong to the root set form the first class and are assigned a higher weight; the weights of the remaining links are adjusted by analyzing their anchor text. This approach effectively improves the quality of topic extraction, and an example demonstrating its capabilities is provided.
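The link-weighting idea described in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything concrete here is an assumption rather than the authors' method: the function names, the fixed `root_weight` of 2.0, the term-overlap anchor scorer with a floor of 0.1, and the toy graph are all hypothetical. The sketch only shows how per-link weights could enter the standard HITS authority and hub updates.

```python
import math


def make_anchor_weight(query_terms, floor=0.1):
    """Hypothetical anchor-text scorer: floor plus scaled query-term overlap."""
    terms = {t.lower() for t in query_terms}

    def score(anchor_text):
        words = set(anchor_text.lower().split())
        overlap = len(terms & words) / len(terms) if terms else 0.0
        return floor + (1.0 - floor) * overlap

    return score


def normalize(scores):
    """Scale scores to unit Euclidean length (left unchanged if all zero)."""
    norm = math.sqrt(sum(v * v for v in scores.values()))
    return {k: (v / norm if norm else v) for k, v in scores.items()}


def weighted_hits(edges, root_set, anchor_weight, root_weight=2.0, iterations=50):
    """Run HITS with per-link weights.

    edges: list of (source, target, anchor_text) tuples.
    Links with both endpoints in root_set get root_weight (assumed value);
    all other links get a weight derived from their anchor text.
    """
    nodes = {n for src, dst, _ in edges for n in (src, dst)}
    weighted = []
    for src, dst, anchor in edges:
        if src in root_set and dst in root_set:
            weighted.append((src, dst, root_weight))
        else:
            weighted.append((src, dst, anchor_weight(anchor)))

    auth = {n: 1.0 for n in nodes}
    hub = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        # Authority update: weighted sum of hub scores of pages linking in.
        new_auth = {n: 0.0 for n in nodes}
        for src, dst, w in weighted:
            new_auth[dst] += w * hub[src]
        auth = normalize(new_auth)
        # Hub update: weighted sum of authority scores of pages linked to.
        new_hub = {n: 0.0 for n in nodes}
        for src, dst, w in weighted:
            new_hub[src] += w * auth[dst]
        hub = normalize(new_hub)
    return auth, hub


# Toy graph: pages "a" and "b" form the root set; "c" and "d" are outside it.
example_edges = [
    ("a", "b", "search engine"),
    ("c", "b", "HITS algorithm"),
    ("c", "d", "cheap flights"),
    ("a", "d", "cheap flights"),
]
scorer = make_anchor_weight(["hits", "algorithm"])
authority, hubs = weighted_hits(example_edges, {"a", "b"}, scorer)
```

In this toy graph, page "d" is reached only through links with off-topic anchor text, so its links carry the floor weight and its authority score ends up below that of "b". This is the kind of damping of off-topic pages that the abstract attributes to the weighting scheme.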
Source: Microelectronics & Computer (《微电子学与计算机》), CSCD, Peking University Core Journal (北大核心), 2009, No. 6, pp. 125-128 (4 pages)
Keywords: HITS algorithm; topic drift; hyperlink anchor text


