
The Design and Implementation of an Ontology-Based Theme Crawler (基于本体的主题爬虫的设计与实现)

Cited by: 2
Abstract: This paper analyzes the tunneling problem encountered in the best-first search strategy, and designs and implements an ontology-based theme crawler system.
Authors: 杨贞 (Yang Zhen), 杜习英 (Du Xiying)
Source: 《科技情报开发与经济》 (Sci-Tech Information Development & Economy), 2008, No. 2, pp. 73-75 (3 pages)
Keywords: theme crawler; ontology; best-first search algorithm


