摘要
针对垂直搜索引擎研究领域的关键技术问题,提出了一个结合本体筛选和文本挖掘的垂直搜索引擎构建思想。首先探讨了作为研究基础的本体和文本挖掘技术,讨论了两者的作用;之后阐述了垂直搜索引擎构建的关键技术,包括基于本体筛选的智能搜索器、结合文本挖掘的网页信息分析及抽取、索引器及查询处理器的构造;最后,对提出的思想进行了实现验证,构造一个面向高校毕业生招聘的垂直搜索引擎原型。
This paper presents a construction method for vertical search engine utilizing ontology filtering and text mining towards existing problems in the domain. Firsdy, it discusses ontology and text mining as well as their appllcations. Then, we provide a set of key techniques for the construction of vertical search engine which include ontology-based Web crawling, Web page analyzing combined with text mining, indexer and searcher constructing. Finally, an evaluation of our proposed ideas is presented by implementing a prototype of job hunting search engine towards college students.
出处
《计算机科学》
CSCD
北大核心
2008年第2期188-190,共3页
Computer Science
基金
国家自然科学基金资助项目(编号60573084)
武器装备预研基金(9140A15050106HK0114)
关键词
垂直搜索
本体
本体筛选
文本挖掘
Vertical search, Ontology, Ontology filtering, Text mining