摘要
为了改善农业领域海量信息的检索效率,采用垂直搜索技术利用混合学习方法的成员搜索引擎选择策略,构建一种元搜索引擎。利用正则表达式的方法,进行农业领域网页特征库的构建。基于农业领域网页特征库,对元搜索引擎初次检索结果集进行筛选排序处理,以此来达到去除非领域相关网页和按照规则重排序的目的,实现查准。利用此特征库对元搜索引擎检索结果进行结果处理操作,最终以统一格式将结果反馈给用户。
To improve search efficiency for vast amount of information in agricultural domain,the vertical search technology was used to build the meta-search engine based on a selection strategy of member search engine using blended learning method.The construction of web page feature base in agriculture by regular expression was for the service of filtering results set that the meta-search engine got initially.It excluded non-domain data,re-ranged the results set and improved the accuracy of the results set.Final results were feedbacked to users.
出处
《江苏农业学报》
CSCD
北大核心
2011年第6期1380-1386,共7页
Jiangsu Journal of Agricultural Sciences
关键词
垂直检索
元搜索引擎
正则表达式
农业领域
网页特征库
vertical search
meta-search engine
regular expression
web page feature base in agriculture