摘要
聚焦爬虫可以搜集特定领域的信息资源,能够满足人们的个性化需求。编目人员在从事原始编目工作的过程中,如果能够从网络中查找到相应的编目数据作为参考,那么将会大大提高编目效率。因此,将此类编目数据视为一类主题信息资源,用聚焦爬虫进行抓取为编目人员所用就成为一种可能的方案。从聚焦爬虫的内涵和基本构成入手,分析利用聚焦爬虫搜集编目数据的技术,并构建融合聚焦爬虫技术的编目数据搜集模型。
Focused crawler can collect specific areas of information resources, so it can meet people's personalized needs. In the process of original cataloging, if library catalogers can search the appropriate cataloging data for reference on the Internet, it will greatly improve the efficiency of cataloging. Therefore, we can regard these cataloging data as a subject of information resources, and collect these data by using focused crawler for library catalogers to use. Taking the connotation and the basic composition of focused crawler as research starting points, this paper analyzes the technology of collecting cataloged data by using the focused crawler, and constructs the cataloging data collection model based on the technology of focused crawler.
出处
《图书馆学研究》
CSSCI
北大核心
2013年第13期78-80,共3页
Research on Library Science
关键词
聚焦爬虫
编目数据
数据搜集
模型
focused crawler cataloging data data collection model