期刊文献+

基于Python的互联网招聘数据采集技术 被引量:2

Data Acquisition Technology of Internet Recruitment
下载PDF
导出
摘要 面对招聘网站发布的海量招聘数据,为了利用技术手段从招聘网站采集招聘数据,本文基于Python语言设计爬虫采集技术并实现了面向猎聘、Boss、拉钩等招聘类网站的数据采集,完成了对全部招聘信息及其详情页面的数据爬取。本文采用Scrapy框架实现对定制网站内容的爬取,并采用图像识别技术解决了爬取过程中遇到的验证码问题,最终成功获取50000余条数据。 Facing the massive recruitment data published by recruitment websites,in order to collect recruitment data from recruitment websites by technical means,this paper designs crawler collection technology based on Python language,and realizes data collection for recruitment websites such as Liepin,boss and hook,and crawls all recruitment information and its detailed pages.In this paper,Scrapy framework is used to crawl the content of customized website,and image recognition technology is used to solve the verification code problem encountered in crawling process,and finally more than 50000 pieces of data are successfully obtained.
作者 孙暖 曹小平 刘军 Sun Nuan;Cao Xiaoping;Liu Jun(Chongqing Creation Vocational College,Chongqing 402160,China)
出处 《信息与电脑》 2020年第18期161-163,共3页 Information & Computer
基金 重庆市高等教育教学改革研究项目(项目编号:202182)。
关键词 PYTHON 数据采集 爬虫 Scrapy Python data collection spider Scrapy
  • 相关文献

参考文献8

二级参考文献34

  • 1周立柱,林玲.聚焦爬虫技术研究综述[J].计算机应用,2005,25(9):1965-1969. 被引量:153
  • 2TUMASJAN A, SPRENGER T O, SANDNER P G, et al. Predicting elections with Twitter: what 140 characters reveal about political sentiment[C] // Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media. Madison: AAAI Press, 2010, 10: 178-185. 被引量:1
  • 3WELCH M J, SCHONFELD U, HE D, et al. Topical semantics of twitter links[C] // Proceedings of the Fourth ACM International Conference on Web Search and Data Mining. New York: ACM Press, 2011: 327-336. 被引量:1
  • 4CARLISLE J E, PATTON R C. Is social media changing how we understand political engagement? An analysis of Facebook and the 2008 presidential election[J]. Political Research Quarterly, 2013, 66(4): 883-895. 被引量:1
  • 5CUNLIFFE D, MORRIS D, PRYS C. Young bilinguals' language behaviour in social networking sites: the use of welsh on Facebook[J]. Journal of Computer-Mediated Communication, 2013, 18(3): 339-361. 被引量:1
  • 6STRAFLING N, KRAMER N C. Learning together on Facebook et al. The influence of social aspects and personality on the usage of social media for study related exchange [J]. Gruppendynamik und Organisationsberatung, 2013, 44(4): 409-428. 被引量:1
  • 7DUAN J Y, DHOLAKIA N. The reshaping of Chinese consumer values in the social media era: exploring the impact of Weibo [J]. Journal of Macromarketing, 2013, 33(4): 402-403. 被引量:1
  • 8HUANG R, SUN X. Weibo network, information diffusion and implications for collective action in China [J]. Information Communication and Society, 2014, 17(1): 86-104. 被引量:1
  • 9MAZO J. Blocked on Weibo: what gets suppressed on China's version of Twitter (and why) [J]. Survival, 2013, 55(6): 191-192. 被引量:1
  • 10POELL T, de KLOET J, ZENG G, et al. Will the real Weibo please stand up? Chinese online contention and actor-network theory [J]. Chinese Journal of Communication, 2014,7(1): 1-18. 被引量:1

共引文献129

同被引文献10

引证文献2

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部