期刊文献+

基于Rhino的JavaScript动态页面解析研究与实现 被引量:18

Research and Implementation of Interpreting JavaScript Dynamic Web Page Based on Rhino Engine
下载PDF
导出
摘要 面对互联网上占据全国页面总数50%以上的动态页面,当前网络舆情管控工作中的信息采集环节对以动态页面为主要发布形态的互联网媒体无法实现信息获取。鉴于此,文中提出了基于Rhino实现JavaScript动态页面解析的整体方案。实验结果表明该方案充分丰富了互联网舆情管控工作的数据源对象,是实现动态页面内超链接网络地址递归获取和网页主体内容提取行之有效的解决方案。 Dynamlc Web page holds more than 50% of the total Web pages in countywide;however,the information collector of current network public opinion monitoring system can not get the information of Internet medium which uses dynamic Web page as its main content distribution form. Thereby,there is a scheme for interpreting JavaScript dynamic Web page by using Rhino engine presented in this psper. Proved by the experiments, this scheme is an effective one for extracting the hyperlink network addresses and content of dynamic Web page and it has enriched the work data set of network public opinion monitoring.
出处 《计算机技术与发展》 2008年第2期1-4,50,共5页 Computer Technology and Development
基金 国家自然科学基金项目(60502032 60402019) 上海市科委项目(065115020) 教育部新世纪优秀人才支持计划项目(NCET-06-0393)
关键词 脚本解释引擎Rhino JavaScript动态页面 动态页面解析 Rhino script engine JavaSeript dynamic Web page interpret dynamic Web page
  • 相关文献

参考文献7

二级参考文献17

  • 1[1]Eich B. JavaScript C Engine Embedder's Guide[EB/OL]. Http://www.mozilla.org/js/spidermonkey/apidoc/jsguide.html, mozilla.org, march 16, 2000. 被引量:1
  • 2[2]ECMA. ECMA-Script Language Specification Edition 3[EB/OL]. Http://www.mozilla.org/js/language/E262 3.pdf, European Computer manufacturer Association, march 24, 2000. 被引量:1
  • 3[3]Netscape. JavaScript C Engine API Reference[EB/OL]. http://developer.netscape.com/docs/manuals/javascriptapi/index.htm, Netscape Communications Corp., December 17, 1998. 被引量:1
  • 4[4]Netscape. JavaScript 1.5 References[EB/OL]. http://devedge.netscape.com/library/manuals/2000/javascript/1.5/guide/, Netscape Communications Corp., September 28, 2000. 被引量:1
  • 5[10]Fielding R,Gettys J,Mogul J,Frystyk Nielsen H,Masinter L,Leach P,Berners-Lee T.RFC2616,Hypertext Transfer Protocol-HTTP/1.1[S].June 1999. 被引量:1
  • 6[1]Internet System Consortium,Internet Domain Survey[EB/OL],http://www.isc.org/,Jan.2005. 被引量:1
  • 7[3]ISO 8879.Information Processing-Text and Office Systems-Standard Generalized Markup Language (SGML)[S]. 被引量:1
  • 8[6]World Wide Web Consortium,Character entity references in HTML 4[EB/OL],http://www.w3.org/TR/html401/sgml/entities.html,Dec.1999. 被引量:1
  • 9[7]Berners-Lee T,Fielding R,Masinter L.Uniform Resource Identifiers (URI):Generic Syntax[S].August 1998. 被引量:1
  • 10[8]Fielding R.RFC1808.Relative Uniform Resource Locators[S].June 1995. 被引量:1

共引文献40

同被引文献141

引证文献18

二级引证文献83

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部