摘要
web长期保存既存在管理方面的困扰,也存在技术方面的困扰。管理困扰包括保存的合法性、保存内容的选择、恶意软件的去留、网页的去重,技术困扰包括网页收割工具的局限性、web保存的真实性、时间一致性、保存格式的有效性。另外,集体贡献型网站的保存还存在一些特殊的困扰,包括网站抓取的困扰、产权许可的困难、保存动机的缺失等。
The long-term preservation of web has some confusions in both management and technology. The confusions in management include legality of preservation, selection of web sites in preservation, removing or keeping off viruses and malware, and web page de-duplication. The confusions in technology include limitation in web harvesting tools, authenticity of web preservation, temporal coherence, and validity of preservation format. In addition, the preservation for web sites in collective contribution has some special confusions, including site scraping, difficulty in property right permission, and deficiency of preservation motivation. 11 refs.
出处
《国家图书馆学刊》
CSSCI
北大核心
2016年第1期99-105,共7页
Journal of The National Library of China
关键词
web保存
数字保存
数字保存质量
Web Preservation
Digital Preservation
Quality of Digital Preservation