摘要
高质量的生物多样性数据能够为生物多样性的研究与保护提供数据支撑。目前研究人员开发了大量的生物多样性数据处理软件或工具,包括工作流系统、R语言包、Python语言包和Excel工具等,但是使用这些软件或工具需要用户安装相应的软件客户端,并掌握一定的编程语言、软件开发和复杂的Excel公式等知识和技能。为降低用户的学习成本和使用门槛,本文采用了Browser/Server模式设计技术、Web技术、可视化技术、响应式开发技术、网络爬虫技术、数据处理技术和Solr智能检索技术等,针对不同维度的生物多样性数据设计和开发了相应的数据处理模块,构建了中国生物多样性在线数据处理平台(http://dp.iflora.cn/)。该平台能够有效地帮助科研人员对物种名称、地理位置、时间日期和经纬度等数据进行处理,并提供数据格式转换、数据质量评测和资源统计分析等辅助功能,帮助科研人员实现零代码和低门槛地处理生物多样性数据,提供便捷、高效和简单的数据清洗、校正、转换和整合等数据处理渠道,为生物多样性研究和保护提供信息化技术支持与服务。
Aims:Biodiversity contributes to the most basic living environment and material conditions for human beings,and it serves as the basis for human survival and social development.But natural environmental change and over-interference of human behavior have caused a gradual loss of biodiversity.High-quality biodiversity data can facilitate biodiversity research and conservation in order to mitigate these losses.Currently,researchers have developed many biodiversity data processing tools,including workflow systems,R language packages,Python language packages,and Excel tools.However,using these software or tools require users to install the corresponding software clients and acquire certain knowledge and skills in utilizing programming languages,software development and complex Excel formulas.This all requires a high learning cost and usage threshold,rendering these tools inaccessible for some user.For this reason,this paper aims to describe a Chinese biodiversity online data processing platform(CBODPP)to aid researchers in achieving a zero code and a low usage threshold for biodiversity data processing work.Method:The CBODPP is designed in Browser/Server mode and implemented using a web-based client.The platform pages are developed based on reactive development technology,which is compatible with both computer and mobile browsers.The platform realizes service functions such as scientific name correction,geographic location analysis and inverse geocoding based on web crawler technology,data processing technology and Solr intelligent search technology.In addition,the platform has also developed corresponding data processing modules for biodiversity data of different dimensions.Users can process data in a specified column field individually,thus ensuring a high flexibility of data processing when utilizing this platform.Results:In order to process biodiversity data,users do not need to install a workflow management system and create workflows,nor do they need to master complexcoding language such as Python or R.By acces
作者
邱金水
王亚楠
庄会富
Jinshui Qiu;Yanan Wang;Huifu Zhuang(Kunming Institute of Botany,Chinese Academyof Sciences,Kunming 650201;Kunming Institute of Zoology,Chinese Academy of Sciences,Kunming 650201)
出处
《生物多样性》
CAS
CSCD
北大核心
2022年第11期218-228,共11页
Biodiversity Science
基金
国家科技资源共享服务平台(国家重要野生植物种质资源库-NWPGRC-21)
云南省生物资源数字化开发应用(202002AA100007)
中国科学院网络安全和信息化专项(CAS-WX2022SDC-SJ01)。
关键词
物种名称处理
地理位置处理
经纬度处理
时间日期处理
数据格式处理
数据质量评测
species name processing
geographical location processing
longitude and latitude processing
time and date processing
data format processing
data quality evaluation