Persistent identifiers for research objects, researchers, organizations, and funders are the key to creating unambiguous and persistent connections across the global research infrastructure(GRI). Many repositories are...Persistent identifiers for research objects, researchers, organizations, and funders are the key to creating unambiguous and persistent connections across the global research infrastructure(GRI). Many repositories are implementing mechanisms to collect and integrate these identifiers into their submission and record curation processes. This bodes well for a well-connected future, but metadata for existing resources submitted in the past are missing these identifiers, thus missing the connections required for inclusion in the connected infrastructure. Re-curation of these metadata is required to make these connections. This paper introduces the global research infrastructure and demonstrates how repositories, and their user communities, can contribute to and benefit from connections to the global research infrastructure. The Dryad Data Repository has existed since 2008 and has successfully re-curated the repository metadata several times, adding identifiers for research organizations, funders, and researchers. Understanding and quantifying these successes depends on measuring repository and identifier connectivity. Metrics are described and applied to the entire repository here. Identifiers(Digital Object Identifiers, DOIs) for papers connected to datasets in Dryad have long been a critical part of the Dryad metadata creation and curation processes. Since 2019, the portion of datasets with connected papers has decreased from 100% to less than 40%. This decrease has significant ramifications for the re-curation efforts described above as connected papers have been an important source of metadata. In addition, missing connections to papers make understanding and re-using datasets more difficult. Connections between datasets and papers can be difficult to make because of time lags between submission and publication, lack of clear mechanisms for citing datasets and other research objects from papers, changing focus of researchers, and other obstacles. The Dryad community of members, i.e. users, research institutions, 展开更多
Data repository infrastructures for academics have appeared in waves since the dawn of Web technology.These waves are driven by changes in societal needs,archiving needs and the development of cloud computing resource...Data repository infrastructures for academics have appeared in waves since the dawn of Web technology.These waves are driven by changes in societal needs,archiving needs and the development of cloud computing resources.As such,the data repository landscape has many flavors when it comes to sustainability models,target audiences and feature sets.One thing that links all data repositories is a desire to make the content they host reusable,building on the core principles of cataloging content for economical and research speed efficiency.The FAIR principles are a common goal for all repository infrastructures to aim for.No matter what discipline or infrastructure,the goal of reusable content,for both humans and machines,is a common one.This is the first time that repositories can work toward a common goal that ultimately lends itself to interoperability.The idea that research can move further and faster as we un-silo these fantastic resources is an achievable one.This paper investigates the steps that existing repositories need to take in order to remain useful and relevant in a FAIR research world.展开更多
基金funded by the U.S. National Science Foundation (Crossref Funder ID: 100000001, ROR: https://ror.org/021nxhr62) Award 2134956
文摘Persistent identifiers for research objects, researchers, organizations, and funders are the key to creating unambiguous and persistent connections across the global research infrastructure(GRI). Many repositories are implementing mechanisms to collect and integrate these identifiers into their submission and record curation processes. This bodes well for a well-connected future, but metadata for existing resources submitted in the past are missing these identifiers, thus missing the connections required for inclusion in the connected infrastructure. Re-curation of these metadata is required to make these connections. This paper introduces the global research infrastructure and demonstrates how repositories, and their user communities, can contribute to and benefit from connections to the global research infrastructure. The Dryad Data Repository has existed since 2008 and has successfully re-curated the repository metadata several times, adding identifiers for research organizations, funders, and researchers. Understanding and quantifying these successes depends on measuring repository and identifier connectivity. Metrics are described and applied to the entire repository here. Identifiers(Digital Object Identifiers, DOIs) for papers connected to datasets in Dryad have long been a critical part of the Dryad metadata creation and curation processes. Since 2019, the portion of datasets with connected papers has decreased from 100% to less than 40%. This decrease has significant ramifications for the re-curation efforts described above as connected papers have been an important source of metadata. In addition, missing connections to papers make understanding and re-using datasets more difficult. Connections between datasets and papers can be difficult to make because of time lags between submission and publication, lack of clear mechanisms for citing datasets and other research objects from papers, changing focus of researchers, and other obstacles. The Dryad community of members, i.e. users, research institutions,
文摘Data repository infrastructures for academics have appeared in waves since the dawn of Web technology.These waves are driven by changes in societal needs,archiving needs and the development of cloud computing resources.As such,the data repository landscape has many flavors when it comes to sustainability models,target audiences and feature sets.One thing that links all data repositories is a desire to make the content they host reusable,building on the core principles of cataloging content for economical and research speed efficiency.The FAIR principles are a common goal for all repository infrastructures to aim for.No matter what discipline or infrastructure,the goal of reusable content,for both humans and machines,is a common one.This is the first time that repositories can work toward a common goal that ultimately lends itself to interoperability.The idea that research can move further and faster as we un-silo these fantastic resources is an achievable one.This paper investigates the steps that existing repositories need to take in order to remain useful and relevant in a FAIR research world.