Published Versions 4 Vol 2 (1) : 208–219 2019
Data Management Planning: How Requirements and Solutions are Beginning to Converge
Abstract & Keywords
Abstract: Effective stewardship of data is a critical precursor to making data FAIR. This paper gives an overview of the current state of the art in data management and data stewardship planning (DMP) solutions. We begin by arguing why data management is an important vehicle for the adoption and implementation of the FAIR principles, and we describe the background, context and historical development, as well as the major driving forces, namely research initiatives and funders. We then provide an overview of the current leading DMP tools in a table presenting their key characteristics. Next, we elaborate on emerging common standards for DMPs, especially machine-actionable DMPs. As sound DMP is not only a precursor of FAIR data stewardship but also an integral part of it, we discuss its positioning in the emerging FAIR tools ecosystem. Capacity building and training activities are an important ingredient in the whole effort. Although it is not the primary goal of this paper, we also touch on research workforce support, as tools can only be as effective as their users are competent to use them properly. We conclude by discussing the relations of DMP to the FAIR principles, as there are important connections beyond its being a precursor.
Keywords: FAIR; Data management; Data stewardship; Machine-actionable; DMP tools
Acknowledgements
Many thanks to the DMP tool providers who shared information on their services and validated the tabular summary. These include Brian Riley of DMPTool, Harry Enke, Jochen Klar and Jens Ludwig of RDMOrganiser, Maria Iozzi, Hanne Moa and Adil Hasan of EasyDMP, Helen Connick and Andrew Janke of UQRDM, Erich Weichselgartner and Ronny Bölter of DataWiz, Kerstin Lehnert of ezDMP and Georgios Kakaletris of OpenDMP. The development of the Data Stewardship Wizard was funded partially by ELIXIR, the European research infrastructure for life-science data, and the Institute of Organic Chemistry and Biochemistry AS CR. Several hackathons and workshops were organised and funded by the GO FAIR International Support and Coordination Office. A considerable amount of work was done in kind by the Dutch Techcentre for Life Sciences and the Faculty of Information Technology, Czech Technical University in Prague.
Article and author information
Cite As
S. Jones, R. Pergl, R. Hooft, T. Miksa, R. Samors, J. Ungvari, R.I. Davis & T. Lee. Data management planning: How requirements and solutions are beginning to converge. Data Intelligence 2(2020), 208–219. doi: 10.1162/dint_a_00043
Sarah Jones
Sarah Jones (sarah.jones@glasgow.ac.uk) is the main author of the contents.
Sarah Jones coordinates work on the DCC’s Data Management Planning tool, DMPonline, and undertakes research on data policy and data management planning. Sarah is involved in several European Commission funded projects, including FOSTER+, OpenAIRE and Research Data Alliance Europe 4.0. Her work in a European context focuses primarily on training, data management planning and network building to facilitate open science. She co-chairs the RDA Active DMP Interest Group and the CODATA Working Group on Research Data Science schools. In a personal capacity, she is rapporteur on the European Commission’s FAIR Data Expert Group and a member of the Open Science Transport Research Cloud Expert Group. In previous roles, Sarah led the Data Audit Framework project and Incremental. She worked in HATII at the University of Glasgow from 2006–2017, initially for the AHDS Performing Arts data centre and then for the DCC, and is now based in Glasgow University Library.
0000-0002-5094-7126
Robert Pergl
Robert Pergl (perglr@fit.cvut.cz) coordinated the author team and the authoring process, copy-edited the text, and authored the Data Stewardship Wizard details together with Rob Hooft.
Robert Pergl is an Associate Professor at the Department of Software Engineering, Faculty of Information Technology, Czech Technical University in Prague, Czech Republic, where he founded the “Centre for Conceptual Modelling and Implementation”, a group focusing on research, development and applications of methods and tools for ontological engineering, enterprise engineering, software engineering and data stewardship. Robert Pergl is a National Node Committee member of ELIXIR Czech Republic. He is a member of several GO FAIR initiatives and projects and, together with Rob Hooft, leads the Data Stewardship Wizard development. Contribution: Leading the authors’ team and authoring process, corresponding author, copy-editing and quality assurance, Data Stewardship Wizard details.
0000-0003-2980-4400
Rob Hooft
Rob Hooft is Manager of the Dutch tasks in the European ELIXIR infrastructure for life science data, at the Dutch Techcentre for Life Sciences (DTL). After working for many years in industry, Rob moved back to the academic world and joined the Netherlands center for Bioinformatics, NBIC, as CTO for the service-directed program. After a two-year excursion to the Netherlands eScience Center, from where he ran the data program of DTL, he now works for DTL itself. Rob has been building up a body of knowledge on FAIR research data stewardship since early 2014. Rob also represents ELIXIR relations in the Research Data Alliance.
0000-0001-6825-9439
Tomasz Miksa
Tomasz Miksa (tmiksa@sba-research.org) authored the information about the RDA Working Group on Common Standards for DMPs and machine-actionable DMPs.
Tomasz Miksa has been working as a researcher at SBA Research since October 2012. He received his MSc in systems and computer networks from the Wroclaw University of Technology, Poland, in 2011. In 2016 he received his PhD in computer science from the TU Wien for his work on verification and validation of scientific workflow re-executions. He was involved in preservation of business processes in the EU-funded FP7 project TIMBUS. Furthermore, he took part in the FP7 4C Project, which aimed to clarify the costs of curation of digital assets. Currently, he is a chair of the DMP Common Standards working group at the Research Data Alliance (RDA) and a co-founder of RDA Austria. His research focuses on reproducibility of eScience experiments and machine-actionable data management plans. Topics of interest include, but are not limited to: experiment context modelling, verification and validation, data repository architectures and workflows, digital curation and preservation.
0000-0002-4929-7875
Robert Samors
R. Samors (miksa@ifs.tuwien.ac.at), J. Ungvari (jungvari@gmail.com), R. Davis (rowenaidavis@email.arizona.edu) and T. Lee (tinal@email.arizona.edu) collectively wrote and revised information in Section 5 on the Belmont Forum's research workforce support activities and resources and made additional contributions to information in the remainder of the article.
Robert Samors serves as the Coordination Officer for the Belmont Forum e-Infrastructures & Data Management Project. In that role, he coordinates and liaises with e-I & DM project leads, Action Theme co-leads, stakeholders, Advisory Group and Oversight Committee members, and international partners to encourage the adoption of data principles and best practices, promote effective data planning and stewardship, and develop training curricula to enable practitioners to put those principles and practices into action through Belmont Forum agency activities and funded projects. Prior to joining the Belmont Forum, Mr. Samors served as Senior External Relations Manager for the Group on Earth Observations (GEO), designing and implementing GEO’s engagement strategy. He has worked closely with experts and global leaders in data and information issues across a range of governments and international scientific and technical organizations. His earlier positions have included serving as Associate Vice President for Innovation and Technology Policy at the Association of Public and Land-grant Universities (APLU), as Associate Vice President for Federal Relations for the University of North Carolina System, and as Assistant Vice President for Research at the University of Michigan. He holds a Master's in Public Policy from the Harvard University Kennedy School of Government, and a B.A. in Economics from Brown University.
0000-0003-3737-0120
Judit Ungvari
Judit Ungvari is an ecologist by training, with expertise in aspects of avian biology in tropical habitats. She studied birds in the Peruvian Amazon region combining both lab- and field-based research and received her PhD degree in Zoology with a certificate and concentration in Tropical Conservation and Development at the University of Florida in 2016. Judit then worked as a postdoctoral scholar at the Smithsonian Institution in Washington, DC, addressing conservation issues in agroecosystems in Colombia. She has become involved in local capacity building and community outreach both in the USA and Latin America and has mentored dozens of students to complete independent research projects. Her interests include increasing diversity and broadening participation in the sciences, sustainability science, science diplomacy, supporting open and reproducible research efforts, and communicating science to the public, especially in museum settings. As a AAAS Science & Tech Policy fellow at the National Science Foundation, Judit is working on various international activities facilitating transdisciplinary global change research, including the advancement of e-infrastructures and data management planning.
0000-0002-5180-8048
Rowena I. Davis
Rowena I. Davis was a project coordinator for the Belmont Forum e-Infrastructures and Data Management project, a three-year project (2016–2019) facilitating data sharing among teams performing research for the Belmont Forum, a coalition of major and emerging international funders of global environmental change research. She has been co-chair of the EarthCube Liaison Team (2018–2019) and a co-chair of the RDA Mapping the Data Landscape Interest Group (2017–2019).
0000-0002-9424-0325
Tina Lee
Tina Lee was the Principal Investigator for the Belmont Forum’s e-Infrastructures and Data Management Project, a three-year project (2016–2019) whose goal was to make operational the Belmont Forum Open Data Policy & Principles in its collaborative funding program. Funded by the US National Science Foundation and four other international science funding agencies, the e-I&DM project coordinated with numerous international data and research organizations to develop resources for data management planning and training for the Belmont Forum’s global environmental change community. She is currently the user engagement officer for the CyVerse project, a cyberinfrastructure platform for life sciences computational research based at The University of Arizona in Tucson.
0000-0002-5284-7751
Data Intelligence