Published Versions 3 Vol 2 (1) : 158–170 2019
Download
FAIR Convergence Matrix: Optimizing the Reuse of Existing FAIR-Related Resources
257 33 0
Abstract & Keywords
Abstract: The FAIR Principles articulate the behaviors expected from digital artifacts that are Findable, Accessible, Interoperable and Reusable by machines and by people. Although by now widely accepted, the FAIR Principles by design do not explicitly consider actual implementation choices enabling FAIR behaviors. As different communities have their own, often well-established implementation preferences and priorities for data reuse, coordinating a broadly accepted, widely used FAIR implementation approach remains a global challenge. In an effort to accelerate broad community convergence on FAIR implementation options, the GO FAIR community has launched the development of the FAIR Convergence Matrix. The Matrix is a platform that compiles for any community of practice, an inventory of their self-declared FAIR implementation choices and challenges. The Convergence Matrix is itself a FAIR resource, openly available, and encourages voluntary participation by any self-identified community of practice (not only the GO FAIR Implementation Networks). Based on patterns of use and reuse of existing resources, the Convergence Matrix supports the transparent derivation of strategies that optimally coordinate convergence on standards and technologies in the emerging Internet of FAIR Data and Services.
Keywords: FAIR Implementation Choices and Challenges; Convergence; FAIR Communities
Acknowledgments
[1]
M.D. Wilkinson, M. Dumontier, I.J. Aalbersberg, G. Appleton, M. Axton, A. Baak et al., & B. Mons. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data 3 (160018) (2016). doi:10.1038/sdata.2016.18.
[2]
Wittenburg, P. et al. this issue, DI-2019-0025. State of FAIRness in ESFRIs.
[3]
Sansone, S.A., Susanna-Assunta Sansone, P. McQuilton, P. Rocca-Serra, A. Gonzalez-Beltran, M. Izzo, A.L. Lister, M. Thurston & the FAIRsharing Community. FAIRsharing as a community approach to standards, repositories and policies. Nature Biotechnology. 37 (2019) 358–367. doi:10.1038/s41587-019-0080-8
[4]
The FAIRsharing registry. Available at:https://fairsharing.org/standards
[5]
P. McQuilton, D. Batista, O. Beyan, R. Granell, S. Coles, M. Izzo, ... & S.-A. Sansone. Helping the consumers and producers of standards, repositories and policies to enable FAIR data. Data Intelligence 2(2020), 151–157. doi: 10.1162/dint_a_00037.
[6]
W3C working group on Semantic Web Activity. Information available at:https://www.w3.org/2001/sw/
[7]
W3C working group on Data Activity. Information available at:https://www.w3.org/2013/data/
[8]
Web Ontology Language. Information available at:https://www.w3.org/2001/sw/Specs#OWLSpecs
[9]
Wittenburg, P. & Strawn, G. (2018). Common Patterns in Revolutionary Infrastructures and Data. Available at:https://www.rd-alliance.org/sites/default/files/Common_Patterns_in_Revolutionising_Infrastructures-final.pdf
[12]
Report from the GO FAIR Implementation Networks meeting 2019. Available at:https://www.go-fair.org/2019/01/30/go-fair-implementation-networks-meeting/
[13]
A presentation giving an overview of the Matrix. Available at:https://osf.io/ceyvj/
[15]
Coles, S. et. al. this issue, DI-2019-0017. Taking FAIR on the ChIN: Chemistry Implementation Network.
[16]
Webpage of the Rare Disease Implementation Network. Available at:https://www.go-fair.org/implementation-networks/overview/rare-diseases/
[17]
Refinements of the Matrix questions: Available at:https://osf.io/xqfb9/
[18]
Dutch Techcentre for Life Sciences & Czech Technical University in Prague (2016). Data Stewardship Wizard (DSW). Available at: https://ds-wizard.org/.
[19]
Pergl, R. et al. this issue, DI-2019-0018. Data Management Planning: how requirements and solutions are beginning to converge.
[20]
The ENVRI-FAIR project website. Available at:http://envri.eu/envri-fair/
[21]
GO FAIR Reference Ontology. Available at:https://github.com/go-fair-ins/GO-FAIR-Ontology
[22]
Semantic data models of FAIR Implementation Choices and Challenges. Available at:https://w3id.org/fair/icc/terms/FAIR-ICC-Model
[23]
G. Guizzardi, & J. Mylopoulos. Taking It to the Next Level: Nicola Guarino, Formal Ontology and Conceptual Modeling. Frontiers in Artificial Intelligence and Applications. (2019) 223–241. doi:10.3233/978-1-61499-955-3-223
[24]
B. Mons, H. van Haagen, C. Chichester, P. Hoen, J.T. den Dunnen, G. van Ommen et al...& E. Schultes. The value of data. Nature Genetics. 43 (2011) 281–283. doi:10.1038/ng0411-281.
[25]
Kuhn, T. (2013). Broadening the Scope of Nanopublications. In: Cimiano P., Corcho O., Presutti V., Hollink L., Rudolph S. (eds) The Semantic Web: Semantics and Big Data. ESWC 2013. Lecture Notes in Computer Science, vol 7882. Springer, Berlin, Heidelberg. doi:10.1007/978-3-642-38288-8_33
[26]
Tobias Kuhn, A. Meroño-Peñuela, A. Malic, J. H. Poelen, A.H. Hurlbert, E. C. Ortiz et al... & M. Dumontier. (2018). Nanopublications: A Growing Resource of Provenance-Centric Scientific Linked Data. 2018 IEEE 14th International Conference on e-Science (e-Science). doi:10.1109/eScience.2018.00024
[28]
Tobias Kuhn, E. Willighagen, C. Evelo, N. Q. Rosinach C.Laura & I. Furlong (2017). Reliable Granular References to Changing Linked Data. In: d'Amato C. et al. (eds) The Semantic Web – ISWC 2017. ISWC 2017. Lecture Notes in Computer Science, vol 10587. Springer, Cham. doi:10.1007/978-3-319-68288-4_26
[29]
Global Soil Partnership. Information available at:http://www.fao.org/global-soil-partnership/en/
[30]
Encyclopaedia of Life. Available at:https://eol.org
[31]
Global Biodiversity Information Facility. Available at:https://www.gbif.org
[32]
International Chemical Identifier. Available at:https://www.inchi-trust.org
Article and author information
Cite As
H.P. Sustkova, K.M. Hettne, P. Wittenburg, A. Jacobsen, T. Kuhn, R. Pergl,... & E. Schultes. FAIR convergence matrix: Optimizing the reuse of existing FAIR-related resources. Data Intelligence 2(2020), 158–170. doi: 10.1162/dint_a_00038
Hana Pergl Sustkova
H.P. Sustkova supported the coalition development team; coordinated the team of authors, contributed, reviewed and commented on the manuscript; handled formatting.
hana.pergl@go-fair.org
Hana Pergl Sustkova is the Operations Officer at the GO FAIR International Support and Coordination Office (GFISCO). Hana supports key activities of the GFISCO including the FAIR Convergence Matrix coalition coordination. Prior to joining the GFISCO, Hana worked as project manager for the ELIXIR research infrastructure, which marked her transition from an international corporation to the academic sphere. Her background is business administration and management.
0000-0002-4462-6465
Kristina Maria Hettne
K.M. Hettne contributed to the overall conception, created a list of questions for the IN profile and analyzed the results, contributed to the list of questions for the Wizard implementation, contributed to the ontology and nanopublications, and contributed to the manuscript.
Kristina Hettne, PhD, is a Digital Scholarship Librarian at the Centre for Digital Scholarship, Leiden University Libraries in Leiden, The Netherlands. At the Centre, she helps researchers navigate Open Science and shape the future of research data management. She is the Centre’s liaison with GO FAIR and part of the FAIR Convergence Matrix development team for optimizing the reuse of existing FAIR-related resources. She obtained her PhD degree in bioinformatics of toxicogenomics from the University of Maastricht in 2012. She is a review editor for “Frontiers in Big Data”, member of the Advisory Board of the Wiley journal Genetics & Genomics Next, and co-author of more than 30 research publications.
0000-0002-4182-7560
Peter Wittenburg
Peter Wittenburg was Executive Director of Research Data Alliance (RDA) Europe, Member of RDA Technical Advisory Board, and Scientific Coordinator of European Data Infrastructure (EUDAT). He set up and led the Technical Group with about 30 experts at Max Planck Institute (MPI) for Psycholinguistics and then led the Language Archiving Group with about 25 experts. Since 2000 he has played leading roles in a variety of European (funded by the European Commission) and national projects (funded by MPS, DFG, BMBF, NWO 23) and ISO initiatives (ISO TC37/SC4). He won the Heinz Billing Award of the MPS for the advancement of scientific computation in 2011 and received an honorary doctorate from University Tübingen in 2013.
0000-0003-3538-0106
Annika Jacobsen
Annika Jacobsen is a postdoctoral researcher at the BioSemantics group, Human Genetics Department, Leiden University Medical Center, The Netherlands. She obtained her Bachelor and Master degrees at the Technical University of Denmark in 2009 and 2012, and her PhD degree at the Vrije Universiteit Amsterdam in 2019. Her research interests are to create interoperable FAIR rare disease data with the aim to learn more about cause, diagnosis and treatment.
0000-0003-4818-2360
Tobias Kuhn
T. Kuhn contributed to the overall conception and the ontology, created the nanopublications, and wrote the text about them.
Tobias Kuhn is an assistant professor at the Computer Science department of the VU University Amsterdam. After receiving his PhD at the Institute of Computational Linguistics of the University of Zurich in 2010, he worked at the University of Malta, Yale University, and ETH Zurich. His research interests span fields including knowledge representation, controlled natural language, socio-technical systems, and scholarly communication. His recent work focuses on the approach of nanopublications, how cryptographic methods and provenance modelling can support trust and reliability, and how this can support the initiatives around the FAIR principles for data management.
0000-0002-1267-0234
Robert Pergl
R. Pergl is leading the development team of Wizard and he is the main author and maintainer of the GO FAIR Core Ontology. In this paper, he contributed to the relevant sections and Figure 1.
Robert Pergl is an Associate Professor at Department of Software Engineering, Faculty of Information Technologies of Czech Technical University in Prague, Czech Republic, where he founded “Centre for Conceptual Modelling and Implementation”, a group focusing on research, development and applications of methods and tools for ontological engineering, enterprise engineering, software engineering and data stewardship. Robert Pergl is a National Node Committee member of ELIXIR Czech Republic. He is a member of several GO FAIR initiatives and projects and together with Rob Hooft he leads the Data Stewardship Wizard development. Contribution: Leading the authors’ team and authoring process, communications author, copy-editing and quality assurance, Data Stewardship Wizard details.
0000-0003-2980-4400
Jan Slifka
J. Slifka is a member of the Wizard development team and he is the author of the customizations of Wizard for the Matrix project. Together with R. Pergl, he authored the relevant sections.
Jan Slifka got his Master’s degree in Software Engineering at the Czech Technical University in Prague. He is now a PhD student there, focusing on evolvable systems and functional programming. He acquired hands-on experience from the industry while working as a senior developer for several startups with a wide range of technologies. He is also a member of ELIXIR-CZ Interoperability Platform where he works as a chief User Interface developer on the Data Stewardship Wizard. He collaborates with GO FAIR, building the FAIR Funding Ecosystem.
0000-0002-4941-0575
Peter McQuilton
P. McQuilton and S.-A. Sansone worked to connect FAIRsharing with the Wizard, and contributed to the manuscript.
Peter McQuilton holds a 1st class BSc (Hons) degree in Genetics from the University of Leeds (2000) and a PhD (2004) in Drosophila neurodevelopment from the University of Cambridge. Peter has spent over 15 years working in the fields of bioinformatics, biocuration and data wrangling, first as a genetic literature curator at FlyBase, the premier database on drosophila genes and genomes, and then as part of the Data Readiness Group at the Oxford e-Research Centre. As part of the Data Readiness Group, Peter leads the FAIRsharing project. FAIRsharing is a manually curated, searchable portal of interlinked standards, databases and policies, from all domains.
0000-0003-2687-1982
Barbara Magagna
Barbara Magagna holding a master degree in landscape planning and in geoinformatics, has 24 years of experience working in the field of GIS, landscape ecology modelling and database management for projects operating at different scales. Her interests and formation moved in the last years also towards ontology engineering and process facilitation, abilities she could already apply in several national and European semantic projects. She had been working for the University of Natural Resources and Applied Life Sciences, the University of Vienna and since December 2007 for the Federal Environment Agency (Umweltbundesamt GmbH) where she undertakes the function of a semantic analyst and database designer. She was involved in FP7 and H2020 projects as facilitator in the development process of terminologies like SERONTO and EnvThes. She has experience as work package lead related to data management, in the design of UML models and XML schemas in the air quality data reporting area, in the design of semantic models in projects related to Environmental Research Infrastructures (ENVRI).
0000-0003-2195-3997
Susanna-Assunta Sansone
P. McQuilton and S.-A. Sansone worked to connect FAIRsharing with the Wizard, and contributed to the manuscript.
Susanna-Assunta Sansone is an Associate Director, Associate Professor and Principal Investigator at the Oxford e-Research Centre, part of the Department of Engineering Science at the University of Oxford. She is one of the authors of the FAIR principles and an active contributor to a variety of community-driven FAIR-enabling efforts. Her group researches and develops methods and tools to improve data reuse, for data transparency, research integrity and the evolution of scholarly publishing: https://sansonegroup.eng.ox.ac.uk.
0000-0001-5306-5690
Markus Stocker
M. Stocker co-lead the related work in the ENVRI-FAIR project, reviewed and commented on the manuscript.
Markus Stocker is Head of the Knowledge Infrastructures Research Group at the TIB Leibniz Information Centre for Science and Technology. He holds a PhD in Environmental Informatics from the University of Eastern Finland; a MSc in Environmental Science from the University of Eastern Finland; and a Diploma (MSc) in Informatics from the University of Zurich, Switzerland. His research interests lie at the intersection between research infrastructures and research communities, and how such infrastructures acquire, maintain, and share scientific knowledge about human and natural worlds.
0000-0001-5492-3212
Melanie Imming
Melanie Imming authored reports on FAIR Data and Data Stewardship practices in The Netherlands for SURF and the Dutch National Contact Point for Research Data Management, LCRDM. As a consultant with a strong international network in the area of Open Science, FAIR Data and Digital Cultural Heritage, Melanie is experienced in project management and engaging stakeholders from different domains, and a known advocate for Open Science practices.
0000-0003-2376-9755
Larry Lannom
Larry Lannom is Director of Information Services and Vice President at the Corporation for National Research Initiatives (CNRI), where he works with organizations in both the public and private sectors to develop experimental and pilot applications of advanced networking and information management technologies. Mr Lannom’s current work is focused on CNRI’s Digital Object Architecture, which is based on the concept of the digital object, a uniform approach to representing digital information across computing and application environments, both now and into the future. Mr. Lannom joined CNRI in September of 1996. Prior to that, he was a Technical Director at DynCorp, Inc., where he served as an advisor on digital library research for the ISTO, CSTO, and ITO offices of the US Defense Advanced Research Projects Agency (DARPA), including initiating the Computer Science Technical Reports (CS-TR) project, DARPA’s first effort in the digital library area. In addition, he managed the development of internal information systems for DARPA. Originally trained as a librarian, his earlier work included reference book publishing and information retrieval research.
0000-0003-1254-7604
Mark Musen
Mark Musen is Professor of Biomedical Informatics and of Biomedical Data Science at Stanford University, where he is Director of the Stanford Center for Biomedical Informatics Research. Dr. Musen conducts research related to open science, intelligent systems, computational ontologies and biomedical decision support. His group developed Protégé, the world’s most widely used technology for building and managing terminologies and ontologies. He has served as principal investigator of the National Center for Biomedical Ontology and of the Center for Expanded Data Annotation and Retrieval (CEDAR). Dr. Musen directs the World Health Organization Collaborating Center for Classification, Terminology, and Standards at Stanford University, which has developed much of the information infrastructure for the authoring and management of the 11th edition of the International Classification of Diseases (ICD-11). Dr. Musen was the recipient of the Donald A. B. Lindberg Award for Innovation in Informatics from the American Medical Informatics Association in 2006. He has been elected to the American College of Medical Informatics, the Association of American Physicians, the International Academy of Health Sciences Informatics, and the National Academy of Medicine.
0000-0003-3325-793X
Erik Schultes
E. Schultes (erik.schultes@go-fair.org) conceived anddesigned the Convergence Matrix and its early-stage implementation, and directed the coalition developmentteam.
erik.schultes@go-fair.org
Erik Schultes is International Science Coordinator at the GO FAIR International Support and Coordination Office where he has been working with a diverse community of stakeholders to develop FAIR data and services. Erik is also a member of the Leiden Center for Data Science at Leiden University. Erik is an evolutionary biologist with long standing interests in data-intensive research. In addition to private consulting, he has held previous academic appointments at the University of California, Los Angeles, The Whitehead Institute for Biomedical Research at the Massachusetts Institute of Technology, Duke University, and The Santa Fe Institute.
0000-0001-8888-635X
FAIRsharing is funded by grants awarded to SAS that include elements of this work; specifically, grantsfrom the UK BBSRC and Research Councils (BB/L024101/1, BB/L005069/1), European Union (H2020-EU.3.1,634107, H2020-EU.1.4.1.3, 654241, H2020-EU.1.4.1.1, 676559), IMI (116060) and NIH (U54 AI117925,1U24AI117966-01, 1OT3OD025459-01, 1OT3OD025467-01, 1OT3OD025462-01) and the newFAIRsharing award from the Wellcome Trust (212930/Z/18/Z), as well as a related award (208381/A/17/Z). SAS is funded also by the Oxford e-Research Centre, Department of Engineering Science of the Universityof Oxford. The development of the Wizard technology was funded partially by ELIXIR, the European research infrastructure for life-science data and Institute of Organic Chemistry and Biochemistry AS CR. Considerable amount of work was done in kind by Faculty of Information Technology, Czech Technical University in Prague and by Dutch Techcentre for Life Sciences.
Publication records
Published: None (Versions3
References
Data Intelligence