Patterns (Oct 2020)

The Ontologies Community of Practice: A CGIAR Initiative for Big Data in Agrifood Systems

  • Elizabeth Arnaud,
  • Marie-Angélique Laporte,
  • Soonho Kim,
  • Céline Aubert,
  • Sabina Leonelli,
  • Berta Miro,
  • Laurel Cooper,
  • Pankaj Jaiswal,
  • Gideon Kruseman,
  • Rosemary Shrestha,
  • Pier Luigi Buttigieg,
  • Christopher J. Mungall,
  • Julian Pietragalla,
  • Afolabi Agbona,
  • Jacqueline Muliro,
  • Jeffrey Detras,
  • Vilma Hualla,
  • Abhishek Rathore,
  • Roma Rani Das,
  • Ibnou Dieng,
  • Guillaume Bauchet,
  • Naama Menda,
  • Cyril Pommier,
  • Felix Shaw,
  • David Lyon,
  • Leroy Mwanzia,
  • Henry Juarez,
  • Enrico Bonaiuti,
  • Brian Chiputwa,
  • Olatunbosun Obileye,
  • Sandrine Auzoux,
  • Esther Dzalé Yeumo,
  • Lukas A. Mueller,
  • Kevin Silverstein,
  • Alexandra Lafargue,
  • Erick Antezana,
  • Medha Devare,
  • Brian King

Journal volume & issue
Vol. 1, no. 7
p. 100105

Abstract

Read online

Summary: Heterogeneous and multidisciplinary data generated by research on sustainable global agriculture and agrifood systems requires quality data labeling or annotation in order to be interoperable. As recommended by the FAIR principles, data, labels, and metadata must use controlled vocabularies and ontologies that are popular in the knowledge domain and commonly used by the community. Despite the existence of robust ontologies in the Life Sciences, there is currently no comprehensive full set of ontologies recommended for data annotation across agricultural research disciplines. In this paper, we discuss the added value of the Ontologies Community of Practice (CoP) of the CGIAR Platform for Big Data in Agriculture for harnessing relevant expertise in ontology development and identifying innovative solutions that support quality data annotation. The Ontologies CoP stimulates knowledge sharing among stakeholders, such as researchers, data managers, domain experts, experts in ontology design, and platform development teams. The Bigger Picture: Digital technology use in agriculture and agrifood systems research accelerates the production of multidisciplinary data, which spans genetics, environment, agroecology, biology, and socio-economics. Quality labeling of data secures its online findability, reusability, interoperability, and reliable interpretation, through controlled vocabularies organized into meaningful and computer-readable knowledge domains called ontologies. There is currently no full set of recommended ontologies for agricultural research, so data scientists, data managers, and database developers struggle to find validated terminology. The Ontologies Community of Practice of the CGIAR Platform for Big Data in Agriculture harnesses international expertise in knowledge representation and ontology development to produce missing ontologies, identifies best practices, and guides data labeling by teams managing multidisciplinary information platforms to release the FAIR data underpinning the evidence of research impact.

Keywords