Sensors (Apr 2022)

CEBA: A Data Lake for Data Sharing and Environmental Monitoring

  • David Sarramia,
  • Alexandre Claude,
  • Francis Ogereau,
  • Jérémy Mezhoud,
  • Gilles Mailhot

DOI
https://doi.org/10.3390/s22072733
Journal volume & issue
Vol. 22, no. 7
p. 2733

Abstract

Read online

This article presents a platform for environmental data named “Environmental Cloud for the Benefit of Agriculture” (CEBA). The CEBA should fill the gap of a regional institutional platform to share, search, store and visualize heterogeneous scientific data related to the environment and agricultural researches. One of the main features of this tool is its ease of use and the accessibility of all types of data. To answer the question of data description, a scientific consensus has been established around the qualification of data with at least the information “when” (time), “where” (geographical coordinates) and “what” (metadata). The development of an on-premise solution using the data lake concept to provide a cloud service for end-users with institutional authentication and for open data access has been completed. Compared to other platforms, CEBA fully supports the management of geographic coordinates at every stage of data management. A comprehensive JavaScript Objet Notation (JSON) architecture has been designed, among other things, to facilitate multi-stage data enrichment. Data from the wireless network are queried and accessed in near real-time, using a distributed JSON-based search engine.

Keywords