Environmental Challenges (Apr 2021)

A semantic approach to enable data integration for the domain of flood risk management

  • Vatsala Nundloll,
  • Rob Lamb,
  • Barry Hankin,
  • Gordon Blair

Journal volume & issue
Vol. 3
p. 100064

Abstract

With so many sources around us continuously producing and processing data, whether mobile phones, sensors attached to devices, or satellites thousands of kilometres above our heads, data is becoming increasingly heterogeneous. Scientists inevitably face data challenges known as the 4 V’s of data: volume, variety, velocity and veracity. In this paper, we address the issue of data variety. The task of integrating and querying such heterogeneous data is harder still when the data is unstructured. We therefore propose an approach that uses Semantic Web and Natural Language Processing techniques to resolve the heterogeneity arising in data formats, bring together structured and unstructured data, and provide a unified data model for querying disparate data sets.

Keywords