Journal of Big Data (Sep 2023)

A guide to creating an effective big data management framework

  • S. T. Arundel,
  • K. G. McKeehan,
  • B. B. Campbell,
  • A. N. Bulen,
  • P. T. Thiem

DOI
https://doi.org/10.1186/s40537-023-00801-9
Journal volume & issue
Vol. 10, no. 1
pp. 1 – 22

Abstract

Many agencies and organizations, such as the U.S. Geological Survey, handle massive geospatial datasets and their auxiliary data and thus face challenges in storing and ingesting data, transferring it between internal programs, and egressing it to external entities. As a result, these agencies and organizations may inadvertently devote unnecessary time and money to conveying data when standards are missing or outdated. This research aims to evaluate the components of data conveyance systems, such as transfer methods, tracking, and automation, to guide improvements in their performance. Specifically, organizations face the challenges of slow dispatch time and manual intervention when conveying data into, within, and from their systems. Conveyance often requires skilled workers when the system depends on physical media such as hard drives, particularly when terabyte transfers are required. In addition, incomplete or inconsistent metadata may necessitate manual intervention, process changes, or both. A proposed solution is organization-wide guidance for efficient data conveyance. That guidance involves systems analysis to outline a data management framework, which may include understanding the minimum requirements of data manifests, specifying transport mechanisms, and improving automation capabilities.

Keywords