A data management workflow of biodiversity data from the field to data users

Rachel A. Hackett; Michael W. Belitz; Edward E. Gilbert; Anna K. Monfils

doi:10.1002/aps3.11310

Applications in Plant Sciences (Dec 2019)

A data management workflow of biodiversity data from the field to data users

Rachel A. Hackett,
Michael W. Belitz,
Edward E. Gilbert,
Anna K. Monfils

Affiliations

Rachel A. Hackett: Department of Biology Institute for Great Lakes Research Central Michigan University Bioscience Building 2100, 1455 Calumet Court Mount Pleasant Michigan 48859 USA
Michael W. Belitz: Department of Biology Institute for Great Lakes Research Central Michigan University Bioscience Building 2100, 1455 Calumet Court Mount Pleasant Michigan 48859 USA
Edward E. Gilbert: School of Life Sciences Arizona State University Tempe Arizona 85287 USA
Anna K. Monfils: Department of Biology Institute for Great Lakes Research Central Michigan University Bioscience Building 2100, 1455 Calumet Court Mount Pleasant Michigan 48859 USA

DOI: https://doi.org/10.1002/aps3.11310
Journal volume & issue: Vol. 7, no. 12
pp. n/a – n/a

Abstract

Read online

Premise Heterogeneity of biodiversity data from the collections, research, and management communities presents challenges for data findability, accessibility, interoperability, and reusability. Workflows designed with data collection, standards, dissemination, and reuse in mind will generate better information across geopolitical, administrative, and institutional boundaries. Here, we present our data workflow as a case study of how we collected, shared, and used data from multiple sources. Methods In 2012, we initiated the collection of biodiversity data relating to Michigan prairie fens, including data on plant communities and the federally endangered Poweshiek skipperling (Oarisma poweshiek). Results Over 23,000 occurrence records were compiled in a database following Darwin Core standards. The records were linked with media and biological, chemical, and geometric measurements. We published the data as Global Biodiversity Information Facility data sets and in Symbiota SEINet portals. Discussion We highlight data collection techniques that optimized transcription time, including the use of predetermined and controlled vocabulary, Darwin Core terms, and data dictionaries. The validity and longevity of our data were supported by voucher specimens, metadata with measurement records, and published manuscripts detailing methods and data sets. Key to our data dissemination was cooperation among partners and the utilization of dynamic tools. To increase data interoperability, we need flexible and customizable data collection templates, coding, and enhanced communication among communities using biodiversity data.

Published in Applications in Plant Sciences

ISSN: 2168-0450 (Online)
Publisher: Wiley
Country of publisher: United States
LCC subjects: Science: Biology (General); Science: Botany
Website: https://bsapubs.onlinelibrary.wiley.com/journal/21680450

About the journal

Abstract

Keywords