Biodiversity Data Journal (Dec 2014)

A semi-automated workflow for biodiversity data retrieval, cleaning, and quality control

  • Cherian Mathew,
  • Anton Güntsch,
  • Matthias Obst,
  • Saverio Vicario,
  • Robert Haines,
  • Alan Williams,
  • Yde de Jong,
  • Carole Goble

DOI
https://doi.org/10.3897/BDJ.2.e4221
Journal volume & issue
Vol. 2
pp. 1 – 12

Abstract

Compiling and cleaning the data needed for the analysis and prediction of species distributions is a time-consuming process that requires a solid understanding of the data formats and service APIs provided by biodiversity informatics infrastructures. We designed and implemented a Taverna-based Data Refinement Workflow that integrates taxonomic data retrieval, data cleaning, and data selection into a consistent, standards-based, and effective system, hiding the complexity of the underlying service infrastructures. The workflow can be used freely, either locally or through a web portal that requires no additional software installation by users.
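
The three stages named in the abstract (retrieval, cleaning, selection) can be illustrated with a minimal Python sketch. This is not the authors' Taverna workflow and it calls no real biodiversity service: the function names, the in-memory sample records, and the cleaning rules are all assumptions made for illustration. Only the field names follow Darwin Core terms (scientificName, decimalLatitude, decimalLongitude).

```python
# Illustrative sketch only: a retrieve -> clean -> select pipeline over
# in-memory occurrence records. All names and data here are hypothetical.

# Made-up sample data standing in for a remote occurrence service.
SAMPLE_RECORDS = [
    {"scientificName": "Mytilus edulis ", "decimalLatitude": 58.25, "decimalLongitude": 11.45},
    {"scientificName": "Mytilus edulis", "decimalLatitude": 58.25, "decimalLongitude": 11.45},
    {"scientificName": "Mytilus edulis", "decimalLatitude": 95.0, "decimalLongitude": 11.45},
    {"scientificName": "Asterias rubens", "decimalLatitude": None, "decimalLongitude": 10.0},
    {"scientificName": "Asterias rubens", "decimalLatitude": 57.9, "decimalLongitude": 11.6},
]


def retrieve(names):
    """Stand-in for data retrieval: return copies of records matching the names."""
    return [dict(r) for r in SAMPLE_RECORDS if r["scientificName"].strip() in names]


def clean(records):
    """Drop records with missing or out-of-range coordinates, trim names, dedupe."""
    seen = set()
    cleaned = []
    for r in records:
        lat = r.get("decimalLatitude")
        lon = r.get("decimalLongitude")
        if lat is None or lon is None:
            continue  # coordinates missing
        if not (-90 <= lat <= 90 and -180 <= lon <= 180):
            continue  # coordinates outside the valid range
        name = r["scientificName"].strip()
        key = (name, lat, lon)
        if key in seen:
            continue  # exact duplicate after trimming
        seen.add(key)
        cleaned.append(
            {"scientificName": name, "decimalLatitude": lat, "decimalLongitude": lon}
        )
    return cleaned


def select(records, name):
    """Stand-in for data selection: keep only records for one taxon."""
    return [r for r in records if r["scientificName"] == name]


def run_pipeline(names, selected_name):
    """Chain the three stages in the order the abstract lists them."""
    return select(clean(retrieve(names)), selected_name)


if __name__ == "__main__":
    for record in run_pipeline({"Mytilus edulis", "Asterias rubens"}, "Mytilus edulis"):
        print(record)
```

In this sketch each stage is a plain function over a list of records, so a stage can be swapped or tested on its own; a real workflow would replace the retrieval stub with calls to actual services.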

Keywords