Journal of Open Research Software (Jan 2023)

Taskfarm: A Client/Server Framework for Supporting Massive Embarrassingly Parallel Workloads

  • Magnus Hagdorn
  • Noel Gourmelen

DOI
https://doi.org/10.5334/jors.393
Journal volume & issue
Vol. 11, no. 1

Abstract

Taskfarm is a client/server framework for keeping track of massive embarrassingly parallel workloads. The system is split into two packages: (1) a Flask server that hands out new tasks via HTTP and (2) a Python client that requests and updates tasks. The server stores task progress in a database. The system was designed to manage a satellite data processing workflow with hundreds of thousands of tasks of variable compute cost, but it can be used for any problem that can be solved using a task farm.
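
The task farm pattern described in the abstract can be illustrated with a minimal sketch. This is not Taskfarm's actual API: the class `TaskFarmSketch`, its methods, the table layout, and the helper `run_worker` are hypothetical names chosen for illustration. The HTTP layer is replaced by direct method calls, and an in-memory SQLite database stands in for the server's database.

```python
import sqlite3


class TaskFarmSketch:
    """Toy stand-in for the server side (hypothetical, not Taskfarm's API).

    It hands out tasks one at a time and records their progress in a database.
    """

    def __init__(self, num_tasks):
        # Assumption: an in-memory SQLite table is enough to model task state.
        self.db = sqlite3.connect(":memory:")
        self.db.execute(
            "CREATE TABLE tasks (id INTEGER PRIMARY KEY, status TEXT, percent REAL)"
        )
        self.db.executemany(
            "INSERT INTO tasks VALUES (?, 'waiting', 0.0)",
            [(i,) for i in range(num_tasks)],
        )
        self.db.commit()

    def request_task(self):
        """Return the id of the next waiting task, or None if none are left."""
        row = self.db.execute(
            "SELECT id FROM tasks WHERE status = 'waiting' ORDER BY id LIMIT 1"
        ).fetchone()
        if row is None:
            return None
        self.db.execute(
            "UPDATE tasks SET status = 'computing' WHERE id = ?", (row[0],)
        )
        self.db.commit()
        return row[0]

    def update_task(self, task_id, percent):
        """Record progress; a task at 100 percent is marked as done."""
        status = "done" if percent >= 100.0 else "computing"
        self.db.execute(
            "UPDATE tasks SET status = ?, percent = ? WHERE id = ?",
            (status, percent, task_id),
        )
        self.db.commit()

    def count(self, status):
        """Count the tasks currently in the given status."""
        return self.db.execute(
            "SELECT COUNT(*) FROM tasks WHERE status = ?", (status,)
        ).fetchone()[0]


def run_worker(farm, work):
    """Client loop: request a task, process it, report completion, repeat."""
    processed = []
    while (task_id := farm.request_task()) is not None:
        work(task_id)  # variable-cost computation would happen here
        farm.update_task(task_id, 100.0)
        processed.append(task_id)
    return processed
```

Because each worker pulls a new task only when it has finished the previous one, tasks with variable compute cost balance themselves across workers without any up-front scheduling.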

Keywords