Data in Brief (Jun 2023)

A dataset for fault detection and diagnosis of an air handling unit from a real industrial facility

  • Michael Ahern,
  • Dominic T.J. O'Sullivan,
  • Ken Bruton

Journal volume & issue
Vol. 48
p. 109208

Abstract

Read online

This dataset was collected for the purpose of applying fault detection and diagnosis (FDD) techniques to real data from an industrial facility. The data for an air handling unit (AHU) is extracted from a building management system (BMS) and aligned with the Project Haystack naming convention. This dataset differs from other publicly available datasets in three main ways. Firstly, the dataset does not contain fault detection ground truth. The lack of labelled datasets in the industrial setting is a significant limitation to the application of FDD techniques found in the literature. Secondly, unlike other publicly available datasets that typically record values every 1 min or 5 min, this dataset captures measurements at a lower frequency of every 15 min, which is due to data storage constraints. Thirdly, the dataset contains a myriad of data issues. For example, there are missing features, missing time intervals, and inaccurate data. Therefore, we hope this dataset will encourage the development of robust FDD techniques that are more suitable for real world applications.

Keywords