Future Internet (May 2024)

Enhancing Sensor Data Imputation: OWA-Based Model Aggregation for Missing Values

  • Muthana Al-Amidie,
  • Laith Alzubaidi,
  • Muhammad Aminul Islam,
  • Derek T. Anderson

DOI
https://doi.org/10.3390/fi16060193
Journal volume & issue
Vol. 16, no. 6
p. 193

Abstract

Read online

Due to some limitations in the data collection process caused either by human-related errors or by collection electronics, sensors, and network connectivity-related errors, the important values at some points could be lost. However, a complete dataset is required for the desired performance of the subsequent applications in various fields like engineering, data science, statistics, etc. An efficient data imputation technique is desired to fill in the missing data values to achieve completeness within the dataset. The fuzzy integral is considered one of the most powerful techniques for multi-source information fusion. It has a wide range of applications in many real-world decision-making problems that often require decisions to be made with partially observable/available information. To address this problem, algorithms impute missing data with a representative sample or by predicting the most likely value given the observed data. In this article, we take a completely different approach to the information fusion task in the ordered weighted averaging (OWA) context. In particular, we empirically explore for different distributions how the weights/importance of the missing sources are distributed across the observed inputs/sources. The experimental results on the synthetic and real-world datasets demonstrate the applicability of the proposed methods.

Keywords