Enhancing Sensor Data Imputation: OWA-Based Model Aggregation for Missing Values

Muthana Al-Amidie; Laith Alzubaidi; Muhammad Aminul Islam; Derek T. Anderson

doi:10.3390/fi16060193

Future Internet (May 2024)

Enhancing Sensor Data Imputation: OWA-Based Model Aggregation for Missing Values

Muthana Al-Amidie,
Laith Alzubaidi,
Muhammad Aminul Islam,
Derek T. Anderson

Affiliations

Muthana Al-Amidie: Department of Electrical Engineering, University of Babylon, Babylon, Hilla 51001, Iraq
Laith Alzubaidi: School of Mechanical, Medical, and Process Engineering, Queensland University of Technology, Brisbane, QLD 4000, Australia
Muhammad Aminul Islam: Department of Electrical and Computer Engineering & Computer Science, University of New Haven, West Haven, CT 06516, USA
Derek T. Anderson: Department of Electrical Engineering & Computer Science, University of Missouri, Columbia, MO 65211, USA

DOI: https://doi.org/10.3390/fi16060193
Journal volume & issue: Vol. 16, no. 6
p. 193

Abstract

Read online

Due to some limitations in the data collection process caused either by human-related errors or by collection electronics, sensors, and network connectivity-related errors, the important values at some points could be lost. However, a complete dataset is required for the desired performance of the subsequent applications in various fields like engineering, data science, statistics, etc. An efficient data imputation technique is desired to fill in the missing data values to achieve completeness within the dataset. The fuzzy integral is considered one of the most powerful techniques for multi-source information fusion. It has a wide range of applications in many real-world decision-making problems that often require decisions to be made with partially observable/available information. To address this problem, algorithms impute missing data with a representative sample or by predicting the most likely value given the observed data. In this article, we take a completely different approach to the information fusion task in the ordered weighted averaging (OWA) context. In particular, we empirically explore for different distributions how the weights/importance of the missing sources are distributed across the observed inputs/sources. The experimental results on the synthetic and real-world datasets demonstrate the applicability of the proposed methods.

Published in Future Internet

ISSN: 1999-5903 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/futureinternet/

About the journal

Abstract

Keywords