Imputation for Repeated Bounded Outcome Data: Statistical and Machine-Learning Approaches

Urko Aguirre-Larracoechea; Cruz E. Borges

doi:10.3390/math9172081

Mathematics (Aug 2021)

Imputation for Repeated Bounded Outcome Data: Statistical and Machine-Learning Approaches

Urko Aguirre-Larracoechea,
Cruz E. Borges

Affiliations

Urko Aguirre-Larracoechea: Research Unit, Osakidetza Basque Health Service, Barrualde-Galdakao Integrated Health Organisation, Galdakao-Usansolo Hospital, 48960 Galdakao, Spain
Cruz E. Borges: Deusto Institute of Technology, Faculty of Engineering, University of Deusto, 48007 Bilbao, Spain

DOI: https://doi.org/10.3390/math9172081
Journal volume & issue: Vol. 9, no. 17
p. 2081

Abstract

Read online

Real-life data are bounded and heavy-tailed variables. Zero-one-inflated beta (ZOIB) regression is used for modelling them. There are no appropriate methods to address the problem of missing data in repeated bounded outcomes. We developed an imputation method using ZOIB (i-ZOIB) and compared its performance with those of the naïve and machine-learning methods, using different distribution shapes and settings designed in the simulation study. The performance was measured employing the absolute error (MAE), root-mean-square-error (RMSE) and the unscaled mean bounded relative absolute error (UMBRAE) methods. The results varied depending on the missingness rate and mechanism. The i-ZOIB and the machine-learning ANN, SVR and RF methods showed the best performance.

Published in Mathematics

ISSN: 2227-7390 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics
Website: http://www.mdpi.com/journal/mathematics

About the journal

Abstract

Keywords