Journal of Big Data (Nov 2024)

The application of adaptive group LASSO imputation method with missing values in personal income compositional data

  • Ying Tian,
  • Majid Khan Majahar Ali,
  • Lili Wu

DOI
https://doi.org/10.1186/s40537-024-01009-1
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 20

Abstract

Read online

Abstract From social and economic perspectives, compositional data represent the proportions of various components within a whole, carrying non-negative values and providing only relative information. However, in many circumstances, there are often a significant number of missing values in datasets. Due to the complexity caused by these missing values, traditional estimation methods are ineffective. In this paper, an adaptive group LASSO-based imputation method is proposed for compositional data, consolidating the advantages of group LASSO and adaptive LASSO analysis techniques. Considering the impact of outliers on the accuracy of estimation, both simulation and case analysis are conducted to compare the proposed algorithm against four existing methods. The experimental results demonstrate that the proposed adaptive group LASSO method produces a better imputation performance at comparable missing rates.

Keywords