IEEE Access (Jan 2023)
RDIS: Random Drop Imputation With Self-Training for Incomplete Time Series Data
Abstract
Time-series data with missing values are a common occurrence in various fields, including healthcare, meteorology, and robotics. The process of imputation aims to fill in the missing values with valid values. Most imputation methods implicitly train models due to the presence of missing values. In this paper, we propose Random Drop Imputation with Self-training (RDIS), a novel training method for time-series data imputation models. In RDIS, we generate extra missing values by applying a random drop to the observed values in incomplete data. We can explicitly train the imputation models by filling in the missing values. Moreover, we utilize self-training with pseudo values to exploit the original missing values. To enhance the quality of pseudo values, we set a threshold and filter them based on entropy calculation. To evaluate the effectiveness of RDIS for imputing time-series data, we test it across several imputation models and obtain competitive results on three real-world datasets.
Keywords