SoftwareX (Jul 2020)

MVTS-Data Toolkit: A Python package for preprocessing multivariate time series data

  • Azim Ahmadzadeh
  • Kankana Sinha
  • Berkay Aydin
  • Rafal A. Angryk

Journal volume & issue
Vol. 12
p. 100518

Abstract

We developed a domain-independent Python package that facilitates the preprocessing routines required to prepare any multi-class, multivariate time series data. It provides a comprehensive set of 48 statistical features for extracting the important characteristics of time series. Feature extraction is automated, can run either sequentially or in parallel, and is supplemented with an extensive summary report about the data. Other modules offer different data normalization methods as well as imputation. To address the class-imbalance issue, which is often intrinsic to real-world datasets, a set of generic but user-friendly sampling methods is also provided.
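
The preprocessing steps named in the abstract (imputation, statistical feature extraction, normalization, and class-balancing sampling) can be illustrated with a minimal, standard-library-only sketch. This is not the package's API: every function name below is hypothetical, the four statistics stand in for the package's 48 features, and linear interpolation, min-max scaling, and random undersampling are assumed here only as common representatives of each step.

```python
import random
import statistics
from collections import defaultdict


def impute_linear(series):
    """Fill None gaps by linear interpolation; edge gaps copy the nearest value."""
    known = [i for i, v in enumerate(series) if v is not None]
    if not known:
        raise ValueError("series has no observed values")
    filled = list(series)
    for i, v in enumerate(series):
        if v is not None:
            continue
        left = max((k for k in known if k < i), default=None)
        right = min((k for k in known if k > i), default=None)
        if left is None:
            filled[i] = series[right]
        elif right is None:
            filled[i] = series[left]
        else:
            w = (i - left) / (right - left)
            filled[i] = series[left] + w * (series[right] - series[left])
    return filled


def extract_features(mvts):
    """Map {parameter: series} to a flat {parameter_statistic: value} vector."""
    feats = {}
    for name, series in mvts.items():
        s = impute_linear(series)
        feats[f"{name}_mean"] = statistics.fmean(s)
        feats[f"{name}_stdev"] = statistics.pstdev(s)
        feats[f"{name}_min"] = min(s)
        feats[f"{name}_max"] = max(s)
    return feats


def min_max_normalize(rows, key):
    """Rescale one feature column to [0, 1] in place (constant columns become 0)."""
    vals = [r[key] for r in rows]
    lo, hi = min(vals), max(vals)
    for r in rows:
        r[key] = 0.0 if hi == lo else (r[key] - lo) / (hi - lo)


def undersample(rows, labels, seed=0):
    """Randomly drop rows so that every class matches the smallest class size."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    n = min(len(v) for v in by_class.values())
    return {label: rng.sample(v, n) for label, v in by_class.items()}
```

In this sketch each multivariate time series becomes one feature row, so normalization and sampling operate on the extracted features rather than on the raw series; whether the package orders the steps this way is not stated in the abstract.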

Keywords