A deep Koopman operator‐based modelling approach for long‐term prediction of dynamics with pixel‐level measurements

Yongqian Xiao; Zixin Tang; Xin Xu; Xinglong Zhang; Yifei Shi

doi:10.1049/cit2.12149

CAAI Transactions on Intelligence Technology (Feb 2024)

A deep Koopman operator‐based modelling approach for long‐term prediction of dynamics with pixel‐level measurements

Yongqian Xiao,
Zixin Tang,
Xin Xu,
Xinglong Zhang,
Yifei Shi

Affiliations

Yongqian Xiao: College of Intelligence Science and Technology National University of Defense Technology Changsha China
Zixin Tang: College of Intelligence Science and Technology National University of Defense Technology Changsha China
Xin Xu: College of Intelligence Science and Technology National University of Defense Technology Changsha China
Xinglong Zhang: College of Intelligence Science and Technology National University of Defense Technology Changsha China
Yifei Shi: College of Intelligence Science and Technology National University of Defense Technology Changsha China

DOI: https://doi.org/10.1049/cit2.12149
Journal volume & issue: Vol. 9, no. 1
pp. 178 – 196

Abstract

Read online

Abstract Although previous studies have made some clear leap in learning latent dynamics from high‐dimensional representations, the performances in terms of accuracy and inference time of long‐term model prediction still need to be improved. In this study, a deep convolutional network based on the Koopman operator (CKNet) is proposed to model non‐linear systems with pixel‐level measurements for long‐term prediction. CKNet adopts an autoencoder network architecture, consisting of an encoder to generate latent states and a linear dynamical model (i.e., the Koopman operator) which evolves in the latent state space spanned by the encoder. The decoder is used to recover images from latent states. According to a multi‐step ahead prediction loss function, the system matrices for approximating the Koopman operator are trained synchronously with the autoencoder in a mini‐batch manner. In this manner, gradients can be synchronously transmitted to both the system matrices and the autoencoder to help the encoder self‐adaptively tune the latent state space in the training process, and the resulting model is time‐invariant in the latent space. Therefore, the proposed CKNet has the advantages of less inference time and high accuracy for long‐term prediction. Experiments are performed on OpenAI Gym and Mujoco environments, including two and four non‐linear forced dynamical systems with continuous action spaces. The experimental results show that CKNet has strong long‐term prediction capabilities with sufficient precision.

Published in CAAI Transactions on Intelligence Technology

ISSN: 2468-2322 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Language and Literature: Philology. Linguistics: Computational linguistics. Natural language processing; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://ietresearch.onlinelibrary.wiley.com/journal/24682322

About the journal

Abstract

Keywords