Physics and chemistry from parsimonious representations: image analysis via invariant variational autoencoders

Mani Valleti; Maxim Ziatdinov; Yongtao Liu; Sergei V. Kalinin

doi:10.1038/s41524-024-01250-5

npj Computational Materials (Aug 2024)

Affiliations

Mani Valleti: Bredesen Center for Interdisciplinary Research, University of Tennessee
Maxim Ziatdinov: Physical Sciences Division, Pacific Northwest National Laboratory
Yongtao Liu: Center for Nanophase Materials Sciences, Oak Ridge National Laboratory
Sergei V. Kalinin: Physical Sciences Division, Pacific Northwest National Laboratory

DOI: https://doi.org/10.1038/s41524-024-01250-5
Journal volume & issue: Vol. 10, no. 1
pp. 1 – 19

Abstract


Electron, optical, and scanning probe microscopy methods are generating an ever-increasing volume of image data containing information on atomic and mesoscale structures and functionalities. This necessitates the development of machine learning methods for the discovery of physical and chemical phenomena from the data, such as manifestations of symmetry-breaking phenomena in electron and scanning tunneling microscopy images, or the variability of nanoparticles. Variational autoencoders (VAEs) are emerging as a powerful paradigm for unsupervised data analysis, making it possible to disentangle the factors of variability and discover optimal parsimonious representations. Here, we summarize recent developments in VAEs, covering the basic principles and intuition behind them. Invariant VAEs are introduced as an approach to accommodate the scale and translation invariances present in imaging data and to separate known factors of variation from the ones to be discovered. We further describe the opportunities enabled by control over the VAE architecture, including conditional, semi-supervised, and joint VAEs. Several case studies of VAE applications to toy models and experimental datasets in scanning transmission electron microscopy are discussed, emphasizing the deep connection between VAEs and basic physical principles. Python codes and datasets discussed in this article are available at https://github.com/saimani5/VAE-tutorials and can be used by researchers as an application guide when applying these methods to their own datasets.

Published in npj Computational Materials

ISSN: 2057-3960 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Materials of engineering and construction. Mechanics of materials; Science: Mathematics: Instruments and machines: Electronic computers. Computer science: Computer software
Website: https://www.nature.com/npjcompumats/
