Engineering (Mar 2020)

A Geometric Understanding of Deep Learning

  • Na Lei,
  • Dongsheng An,
  • Yang Guo,
  • Kehua Su,
  • Shixia Liu,
  • Zhongxuan Luo,
  • Shing-Tung Yau,
  • Xianfeng Gu

Journal volume & issue
Vol. 6, no. 3
pp. 361 – 374

Abstract

Read online

This work introduces an optimal transportation (OT) view of generative adversarial networks (GANs). Natural datasets have intrinsic patterns, which can be summarized as the manifold distribution principle: the distribution of a class of data is close to a low-dimensional manifold. GANs mainly accomplish two tasks: manifold learning and probability distribution transformation. The latter can be carried out using the classical OT method. From the OT perspective, the generator computes the OT map, while the discriminator computes the Wasserstein distance between the generated data distribution and the real data distribution; both can be reduced to a convex geometric optimization process. Furthermore, OT theory discovers the intrinsic collaborative—instead of competitive—relation between the generator and the discriminator, and the fundamental reason for mode collapse. We also propose a novel generative model, which uses an autoencoder (AE) for manifold learning and OT map for probability distribution transformation. This AE–OT model improves the theoretical rigor and transparency, as well as the computational stability and efficiency; in particular, it eliminates the mode collapse. The experimental results validate our hypothesis, and demonstrate the advantages of our proposed model. Keywords: Generative, Adversarial, Deep learning, Optimal transportation, Mode collapse