DeepPrimitive: Image decomposition by layered primitive detection

Jiahui Huang; Jun Gao; Vignesh Ganapathi-Subramanian; Hao Su; Yin Liu; Chengcheng Tang; Leonidas J. Guibas

doi:10.1007/s41095-018-0128-6

Computational Visual Media (Dec 2018)

Affiliations

Jiahui Huang: Tsinghua University
Jun Gao: Computer Science Department, University of Toronto
Vignesh Ganapathi-Subramanian: Stanford University
Hao Su: University of California San Diego
Yin Liu: University of Wisconsin-Madison
Chengcheng Tang: Stanford University
Leonidas J. Guibas: Stanford University

DOI: https://doi.org/10.1007/s41095-018-0128-6
Journal volume & issue: Vol. 4, no. 4, pp. 385–397

Abstract

The perception of the visual world through basic building blocks, such as cubes, spheres, and cones, gives human beings a parsimonious understanding of the visual world. Thus, efforts to find primitive-based geometric interpretations of visual data date back to 1970s studies of visual media. However, due to the difficulty of primitive fitting in the pre-deep-learning era, this research approach faded from the main stage, and the vision community turned primarily to semantic image understanding. In this paper, we revisit the classical problem of building geometric interpretations of images, using supervised deep learning tools. We build a framework to detect primitives from images in a layered manner by modifying the YOLO network; an RNN with a novel loss function is then used to equip this network with the capability to predict primitives with a variable number of parameters. We compare our pipeline to traditional and other baseline learning methods, demonstrating that our layered detection model achieves higher accuracy and produces better reconstructions.

Published in Computational Visual Media

ISSN: 2096-0433 (Print); 2096-0662 (Online)
Publisher: SpringerOpen
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.springer.com/41095
