What and where: A context-based recommendation system for object insertion

Song-Hai Zhang; Zheng-Ping Zhou; Bin Liu; Xi Dong; Peter Hall

doi:10.1007/s41095-020-0158-8

Computational Visual Media (Apr 2020)

What and where: A context-based recommendation system for object insertion

Song-Hai Zhang,
Zheng-Ping Zhou,
Bin Liu,
Xi Dong,
Peter Hall

Affiliations

Song-Hai Zhang: Tsinghua University
Zheng-Ping Zhou: Tsinghua University
Bin Liu: Tsinghua University
Xi Dong: Tsinghua University
Peter Hall: Department of Computer Science Media Technology Research Center, University of Bath

DOI: https://doi.org/10.1007/s41095-020-0158-8
Journal volume & issue: Vol. 6, no. 1
pp. 79 – 93

Abstract

Read online

Abstract We propose a novel problem revolving around two tasks: (i) given a scene, recommend objects to insert, and (ii) given an object category, retrieve suitable background scenes. A bounding box for the inserted object is predicted in both tasks, which helps downstream applications such as semiautomated advertising and video composition. The major challenge lies in the fact that the target object is neither present nor localized in the input, and furthermore, available datasets only provide scenes with existing objects. To tackle this problem, we build an unsupervised algorithm based on object-level contexts, which explicitly models the joint probability distribution of object categories and bounding boxes using a Gaussian mixture model. Experiments on our own annotated test set demonstrate that our system outperforms existing baselines on all sub-tasks, and does so using a unified framework. Future extensions and applications are suggested.

Published in Computational Visual Media

ISSN: 2096-0433 (Print); 2096-0662 (Online)
Publisher: SpringerOpen
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.springer.com/41095

About the journal

Abstract

Keywords