First-person reading activity recognition by deep learning with synthetically generated images

Yuta Segawa; Kazuhiko Kawamoto; Kazushi Okamoto

doi:10.1186/s13640-018-0272-z

EURASIP Journal on Image and Video Processing (May 2018)

First-person reading activity recognition by deep learning with synthetically generated images

Yuta Segawa,
Kazuhiko Kawamoto,
Kazushi Okamoto

Affiliations

Yuta Segawa: NIFTY Corporation
Kazuhiko Kawamoto: Graduate School of Engineering, Chiba University
Kazushi Okamoto: Graduate School of Informatics and Engineering, The University of Electro-Communications

DOI: https://doi.org/10.1186/s13640-018-0272-z
Journal volume & issue: Vol. 2018, no. 1
pp. 1 – 13

Abstract

Read online

Abstract We propose a vision-based method for recognizing first-person reading activity with deep learning. For the success of deep learning, it is well known that a large amount of training data plays a vital role. Unlike image classification, there are less publicly available datasets for reading activity recognition, and the collection of book images might cause copyright trouble. In this paper, we develop a synthetic approach for generating positive training images. Our approach synthesizes computer-generated images and real backround images. In experiments, we show that this synthesis is effective in combination with pre-trained deep convolutional neural networks and also our trained neural network outperforms other baselines.

Published in EURASIP Journal on Image and Video Processing

ISSN: 1687-5176 (Print); 1687-5281 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics
Website: https://jivp-eurasipjournals.springeropen.com

About the journal

Abstract

Keywords