Coarse-to-fine online learning for hand segmentation in egocentric video

Ying Zhao; Zhiwei Luo; Changqin Quan

doi:10.1186/s13640-018-0262-1

EURASIP Journal on Image and Video Processing (Apr 2018)

Affiliations

Ying Zhao: Ricoh Software Research Center (Beijing) Co., Ltd
Zhiwei Luo: Graduate School of System Informatics, Kobe University
Changqin Quan: Graduate School of System Informatics, Kobe University

Journal volume & issue: Vol. 2018, no. 1, pp. 1 – 12

Abstract

Hand segmentation is one of the most fundamental and crucial steps in egocentric human-computer interaction. The egocentric view brings new challenges to hand segmentation, such as unpredictable environmental conditions. The performance of traditional hand segmentation methods depends on abundant manually labeled training data. However, these approaches neglect the user-specific context and therefore do not fully capture the properties of egocentric human-computer interaction: it is only necessary to build a personalized hand model of the active user. Based on this observation, we propose an online-learning hand segmentation approach that requires no manually labeled training data. Our approach consists of top-down classifications and bottom-up optimizations. More specifically, we divide the segmentation task into three parts: a frame-level hand detection, which detects the presence of the interactive hand using motion saliency and initializes hand masks for online learning; a superpixel-level hand classification, which coarsely segments hand regions, from which stable samples are selected for the next level; and a pixel-level hand classification, which produces a fine-grained hand segmentation. Based on the pixel-level classification result, we update the hand appearance model and optimize the upper-layer classifier and detector. This online-learning strategy makes our approach robust to varying illumination conditions and hand appearances. Experimental results demonstrate the robustness of our approach.
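
The control flow of the three levels can be illustrated with a minimal sketch. This is not the authors' implementation, and every function name, parameter, and threshold below is hypothetical. It assumes plain frame differencing as a stand-in for motion saliency, fixed square blocks as a stand-in for superpixels, and a single mean-color model shared by the coarse and fine classifiers; the abstract does not specify any of these choices. For brevity, the frame-level detector here only runs until the model has been initialized.

```python
import numpy as np

def detect_hand_frame(prev, curr, motion_thresh=0.1, min_fraction=0.02):
    """Frame level. ASSUMPTION: frame differencing stands in for motion saliency."""
    diff = np.abs(curr.astype(float) - prev.astype(float)).mean(axis=2) / 255.0
    mask = diff > motion_thresh
    # A hand is "present" if enough of the frame moved; the mask seeds the model.
    return mask.mean() > min_fraction, mask

def classify_blocks(frame, mean_color, radius, block=8):
    """Coarse level. ASSUMPTION: square blocks stand in for superpixels."""
    h, w, _ = frame.shape
    coarse = np.zeros((h, w), dtype=bool)
    for y in range(0, h, block):
        for x in range(0, w, block):
            patch = frame[y:y + block, x:x + block].reshape(-1, 3).astype(float)
            if np.linalg.norm(patch.mean(axis=0) - mean_color) < radius:
                coarse[y:y + block, x:x + block] = True
    return coarse

def classify_pixels(frame, coarse, mean_color, radius):
    """Fine level: per-pixel color test restricted to the coarse hand regions."""
    dist = np.linalg.norm(frame.astype(float) - mean_color, axis=2)
    return coarse & (dist < radius)

def update_model(frame, fine, mean_color, rate=0.2):
    """Online update: blend the model toward the colors of the fine mask."""
    if not fine.any():
        return mean_color
    return (1 - rate) * mean_color + rate * frame[fine].astype(float).mean(axis=0)

def segment_sequence(frames, radius=60.0):
    """Run the coarse-to-fine loop over a list of HxWx3 uint8 frames."""
    mean_color = None  # hand appearance model, learned without labels
    masks = []
    prev = frames[0]
    for curr in frames[1:]:
        if mean_color is None:
            present, motion = detect_hand_frame(prev, curr)
            if present:
                mean_color = curr[motion].astype(float).mean(axis=0)
        if mean_color is None:
            masks.append(np.zeros(curr.shape[:2], dtype=bool))
        else:
            coarse = classify_blocks(curr, mean_color, radius)
            fine = classify_pixels(curr, coarse, mean_color, radius)
            mean_color = update_model(curr, fine, mean_color)
            masks.append(fine)
        prev = curr
    return masks
```

Calling `segment_sequence(frames)` returns one boolean mask per frame after the first. Because the fine mask feeds back into the shared color model, a gradual change in hand brightness shifts the model along with it, which is the kind of adaptation to illumination that the abstract attributes to online learning.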

Published in EURASIP Journal on Image and Video Processing

ISSN: 1687-5176 (Print); 1687-5281 (Online)
Publisher: SpringerOpen
Country of publisher: United Kingdom
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering: Electronics
Website: https://jivp-eurasipjournals.springeropen.com
