Sensors (Jul 2020)

Toward Joint Acquisition-Annotation of Images with Egocentric Devices for a Lower-Cost Machine Learning Application to Apple Detection

  • Salma Samiei,
  • Pejman Rasti,
  • Paul Richard,
  • Gilles Galopin,
  • David Rousseau

DOI
https://doi.org/10.3390/s20154173
Journal volume & issue
Vol. 20, no. 15
p. 4173

Abstract

Since most computer vision approaches are now driven by machine learning, the current bottleneck is the annotation of images. This time-consuming task is usually performed manually after the acquisition of images. In this article, we assess the value of various egocentric vision approaches for performing joint acquisition and automatic image annotation, rather than the conventional two-step process of acquisition followed by manual annotation. The approach is illustrated with apple detection in challenging field conditions. We demonstrate that eye-tracking systems can deliver high performance in automatic apple segmentation (Dice of 0.85), apple counting (an 88% probability of good detection and a 0.09 true-negative rate), and apple localization (a shift error of less than 3 pixels). This is obtained by simply applying the areas of interest captured by the egocentric devices to standard unsupervised image segmentation. We especially stress the time savings of using such eye-tracking devices on head-mounted systems to jointly perform image acquisition and automatic annotation: a gain of more than 10-fold compared with classical image acquisition followed by manual image annotation is demonstrated.
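
To make the idea concrete, here is a minimal sketch (not the authors' exact pipeline) of gaze-seeded unsupervised segmentation: a window centred on a hypothetical eye-tracking fixation seeds OpenCV's GrabCut, and the resulting mask is scored against a reference with the Dice coefficient. The file names, fixation coordinates, and window size are illustrative assumptions.

```python
# Sketch: seed an unsupervised segmentation (GrabCut) with a gaze fixation,
# then evaluate the mask with the Dice coefficient. Paths and parameters are
# hypothetical placeholders, not values from the paper.
import cv2
import numpy as np

def segment_from_fixation(image, fixation, half_size=80, iters=5):
    """Run GrabCut inside a square window around a gaze fixation (x, y)."""
    h, w = image.shape[:2]
    x, y = fixation
    x0, y0 = max(x - half_size, 0), max(y - half_size, 0)
    x1, y1 = min(x + half_size, w - 1), min(y + half_size, h - 1)
    mask = np.zeros((h, w), np.uint8)
    bgd = np.zeros((1, 65), np.float64)
    fgd = np.zeros((1, 65), np.float64)
    rect = (x0, y0, x1 - x0, y1 - y0)
    cv2.grabCut(image, mask, rect, bgd, fgd, iters, cv2.GC_INIT_WITH_RECT)
    # Keep definite and probable foreground as the object (apple) mask.
    fg = (mask == cv2.GC_FGD) | (mask == cv2.GC_PR_FGD)
    return fg.astype(np.uint8)

def dice(pred, ref):
    """Dice = 2|A ∩ B| / (|A| + |B|); returns 1.0 when both masks are empty."""
    inter = np.logical_and(pred, ref).sum()
    total = pred.sum() + ref.sum()
    return 1.0 if total == 0 else 2.0 * inter / total

if __name__ == "__main__":
    img = cv2.imread("orchard_frame.png")                # hypothetical frame
    gaze = (420, 310)                                    # hypothetical fixation (x, y)
    pred_mask = segment_from_fixation(img, gaze)
    ref_mask = cv2.imread("manual_mask.png", cv2.IMREAD_GRAYSCALE) > 0  # hypothetical reference
    print(f"Dice: {dice(pred_mask, ref_mask):.2f}")
```

In this reading, the gaze fixation replaces the manually drawn region that GrabCut-style methods normally require, which is what allows annotation to happen at acquisition time rather than as a separate manual step.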

Keywords