Deep Instance Segmentation and Visual Servoing to Play Jenga with a Cost-Effective Robotic System

Luca Marchionna; Giulio Pugliese; Mauro Martini; Simone Angarano; Francesco Salvetti; Marcello Chiaberge

doi:10.3390/s23020752

Sensors (Jan 2023)

Affiliations

Luca Marchionna: Department of Electronics and Telecommunications (DET), Politecnico di Torino, 10129 Torino, Italy
Giulio Pugliese: Department of Electronics and Telecommunications (DET), Politecnico di Torino, 10129 Torino, Italy
Mauro Martini: Department of Electronics and Telecommunications (DET), Politecnico di Torino, 10129 Torino, Italy
Simone Angarano: Department of Electronics and Telecommunications (DET), Politecnico di Torino, 10129 Torino, Italy
Francesco Salvetti: Department of Electronics and Telecommunications (DET), Politecnico di Torino, 10129 Torino, Italy
Marcello Chiaberge: Department of Electronics and Telecommunications (DET), Politecnico di Torino, 10129 Torino, Italy

Journal volume & issue: Vol. 23, no. 2, p. 752

Abstract

The game of Jenga is a benchmark used for developing innovative manipulation solutions for complex tasks. Indeed, it encourages the study of novel robotics methods to successfully extract blocks from a tower. A Jenga game involves many traits of complex industrial and surgical manipulation tasks, requiring a multi-step strategy, the combination of visual and tactile data, and the highly precise motion of a robotic arm to perform a single block extraction. In this work, we propose a novel, cost-effective architecture for playing Jenga with e.DO, a 6DOF anthropomorphic manipulator manufactured by Comau, a standard depth camera, and an inexpensive monodirectional force sensor. Our solution focuses on a visual-based control strategy to accurately align the end-effector with the desired block, enabling block extraction by pushing. To this end, we trained an instance segmentation deep learning model on a custom synthetic dataset to segment each piece of the Jenga tower, allowing for visual tracking of the desired block’s pose during the motion of the manipulator. We integrated the visual-based strategy with a 1D force sensor to detect whether the block could be safely removed by identifying a force threshold value. Our experiments show that our low-cost solution allows e.DO to precisely reach removable blocks and perform up to 14 consecutive extractions.
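
The force-threshold check mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name, the sample readings, and the threshold value are hypothetical, and the abstract does not state the actual threshold or how the sensor is sampled. The sketch only assumes that a block counts as safely removable when the force measured along the push direction never reaches a chosen limit.

```python
from typing import Iterable


def block_is_removable(force_readings: Iterable[float], threshold: float) -> bool:
    """Return True if every force sample stays below the threshold.

    force_readings: samples from a 1D force sensor during the push
                    (hypothetical units and sampling, not from the paper).
    threshold:      illustrative force limit; the real value would have to
                    be identified experimentally.
    """
    return all(sample < threshold for sample in force_readings)


# Hypothetical readings: a loose block keeps the force low,
# a load-bearing block pushes it past the limit.
loose_block = [0.10, 0.15, 0.12]
stuck_block = [0.10, 0.40, 0.90]
```

In such a scheme, a robot would abort the push and pick another block as soon as a sample reaches the limit, rather than waiting for the full sequence.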

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors
