A Novel Grasp Detection Algorithm with Multi-Target Semantic Segmentation for a Robot to Manipulate Cluttered Objects

Xungao Zhong; Yijun Chen; Jiaguo Luo; Chaoquan Shi; Huosheng Hu

doi:10.3390/machines12080506

Machines (Jul 2024)

A Novel Grasp Detection Algorithm with Multi-Target Semantic Segmentation for a Robot to Manipulate Cluttered Objects

Xungao Zhong,
Yijun Chen,
Jiaguo Luo,
Chaoquan Shi,
Huosheng Hu

Affiliations

Xungao Zhong: School of Electrical Engineering and Automation, Xiamen University of Technology, Xiamen 361024, China
Yijun Chen: School of Electrical Engineering and Automation, Xiamen University of Technology, Xiamen 361024, China
Jiaguo Luo: School of Electrical Engineering and Automation, Xiamen University of Technology, Xiamen 361024, China
Chaoquan Shi: School of Electrical Engineering and Automation, Xiamen University of Technology, Xiamen 361024, China
Huosheng Hu: School of Computer Science and Electronic Engineering, University of Essex, Colchester CO4 3SQ, UK

DOI: https://doi.org/10.3390/machines12080506
Journal volume & issue: Vol. 12, no. 8
p. 506

Abstract

Read online

Objects in cluttered environments may have similar sizes and shapes, which remains a huge challenge for robot grasping manipulation. The existing segmentation methods, such as Mask R-CNN and Yolo-v8, tend to lose the shape details of objects when dealing with messy scenes, and this loss of detail limits the grasp performance of robots in complex environments. This paper proposes a high-performance grasp detection algorithm with a multi-target semantic segmentation model, which can effectively improve a robot’s grasp success rate in cluttered environments. The algorithm consists of two cascades: Semantic Segmentation and Grasp Detection modules (SS-GD), in which the backbone network of the semantic segmentation module is developed by using the state-of-the-art Swin Transformer structure. It can extract the detailed features of objects in cluttered environments and enable a robot to understand the position and shape of the candidate object. To construct the grasp schema SS-GD focused on important vision features, a grasp detection module is designed based on the Squeeze-and-Excitation (SE) attention mechanism, to predict the corresponding grasp configuration accurately. The grasp detection experiments were conducted on an actual UR5 robot platform to verify the robustness and generalization of the proposed SS-GD method in cluttered environments. A best grasp success rate of 91.7% was achieved for cluttered multi-target workspaces.

Published in Machines

ISSN: 2075-1702 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Mechanical engineering and machinery
Website: http://www.mdpi.com/journal/machines

About the journal

Abstract

Keywords