CM-UNet: ConvMixer UNet for Segmentation of Unknown Objects in Cluttered Scenes

Xiaoqian Huang; Rana Azzam; Sajid Javed; Dongming Gan; Lakmal Seneviratne; Abdelqader Abusafieh; Yahya Zweiri

doi:10.1109/ACCESS.2022.3224588

IEEE Access (Jan 2022)

CM-UNet: ConvMixer UNet for Segmentation of Unknown Objects in Cluttered Scenes

Xiaoqian Huang,
Rana Azzam,
Sajid Javed,
Dongming Gan,
Lakmal Seneviratne,
Abdelqader Abusafieh,
Yahya Zweiri

Affiliations

Xiaoqian Huang: ORCiD; Advanced Research and Innovation Center (ARIC), Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates
Rana Azzam: ORCiD; Khalifa University Center for Autonomous Robotic Systems (KUCARS), Khalifa University, Abu Dhabi, United Arab Emirates
Sajid Javed: ORCiD; Khalifa University Center for Autonomous Robotic Systems (KUCARS), Khalifa University, Abu Dhabi, United Arab Emirates
Dongming Gan: ORCiD; School of Engineering Technology, Purdue University, West Lafayette, IN, USA
Lakmal Seneviratne: ORCiD; Khalifa University Center for Autonomous Robotic Systems (KUCARS), Khalifa University, Abu Dhabi, United Arab Emirates
Abdelqader Abusafieh: SVP Technology and Advanced Materials, Strata Manufacturing PJSC, Al Ain, United Arab Emirates
Yahya Zweiri: ORCiD; Advanced Research and Innovation Center (ARIC), Khalifa University of Science and Technology, Abu Dhabi, United Arab Emirates

DOI: https://doi.org/10.1109/ACCESS.2022.3224588
Journal volume & issue: Vol. 10
pp. 123622 – 123633

Abstract

Read online

Object segmentation in cluttered environments is a fundamental pre-processing step for many perception-related tasks such as vision-based robotic grasping. Most of the existing object segmentation methods are incapable of precisely segmenting unknown objects, particularly in scenarios exhibiting significant occlusion. In this paper, we propose a novel approach for refining the segmentation of unknown objects in cluttered scenes. More specifically, a ConvMixer-based UNet model is designed to enhance the segmentation mask and boundary of unknown objects appearing in cluttered scenes. In our model, we leverage the object’s semantic and localization information, which are essential for successful segmentation, using a ConvMixer-based Cross Fusion (CMCF) module. Furthermore, we propose to use patch embedding as a pre-processing step, where input data is rearranged to expedite processing and improve the efficiency of the system. CM-UNet was trained and extensively tested on various challenging publicly available datasets, including unknown objects in un-structured scenes. Thorough evaluations, in terms of segmentation accuracy and processing efficiency, were conducted against state-of-the-art solutions, where the superiority of our model was proven. CM-UNet has shown its ability to efficiently improve the segmentation accuracy of unknown objects in cluttered scenes, even in presence of occlusion.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords