Dynamic and Real-Time Object Detection Based on Deep Learning for Home Service Robots

Yangqing Ye; Xiaolon Ma; Xuanyi Zhou; Guanjun Bao; Weiwei Wan; Shibo Cai

doi:10.3390/s23239482

Sensors (Nov 2023)

Dynamic and Real-Time Object Detection Based on Deep Learning for Home Service Robots

Yangqing Ye,
Xiaolon Ma,
Xuanyi Zhou,
Guanjun Bao,
Weiwei Wan,
Shibo Cai

Affiliations

Yangqing Ye: College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China
Xiaolon Ma: College of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou 310018, China
Xuanyi Zhou: College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China
Guanjun Bao: College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China
Weiwei Wan: Graduate School of Engineering Science, Osaka University, Suita 562-0045, Japan
Shibo Cai: College of Mechanical Engineering, Zhejiang University of Technology, Hangzhou 310023, China

DOI: https://doi.org/10.3390/s23239482
Journal volume & issue: Vol. 23, no. 23
p. 9482

Abstract

Read online

Home service robots operating indoors, such as inside houses and offices, require the real-time and accurate identification and location of target objects to perform service tasks efficiently. However, images captured by visual sensors while in motion states usually contain varying degrees of blurriness, presenting a significant challenge for object detection. In particular, daily life scenes contain small objects like fruits and tableware, which are often occluded, further complicating object recognition and positioning. A dynamic and real-time object detection algorithm is proposed for home service robots. This is composed of an image deblurring algorithm and an object detection algorithm. To improve the clarity of motion-blurred images, the DA-Multi-DCGAN algorithm is proposed. It comprises an embedded dynamic adjustment mechanism and a multimodal multiscale fusion structure based on robot motion and surrounding environmental information, enabling the deblurring processing of images that are captured under different motion states. Compared with DeblurGAN, DA-Multi-DCGAN had a 5.07 improvement in Peak Signal-to-Noise Ratio (PSNR) and a 0.022 improvement in Structural Similarity (SSIM). An AT-LI-YOLO method is proposed for small and occluded object detection. Based on depthwise separable convolution, this method highlights key areas and integrates salient features by embedding the attention module in the AT-Resblock to improve the sensitivity and detection precision of small objects and partially occluded objects. It also employs a lightweight network unit Lightblock to reduce the network’s parameters and computational complexity, which improves its computational efficiency. Compared with YOLOv3, the mean average precision (mAP) of AT-LI-YOLO increased by 3.19%, and the detection precision of small objects, such as apples and oranges and partially occluded objects, increased by 19.12% and 29.52%, respectively. Moreover, the model inference efficiency had a 7 ms reduction in processing time. Based on the typical home activities of older people and children, the dataset Grasp-17 was established for the training and testing of the proposed method. Using the TensorRT neural network inference engine of the developed service robot prototype, the proposed dynamic and real-time object detection algorithm required 29 ms, which meets the real-time requirement of smooth vision.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords