DP-YOLO: Effective Improvement Based on YOLO Detector

Chao Wang; Qijin Wang; Yu Qian; Yating Hu; Ying Xue; Hongqiang Wang

doi:10.3390/app132111676

Applied Sciences (Oct 2023)

DP-YOLO: Effective Improvement Based on YOLO Detector

Chao Wang,
Qijin Wang,
Yu Qian,
Yating Hu,
Ying Xue,
Hongqiang Wang

Affiliations

Chao Wang: School of Electronic and Information Engineering, Anhui Jianzhu University, Hefei 230601, China
Qijin Wang: School of Big Data and Artificial Intelligence, Anhui Xinhua University, Hefei 230088, China
Yu Qian: School of Electronic and Information Engineering, Anhui Jianzhu University, Hefei 230601, China
Yating Hu: School of Electronic and Information Engineering, Anhui Jianzhu University, Hefei 230601, China
Ying Xue: School of Electronic and Information Engineering, Anhui Jianzhu University, Hefei 230601, China
Hongqiang Wang: Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China

DOI: https://doi.org/10.3390/app132111676
Journal volume & issue: Vol. 13, no. 21
p. 11676

Abstract

Read online

YOLOv5 remains one of the most widely used real-time detection models due to its commendable performance in accuracy and generalization. However, compared to more recent detectors, it falls short in label assignment and leaves significant room for optimization. Particularly, recognizing targets with varying shapes and poses proves challenging, and training the detector to grasp such features requires expert verification or collective discussion during the dataset labeling process, especially in domain-specific contexts. While deformable convolutions offer a partial solution, their extensive usage can enhance detection capabilities but at the expense of increased computational effort. We introduce DP-YOLO, an enhanced target detector that efficiently integrates the YOLOv5s backbone network with deformable convolutions. Our approach optimizes the positive sample selection during label assignment, resulting in a more scientifically grounded process. Notably, experiments on the COCO benchmark validate the efficacy of DP-YOLO, which utilizes an image size of [640, 640], achieves a remarkable 41.2 AP, and runs at an impressive 69 fps on an RTX 3090. Comparatively, DP-YOLO outperforms YOLOv5s by 3.2 AP, with only a small increase in parameters and GFLOPSs. These results demonstrate the significant advancements made by our proposed method.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords