Model compression for real-time object detection using rigorous gradation pruning
Defu Yang,
Mahmud Iwan Solihin,
Yawen Zhao,
Bingyu Cai,
Chaoran Chen,
Andika Aji Wijaya,
Chun Kit Ang,
Wei Hong Lim
Affiliations
Defu Yang
Faculty of Engineering, Technology and Built Environment, UCSI University, Kuala Lumpur, Malaysia; Corresponding author
Mahmud Iwan Solihin
Faculty of Engineering, Technology and Built Environment, UCSI University, Kuala Lumpur, Malaysia; Corresponding author
Yawen Zhao
Faculty of Engineering, Technology and Built Environment, UCSI University, Kuala Lumpur, Malaysia
Bingyu Cai
Faculty of Engineering, Technology and Built Environment, UCSI University, Kuala Lumpur, Malaysia; School of Advanced Manufacturing, Shantou Polytechnic, Shantou, China
Chaoran Chen
Faculty of Engineering, Technology and Built Environment, UCSI University, Kuala Lumpur, Malaysia; School of Advanced Manufacturing, Shantou Polytechnic, Shantou, China
Andika Aji Wijaya
Department of Mechanical Engineering, University of Business & Technology, Jeddah, Saudi Arabia; Corresponding author
Chun Kit Ang
Faculty of Engineering, Technology and Built Environment, UCSI University, Kuala Lumpur, Malaysia
Wei Hong Lim
Faculty of Engineering, Technology and Built Environment, UCSI University, Kuala Lumpur, Malaysia
Summary: Achieving lightweight real-time object detection necessitates balancing model compression with detection accuracy, a difficulty exacerbated by low redundancy and uneven contributions from convolutional layers. As an alternative to traditional methods, we propose Rigorous Gradation Pruning (RGP), which uses a desensitized first-order Taylor approximation to assess filter importance, enabling precise pruning of redundant kernels. This approach includes the iterative reassessment of layer significance to protect essential layers, ensuring effective detection performance. We applied RGP to YOLOv8 detectors and tested it on GTSDB, Seaships, and COCO datasets. On GTSDB, RGP achieved 80% compression of YOLOv8n with only a 0.11% drop in mAP0.5, while increasing frames per second (FPS) by 43.84%. For YOLOv8x, RGP achieved 90% compression, a 1.26% mAP0.5:0.95 increase, and a 112.66% FPS boost. Significant compression was also achieved on Seaships and COCO datasets, demonstrating RGP’s robustness across diverse object detection tasks and its potential for advancing efficient, high-speed detection models.