MobVGG: Ensemble technique for birds and drones prediction
Sheikh Muhammad Saqib,
Tehseen Mazhar,
Muhammad Iqbal,
Ahmad Almogren,
Tariq Shahazad,
Ateeq Ur Rehman,
Habib Hamam
Affiliations
Sheikh Muhammad Saqib
Department of Computing and Information Technology, Gomal University, Dera Ismail Khan, 29220, Pakistan
Tehseen Mazhar
Department of Computer Science, School Education Department, Government of Punjab, Layyah 31200, Pakistan; Corresponding author. Department of Computer Science, School Education Department, Government of Punjab, Layyah 31200, Pakistan.
Muhammad Iqbal
Department of Computing and Information Technology, Gomal University, Dera Ismail Khan, 29220, Pakistan
Ahmad Almogren
Chair of Cyber Security, Department of Computer Science, College of Computer and Information Sciences, King Saud University, Riyadh 11633, Saudi Arabia
Tariq Shahazad
Department of Computer Science, COMSATS University Islamabad, Sahiwal Campus, Sahiwal, 57000, Pakistan
Ateeq Ur Rehman
School of Computing, Gachon University, Seongnam, 13120, Republic of Korea; Corresponding author.
Habib Hamam
Faculty of Engineering, Uni de Moncton, Moncton, NB E1A3E9, Canada; School of Electrical Engineering, University of Johannesburg, Johannesburg 2006, South Africa; International Institute of Technology and Management (IITG), Av. Grandes Ecoles, Libreville BP 1989, Gabon; Bridges for Academic Excellence - Spectrum, Tunis, Tunisia
Detection of aerial activities, including drones and birds, has practical implications for automating bird surveys and developing radar systems for aerial object collision detection. Convolutional neural networks (CNNs) have been extensively utilized for image recognition and classification tasks, albeit prior research predominantly focuses on single-class 'drone' classification. However, a gap persists in achieving high accuracy for multi-class classification. To address the limitations of traditional CNNs, such as vanishing gradients and the necessity for numerous layers, this study introduces a novel model termed ''MobVGG.” This model combines the architectures of MobileNetV2 and VGG16 to accurately classify images as either 'bird' or 'drone'. The dataset comprises 4212 images for each category of 'bird' and 'drone'. The stringent methodology was applied for dataset preparation and model training to ensure the reliability of the findings. Comparative analysis with previous research demonstrates that the proposed MobVGG model, trained on both 'bird' and 'drone' images, achieves superior accuracy (96 %) compared to benchmark studies. Our paper targets researchers and graduate students as its primary audience.