Deep learning-powered visual place recognition for enhanced mobile multimedia communication in autonomous transport systems

Roopa Devi E. M; T. Abirami; Ashit Kumar Dutta; Shtwai Alsubai

Alexandria Engineering Journal (Dec 2024)

Affiliations

Roopa Devi E. M: Department of Information Technology, Kongu Engineering College, Perundurai, Erode 638060, India; Corresponding author.
T. Abirami: Department of Computer Science and Information Systems, College of Applied Sciences, AlMaarefa University, Ad Diriyah, Riyadh 13713, Saudi Arabia
Ashit Kumar Dutta: Department of Computer Science, College of Computer Engineering and Sciences in Al-Kharj, Prince Sattam bin Abdulaziz University, P.O. Box 151, Al-Kharj 11942, Saudi Arabia
Shtwai Alsubai: Department of Information Technology, Kongu Engineering College, Perundurai, Erode 638060, India; Department of Computer Science and Information Systems, College of Applied Sciences, AlMaarefa University, Ad Diriyah, Riyadh 13713, Saudi Arabia; Department of Computer Science, College of Computer Engineering and Sciences in Al-Kharj, Prince Sattam bin Abdulaziz University, P.O. Box 151, Al-Kharj 11942, Saudi Arabia

Journal volume & issue: Vol. 109
pp. 950 – 962

Abstract

The progress of autonomous transport systems (ATS) depends on efficient multimedia communication for real-time data exchange and environmental awareness. Deep learning (DL) powered visual place recognition (VPR) has emerged as an effective tool to improve mobile multimedia communication in ATS. VPR refers to the capability of a method or device to recognize and identify particular places or locations from a visual scene. This procedure involves inspecting visual data, such as images or video frames, to determine the distinctive features associated with different locations. By leveraging camera sensors, VPR allows vehicles to recognize their surroundings, enabling context-aware communication and enhancing overall system performance. DL-powered VPR thus offers a transformative way to improve mobile multimedia communication in ATS. By identifying and understanding their situation, autonomous vehicles can communicate more effectively and operate reliably and safely, paving the way for seamless and intelligent transportation. This article develops a novel Deep Learning-Powered Visual Place Recognition for Enhanced Multimedia Communication in Autonomous Transport Systems (DLVPR-MCATS) methodology. The main aim of the DLVPR-MCATS methodology is to recognize visual places using optimal DL approaches. For this purpose, the DLVPR-MCATS approach uses a bilateral filtering (BF) based preprocessing model. For feature fusion, it combines three models: residual network (ResNet), EfficientNet, and MobileNetV2. Moreover, hyperparameter tuning is performed with the Harris Hawks Optimization (HHO) algorithm. Finally, a bidirectional long short-term memory (BiLSTM) technique is implemented to recognize visual places. A wide range of simulations is executed to validate the DLVPR-MCATS method. The experimental validation showed superior performance of the DLVPR-MCATS method over other models across various aspects.
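
The abstract names bilateral filtering as the preprocessing step but gives no implementation details. As an illustration only, the following is a minimal pure-Python sketch of a bilateral filter on a small grayscale image. The function name `bilateral_filter` and the parameters `radius`, `sigma_space`, and `sigma_range` are assumptions chosen for this example and are not taken from the article; the authors' actual settings and implementation are not stated here.

```python
import math


def bilateral_filter(image, radius=1, sigma_space=1.0, sigma_range=25.0):
    """Bilateral filter for a 2D grayscale image given as a list of rows.

    Hypothetical helper for illustration; parameter names and defaults
    are assumptions, not values from the article.
    """
    height, width = len(image), len(image[0])
    output = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            center = image[y][x]
            weighted_sum = 0.0
            weight_total = 0.0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    ny, nx = y + dy, x + dx
                    if not (0 <= ny < height and 0 <= nx < width):
                        continue  # ignore neighbours outside the image
                    neighbour = image[ny][nx]
                    # Spatial weight: nearby pixels count more.
                    spatial = math.exp(-(dx * dx + dy * dy) / (2 * sigma_space ** 2))
                    # Range weight: pixels with similar intensity count more,
                    # which is what keeps edges from being blurred.
                    tonal = math.exp(-((neighbour - center) ** 2) / (2 * sigma_range ** 2))
                    weight = spatial * tonal
                    weighted_sum += weight * neighbour
                    weight_total += weight
            # weight_total >= 1 because the centre pixel always has weight 1.
            output[y][x] = weighted_sum / weight_total
    return output


# Usage: a sharp vertical edge (0 | 200) and a mildly noisy flat patch.
edge_image = [[0, 0, 200, 200] for _ in range(4)]
noisy_patch = [[100, 100, 100], [100, 110, 100], [100, 100, 100]]

edge_filtered = bilateral_filter(edge_image, sigma_range=10.0)
patch_filtered = bilateral_filter(noisy_patch)
```

With a small `sigma_range`, pixels on either side of a strong edge barely influence each other, so the edge survives, while small intensity fluctuations inside a flat region are averaged out. That edge-preserving behaviour is the usual reason for choosing bilateral filtering over a plain Gaussian blur before feature extraction.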

Published in Alexandria Engineering Journal

ISSN: 1110-0168 (Print); 2090-2670 (Online)
Publisher: Elsevier
Country of publisher: Egypt
LCC subjects: Technology: Engineering (General). Civil engineering (General)
Website: http://www.journals.elsevier.com/alexandria-engineering-journal/
