Department of Software and IT Engineering, École de technologie supérieure, Université du Québec, Montréal, QC, Canada
CNRS, UMR 8520, Département d’Opto-Acousto-Électronique (DOAE), Institut d’Électronique, de Microélectronique et de Nanotechnologie (IEMN), Université Polytechnique Hauts-de-France, Valenciennes, France
This paper introduces a novel deep-learning-assisted video list decoding method for error-prone video transmission systems. Unlike traditional list decoding techniques, our proposed system uses a Transformer-based no-reference image quality assessment method to select the highest-scoring reconstructed video candidate after reception. The Transformer-assisted quality metric introduces three new components: neighborhood-based patch fidelity aggregation, a discriminant color texture transformation, and a ranking-constrained penalty loss function. We have also created our own database of non-uniformly distorted images, similar to those that can result from transmission errors, in a High Efficiency Video Coding (HEVC) context. In our specific testing context, our improved Transformer-assisted method achieves a decision accuracy of 100% when errors occur in an intra-coded image and 96% when they occur in an inter-coded image. Notably, in the few cases where a wrong choice is made, the selected candidate’s quality remains comparable to that of the intact frame. Code: https://github.com/Yujing0926/Robust-Video-List-Decoding-Using-a-Deep-Learning-Approach.
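As a minimal sketch of the candidate-selection step described above, assuming a PyTorch no-reference IQA model that maps a batch of frames to scalar quality scores (the function and parameter names here are hypothetical, not taken from the authors' repository):

    import torch

    def select_best_candidate(candidates, iqa_model):
        """Pick the reconstructed frame candidate with the highest
        no-reference quality score.

        candidates : list[torch.Tensor] -- decoded frame candidates, each (C, H, W)
        iqa_model  : torch.nn.Module    -- hypothetical NR-IQA model mapping
                                           a (N, C, H, W) batch to (N,) scores
        """
        iqa_model.eval()
        with torch.no_grad():
            # Score all candidates in one batch; a higher score indicates
            # better predicted perceptual quality.
            scores = iqa_model(torch.stack(candidates))
        best = int(torch.argmax(scores))
        return candidates[best], float(scores[best])

The key design point is that selection needs no reference frame: only the decoded candidates themselves are scored, which is what makes the approach usable at the receiver after a transmission error.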