We herein propose an efficient video coding system (EVCS) that consists of a conventional codec and an adaptive, overfitted multi-scale attention network (MSAN) to improve coding efficiency. At the encoder side, the MSAN adaptively adjusts its network size and is trained by overfitting on each group of frames. Using only the video currently being encoded as its training set, the MSAN readily acquires strong restoration capability. After training, the learned parameters of the MSAN are transmitted to the decoder as part of the encoded bitstream. At the decoder side, the MSAN, loaded with the transmitted parameters, restores the reconstructed frames with high fidelity. Compared with the High Efficiency Video Coding (HEVC) standard, the EVCS achieves a 12.141% Bjøntegaard-Delta bitrate reduction, outperforming existing deep-learning-based compressed video restoration methods with lower computational complexity. Moreover, the MSAN is an add-on to the conventional codec that requires no structural changes, making it compatible with existing coding systems.
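To make the encode-side overfitting and parameter transmission concrete, the following is a minimal sketch of the workflow described above, not the paper's MSAN: `RestorationNet`, the training hyperparameters, and the serialization format are illustrative assumptions only.

```python
# Hypothetical sketch: per-GOP overfitting of a small restoration network and
# transfer of its parameters to the decoder side. RestorationNet is a stand-in
# for the MSAN described in the abstract, not the authors' architecture.
import io
import torch
import torch.nn as nn


class RestorationNet(nn.Module):
    """Tiny residual CNN that refines a decoded frame toward the original."""

    def __init__(self, channels: int = 16):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(3, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, 3, 3, padding=1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.body(x)  # predict a residual correction


def overfit_on_gop(decoded: torch.Tensor, original: torch.Tensor,
                   steps: int = 200, lr: float = 1e-3) -> bytes:
    """Encoder side: overfit on one group of frames (N, 3, H, W) and return
    the learned parameters as bytes to be embedded in the bitstream."""
    net = RestorationNet()
    opt = torch.optim.Adam(net.parameters(), lr=lr)
    loss_fn = nn.L1Loss()
    for _ in range(steps):
        opt.zero_grad()
        loss = loss_fn(net(decoded), original)
        loss.backward()
        opt.step()
    buf = io.BytesIO()
    torch.save(net.state_dict(), buf)
    return buf.getvalue()


def restore_at_decoder(decoded: torch.Tensor, payload: bytes) -> torch.Tensor:
    """Decoder side: load the transmitted parameters and refine decoded frames."""
    net = RestorationNet()
    net.load_state_dict(torch.load(io.BytesIO(payload)))
    net.eval()
    with torch.no_grad():
        return net(decoded)


if __name__ == "__main__":
    # Toy tensors standing in for a codec-decoded GOP and its original frames.
    original = torch.rand(4, 3, 64, 64)
    decoded = (original + 0.05 * torch.randn_like(original)).clamp(0, 1)
    payload = overfit_on_gop(decoded, original, steps=50)
    restored = restore_at_decoder(decoded, payload)
    print("restored GOP shape:", tuple(restored.shape))
```

Because the network is trained only on the frames it will later restore, a very small model can suffice, which keeps both the transmitted parameter overhead and the decoder-side computation low; the adaptive sizing and attention mechanisms of the actual MSAN are omitted here.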