Determining Thresholds for Optimal Adaptive Discrete Cosine Transformation

Alexander Khanov; Anastasija Shulzhenko; Anzhelika Voroshilova; Alexander Zubarev; Timur Karimov; Shakeeb Fahmi

doi:10.3390/a17080366

Algorithms (Aug 2024)

Determining Thresholds for Optimal Adaptive Discrete Cosine Transformation

Alexander Khanov,
Anastasija Shulzhenko,
Anzhelika Voroshilova,
Alexander Zubarev,
Timur Karimov,
Shakeeb Fahmi

Affiliations

Alexander Khanov: Computer-Aided Design Department, St. Petersburg Electrotechnical University “LETI”, 5 Professora Popova St., 197022 Saint Petersburg, Russia
Anastasija Shulzhenko: Information Security Department, St. Petersburg Electrotechnical University “LETI”, 5 Professora Popova St., 197022 Saint Petersburg, Russia
Anzhelika Voroshilova: School of Public Administration and Entrepreneurship, Institute of Economics and Management, Ural Federal University Named after the First President of Russia B.N.Yeltsin, 51 Lenina Ave., 620075 Yekaterinburg, Russia
Alexander Zubarev: Department of Electrical Engineering, St. Petersburg Electrotechnical University “LETI”, 5 Professora Popova St., 197022 Saint Petersburg, Russia
Timur Karimov: Computer-Aided Design Department, St. Petersburg Electrotechnical University “LETI”, 5 Professora Popova St., 197022 Saint Petersburg, Russia
Shakeeb Fahmi: Computer-Aided Design Department, St. Petersburg Electrotechnical University “LETI”, 5 Professora Popova St., 197022 Saint Petersburg, Russia

DOI: https://doi.org/10.3390/a17080366
Journal volume & issue: Vol. 17, no. 8
p. 366

Abstract

Read online

The discrete cosine transform (DCT) is widely used for image and video compression. Lossy algorithms such as JPEG, WebP, BPG and many others are based on it. Multiple modifications of DCT have been developed to improve its performance. One of them is adaptive DCT (ADCT) designed to deal with heterogeneous image structure and it may be found, for example, in the HEVC video codec. Adaptivity means that the image is divided into an uneven grid of squares: smaller ones retain information about details better, while larger squares are efficient for homogeneous backgrounds. The practical use of adaptive DCT algorithms is complicated by the lack of optimal threshold search algorithms for image partitioning procedures. In this paper, we propose a novel method for optimal threshold search in ADCT using a metric based on tonal distribution. We define two thresholds: pm, the threshold defining solid mean coloring, and ps, defining the quadtree fragment splitting. In our algorithm, the values of these thresholds are calculated via polynomial functions of the tonal distribution of a particular image or fragment. The polynomial coefficients are determined using the dedicated optimization procedure on the dataset containing images from the specific domain, urban road scenes in our case. In the experimental part of the study, we show that ADCT allows a higher compression ratio compared to non-adaptive DCT at the same level of quality loss, up to 66% for acceptable quality. The proposed algorithm may be used directly for image compression, or as a core of video compression framework in traffic-demanding applications, such as urban video surveillance systems.

Published in Algorithms

ISSN: 1999-4893 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.mdpi.com/journal/algorithms

About the journal

Abstract

Keywords