Multiple-base Logarithmic Quantization and Application in Reduced Precision AI Computations

Vassil Dimitrov; Richard Ford; Laurent Imbert; Arjuna Madanayake; Nilan Udayanga; Will Wray

doi:10.55630/dipp.2024.14.5

Digital Presentation and Preservation of Cultural and Scientific Heritage (Sep 2024)

Multiple-base Logarithmic Quantization and Application in Reduced Precision AI Computations

Vassil Dimitrov,
Richard Ford,
Laurent Imbert,
Arjuna Madanayake,
Nilan Udayanga,
Will Wray

Affiliations

Vassil Dimitrov: Lemurian Labs, Oakville, Canada; University of Calgary, Calgary, Canada
Richard Ford: Lemurian Labs, Oakville, Canada
Laurent Imbert: Lemurian Labs, Oakville, Canada; LIRMM, CNRS, University of Montpellier, Montpellier, France
Arjuna Madanayake: Lemurian Labs, Oakville, Canada
Nilan Udayanga: Lemurian Labs, Oakville, Canada
Will Wray: Lemurian Labs, Oakville, Canada

DOI: https://doi.org/10.55630/dipp.2024.14.5
Journal volume & issue: Vol. 14

Abstract

Read online

The power of logarithmic quantizations and computations has been recognized as a useful tool in optimizing the performance of large ML models. There are plenty of applications of ML techniques in digital preservation. The accuracy of computations may play a crucial role in the corresponding algorithms. In this article, we provide results that demonstrate significantly better quantization signal-to-noise ratio performance thanks to multiple-base logarithmic number systems (MDLNS) in comparison with the floating point quantizations that use the same number of bits. On a hardware level, we present details about our Xilinx VCU-128 FPGA design for dot product and matrix vector computations. The MDLNS matrix-vector design significantly outperforms equivalent fixed-point binary designs in terms of area (A) and time (T) complexity and power consumption as evidenced by a 4 × scaling of AT 2 metric for VLSI performance, and 57% increase in computational throughput per watt compared to fixed-point arithmetic.

not defined

Published in Digital Presentation and Preservation of Cultural and Scientific Heritage

ISSN: 1314-4006 (Print); 2535-0366 (Online)
Publisher: Bulgarian Academy of Sciences, Institute of Mathematics and Informatics
Country of publisher: Bulgaria
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: https://dipp.math.bas.bg/

About the journal

Abstract

Keywords