Modified Fast Inverse Square Root and Square Root Approximation Algorithms: The Method of Switching Magic Constants

Leonid V. Moroz; Volodymyr V. Samotyy; Oleh Y. Horyachyy

doi:10.3390/computation9020021

Computation (Feb 2021)

Modified Fast Inverse Square Root and Square Root Approximation Algorithms: The Method of Switching Magic Constants

Leonid V. Moroz,
Volodymyr V. Samotyy,
Oleh Y. Horyachyy

Affiliations

Leonid V. Moroz: Information Technologies Security Department, Lviv Polytechnic National University, 79013 Lviv, Ukraine
Volodymyr V. Samotyy: Automation and Information Technologies Department, Cracow University of Technology, 31155 Cracow, Poland
Oleh Y. Horyachyy: Information Technologies Security Department, Lviv Polytechnic National University, 79013 Lviv, Ukraine

DOI: https://doi.org/10.3390/computation9020021
Journal volume & issue: Vol. 9, no. 2
p. 21

Abstract

Read online

Many low-cost platforms that support floating-point arithmetic, such as microcontrollers and field-programmable gate arrays, do not include fast hardware or software methods for calculating the square root and/or reciprocal square root. Typically, such functions are implemented using direct lookup tables or polynomial approximations, with a subsequent application of the Newton–Raphson method. Other, more complex solutions include high-radix digit-recurrence and bipartite or multipartite table-based methods. In contrast, this article proposes a simple modification of the fast inverse square root method that has high accuracy and relatively low latency. Algorithms are given in C/C++ for single- and double-precision numbers in the IEEE 754 format for both square root and reciprocal square root functions. These are based on the switching of magic constants in the initial approximation, depending on the input interval of the normalized floating-point numbers, in order to minimize the maximum relative error on each subinterval after the first iteration—giving 13 correct bits of the result. Our experimental results show that the proposed algorithms provide a fairly good trade-off between accuracy and latency after two iterations for numbers of type float, and after three iterations for numbers of type double when using fused multiply–add instructions—giving almost complete accuracy.

Published in Computation

ISSN: 2079-3197 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://www.mdpi.com/journal/computation

About the journal

Abstract

Keywords