Generating Cryptographic S-Boxes Using the Reinforcement Learning

Giyoon Kim; Hangi Kim; Yeachan Heo; Yongjin Jeon; Jongsung Kim

doi:10.1109/ACCESS.2021.3085861

IEEE Access (Jan 2021)

Generating Cryptographic S-Boxes Using the Reinforcement Learning

Giyoon Kim,
Hangi Kim,
Yeachan Heo,
Yongjin Jeon,
Jongsung Kim

Affiliations

Giyoon Kim: Department of Mathematics and Financial Information Security, Kookmin University, Seoul, South Korea
Hangi Kim: Department of Mathematics and Financial Information Security, Kookmin University, Seoul, South Korea
Yeachan Heo: ORCiD; Gyeongsang National University High School (GNUHS), Jinju, South Korea
Yongjin Jeon: ORCiD; Department of Mathematics and Financial Information Security, Kookmin University, Seoul, South Korea
Jongsung Kim: ORCiD; Department of Mathematics and Financial Information Security, Kookmin University, Seoul, South Korea

DOI: https://doi.org/10.1109/ACCESS.2021.3085861
Journal volume & issue: Vol. 9
pp. 83092 – 83104

Abstract

Read online

Substitution boxes (S-boxes) are essential components of many cryptographic primitives. The Dijkstra algorithm, SAT solvers, and heuristic methods have been used to find bitsliced implementations of S-boxes. However, it is difficult to apply these methods for 8-bit S-boxes because of their size. Therefore, to implement these S-boxes so that the countermeasure of side-channel attack can be applied efficiently, using structures such as Feistel, Lai-Massey, and MISTY that can be bitsliced implemented with a small number of nonlinear operations has been widely used. Since S-boxes constructed with structures consist of small S-boxes and have specific designs, there are limitations to their cryptographic security and efficiency. In this paper, we propose a new method for generating S-boxes by stacking bitwise operations from the identity function, an approach that is different from existing methods. This method can be expressed in Markov decision process, and reinforcement learning is a suitable solver for Markov decision process. Our goal is to train this method to an agent through reinforcement learning to generate S-boxes to which the masking scheme, which is a countermeasure of side-channel attack, can be efficiently applied. In particular, our method provided various S-boxes superior or comparable to existing S-boxes. We produced 8-bit S-boxes with differential uniformity 16 (resp. 32) and linearity 128 (resp. 128), generated with nine (resp. eight) nonlinear operations, for the first time. To our best knowledge, this is the first study to construct cryptographic S-Box by incorporating reinforcement learning.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords