IEEE Access (Jan 2024)
Reinforcement Learning for Two-Stage Permutation Flow Shop Scheduling—A Real-World Application in Household Appliance Production
Abstract
Solving production scheduling problems is a difficult yet indispensable task for manufacturers with a push-oriented planning approach. In this study, we tackle a novel production scheduling problem arising from household appliance production at Miele & Cie. KG, namely a two-stage permutation flow shop scheduling problem (PFSSP) with a finite buffer and sequence-dependent setup efforts. The objective is to minimize idle times and setup efforts in lexicographic order. On extensive, realistic problem instances, exact solutions cannot be obtained due to the combinatorial complexity. Therefore, we developed a reinforcement learning (RL) approach to solve this PFSSP, based on the Proximal Policy Optimization (PPO) algorithm and integrating domain knowledge through reward shaping, action masking, and curriculum learning. Benchmarking against a state-of-the-art genetic algorithm (GA) showed that our approach is significantly superior. Our work thus provides a successful example of applying RL to real-world production planning, demonstrating not only its practical utility but also the technical and methodological integration of the agent with a discrete event simulation (DES). We also conducted experiments to investigate the impact of individual algorithmic elements and of a reward-function hyperparameter on solution quality.
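As a purely illustrative sketch (not the authors' implementation), the snippet below shows how action masking, one of the domain-knowledge mechanisms named above, can be realized in a PPO-style actor-critic policy: logits of infeasible scheduling actions are set to negative infinity so they are never sampled. The environment, mask source, observation size, and network dimensions are assumptions for the example only.

```python
# Illustrative sketch of action masking for a PPO-style policy (PyTorch).
# All names, sizes, and the mask semantics are assumptions, not the paper's code.
import torch
import torch.nn as nn
from torch.distributions import Categorical


class MaskedPolicy(nn.Module):
    def __init__(self, obs_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.Tanh(),
            nn.Linear(hidden, hidden), nn.Tanh(),
        )
        self.logits = nn.Linear(hidden, n_actions)  # actor head
        self.value = nn.Linear(hidden, 1)           # critic head

    def forward(self, obs: torch.Tensor, mask: torch.Tensor):
        h = self.backbone(obs)
        logits = self.logits(h)
        # Assign -inf to logits of infeasible actions (mask == 0) so that
        # they receive zero probability and can never be sampled.
        logits = logits.masked_fill(mask == 0, float("-inf"))
        return Categorical(logits=logits), self.value(h).squeeze(-1)


# Example: sample a feasible job from a state with 5 candidate jobs,
# two of which are currently infeasible (e.g., the buffer is full).
policy = MaskedPolicy(obs_dim=10, n_actions=5)
obs = torch.randn(1, 10)
mask = torch.tensor([[1, 1, 0, 1, 0]])
dist, value = policy(obs, mask)
action = dist.sample()            # guaranteed to be a feasible action
log_prob = dist.log_prob(action)  # would feed into the PPO clipped objective
```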
Keywords