Intelligent Control/Operational Strategies in WWTPs through an Integrated Q-Learning Algorithm with ASM2d-Guided Reward

Jiwei Pang; Shanshan Yang; Lei He; Yidi Chen; Nanqi Ren

doi:10.3390/w11050927

Water (May 2019)

Affiliations

Jiwei Pang: State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin 150000, China
Shanshan Yang: State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin 150000, China
Lei He: State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin 150000, China
Yidi Chen: State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin 150000, China
Nanqi Ren: State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin 150000, China

DOI: https://doi.org/10.3390/w11050927
Journal volume & issue: Vol. 11, no. 5, p. 927

Abstract

The operation of a wastewater treatment plant (WWTP) is a typical complex control problem: its nonlinear dynamics and the coupling among process variables make real-time optimal control an enormous challenge. In this study, a Q-learning algorithm with a reward setting guided by activated sludge model No. 2d (ASM2d), referred to as the integrated ASM2d-QL algorithm, is proposed, and the widely applied anaerobic-anoxic-oxic (AAO) system is chosen as the research paradigm. The integrated ASM2d-QL algorithm, equipped with a self-learning mechanism, is derived to optimize the control strategies of the AAO system, namely the hydraulic retention time (HRT) and the internal recycling ratio (IRR). To optimize these control strategies under varying influent loads, Q matrices were built for both HRTs and IRR optimization through the <max reward-action> pair based on the integrated ASM2d-QL algorithm. Eight days of actual influent quality data from a municipal AAO wastewater treatment plant in June were arbitrarily chosen as the influent concentrations for model verification. Good agreement between the model simulations and the experimental results indicated that the proposed integrated ASM2d-QL algorithm performed properly and realized intelligent modeling and stable optimal control strategies under fluctuating influent loads during wastewater treatment.
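The <max reward-action> idea in the abstract can be illustrated with a minimal tabular Q-learning sketch. It is not the authors' implementation. The load classes, the candidate HRT values, the target table, and the function `placeholder_reward` are all hypothetical stand-ins, since the abstract does not specify the ASM2d-guided reward or the state and action discretization.

```python
import random

# Hypothetical action set: candidate HRT values in hours (not from the paper).
HRT_ACTIONS = [4.0, 6.0, 8.0, 10.0]
# Hypothetical state set: coarse influent load classes (not from the paper).
LOAD_STATES = ["low", "medium", "high"]


def placeholder_reward(state, hrt):
    """Stand-in for the ASM2d-guided reward (an assumption, not the paper's).

    Returns a larger (less negative) reward the closer the HRT is to a
    made-up target for the given load class.
    """
    target = {"low": 4.0, "medium": 6.0, "high": 10.0}[state]
    return -abs(hrt - target)


def train_q_matrix(episodes=2000, alpha=0.1, gamma=0.9, epsilon=0.2, seed=0):
    """Build a Q matrix (state -> list of action values) by Q-learning."""
    rng = random.Random(seed)
    q = {s: [0.0] * len(HRT_ACTIONS) for s in LOAD_STATES}
    state = rng.choice(LOAD_STATES)
    for _ in range(episodes):
        # Epsilon-greedy action selection.
        if rng.random() < epsilon:
            a = rng.randrange(len(HRT_ACTIONS))
        else:
            a = max(range(len(HRT_ACTIONS)), key=lambda i: q[state][i])
        r = placeholder_reward(state, HRT_ACTIONS[a])
        # Assumption: the influent load changes independently of the action.
        next_state = rng.choice(LOAD_STATES)
        # Standard Q-learning update toward r + gamma * max_a' Q(s', a').
        q[state][a] += alpha * (r + gamma * max(q[next_state]) - q[state][a])
        state = next_state
    return q


def best_action(q, state):
    """Read the max-reward action for a state out of the Q matrix."""
    i = max(range(len(HRT_ACTIONS)), key=lambda i: q[state][i])
    return HRT_ACTIONS[i]
```

Calling `train_q_matrix()` and then `best_action(q, "high")` returns the HRT with the largest learned value for that load class. In the setting the abstract describes, the placeholder reward would presumably be replaced by a score derived from ASM2d simulation results, and a second Q matrix would be built in the same way for the IRR.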

Published in Water

ISSN: 2073-4441 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Hydraulic engineering; Technology: Environmental technology. Sanitary engineering: Water supply for domestic and industrial purposes
Website: http://www.mdpi.com/journal/water/
