Novel Deep Reinforcement Algorithm With Adaptive Sampling Strategy for Continuous Portfolio Optimization

Szu-Hao Huang; Yu-Hsiang Miao; Yi-Ting Hsiao

doi:10.1109/ACCESS.2021.3082186

IEEE Access (Jan 2021)

Novel Deep Reinforcement Algorithm With Adaptive Sampling Strategy for Continuous Portfolio Optimization

Szu-Hao Huang,
Yu-Hsiang Miao,
Yi-Ting Hsiao

Affiliations

Szu-Hao Huang: ORCiD; Department of Information Management and Finance, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
Yu-Hsiang Miao: Institute of Information Management, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
Yi-Ting Hsiao: ORCiD; Institute of Information Management, National Yang Ming Chiao Tung University, Hsinchu, Taiwan

DOI: https://doi.org/10.1109/ACCESS.2021.3082186
Journal volume & issue: Vol. 9
pp. 77371 – 77385

Abstract

Read online

Quantitative trading targets favorable returns by determining patterns in historical data through statistical or mathematical approaches. With advances in artificial intelligence, many studies have indicated that deep reinforcement learning (RL) can perform well in quantitative trading by predicting price change trends in the financial market. However, most of the related frameworks display poor generalizability in the testing stage. Thus, we incorporated adversarial learning and a novel sampling strategy for RL portfolio management. The goal was to construct a portfolio comprising five assets from the constituents of the Dow Jones Industrial Average and to achieve excellent performance through our trading strategy. We used adversarial learning during the RL process to enhance the model’s robustness. Moreover, to improve the model’s computational efficiency, we introduced a novel sampling strategy to determine which data are worth learning by observing the learning condition. The experimental results revealed that the model with our sampling strategy had more favorable performance than the random learning strategy. The Sharpe ratio increased by 6 %–7 %, and profit increased by nearly 45 %. Thus, our proposed learning framework and the sampling strategy we employed are conducive to obtaining reliable trading rules.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords