Water (Jan 2019)

Water Price Prediction for Increasing Market Efficiency Using Random Forest Regression: A Case Study in the Western United States

  • Ziyao Xu,
  • Jijian Lian,
  • Lingling Bin,
  • Kaixun Hua,
  • Kui Xu,
  • Hoi Yi Chan

DOI
https://doi.org/10.3390/w11020228
Journal volume & issue
Vol. 11, no. 2
p. 228

Abstract

Read online

The existence of water markets establishes water prices, promoting trading of water from low- to high-valued uses. However, market participants can face uncertainty when asking and offering prices because water rights are heterogeneous, resulting in inefficiency of the market. This paper proposes three random forest regression models (RFR) to predict water price in the western United States: a full variable set model and two reduced ones with optimal numbers of variables using a backward variable elimination (BVE) approach. Transactions of 12 semiarid states, from 1987 to 2009, and a dataset containing various predictors, were assembled. Multiple replications of k-fold cross-validation were applied to assess the model performance and their generalizability was tested on unused data. The importance of price influencing factors was then analyzed based on two plausible variable importance rankings. Results show that the RFR models have good predictive power for water price. They outperform a baseline model without leading to overfitting. Also, the higher degree of accuracy of the reduced models is insignificant, reflecting the robustness of RFR to including lower informative variables. This study suggests that, due to its ability to automatically learn from and make predictions on data, RFR-based models can aid water market participants in making more efficient decisions.

Keywords