Scientific Data (Aug 2024)

A 10-m scale chemical industrial parks map along the Yangtze River in 2021 based on machine learning

  • Wenming Song,
  • Mingxing Chen,
  • Zhipeng Tang

DOI
https://doi.org/10.1038/s41597-024-03674-6
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Strengthening industrial pollution control in the Yangtze River is a fundamental national policy of China. There is a lack of detailed distribution of chemical industrial parks (CIPs). This Study utilized random forest (RF) and active learning to generate the distribution map of CIPs along the Yangtze River at 10-m resolution. Based on Sentinel-2 imagery, spectral and texture features are extracted. Combined with the Points of Interest (POI), a multidimensional feature space is constructed. By employing partitioned training, classification of CIPs map is achieved on Google Earth Engine (GEE). Technical validation along the entire Yangtze River demonstrates a model accuracy of 80%. Compared to traditional manual survey methods, this approach saves significant time and economic costs while also being timelier. As the first publicly available CIPs map within a 5-km range along the Yangtze River, this research will provide a scientific basis for the fine governance of chemical industries in the region. Additionally, it offers a model guide for the accurate identification of the chemical industry.