Effectiveness of machine learning methods for water segmentation with ROI as the label: A case study of the Tuul River in Mongolia

Kai Li; Juanle Wang; Jinyi Yao

International Journal of Applied Earth Observations and Geoinformation (Dec 2021)

Effectiveness of machine learning methods for water segmentation with ROI as the label: A case study of the Tuul River in Mongolia

Kai Li,
Juanle Wang,
Jinyi Yao

Affiliations

Kai Li: State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; College of Geoscience and Surveying Engineering, China University of Mining & Technology (Beijing), Beijing 100083, China
Juanle Wang: State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; Corresponding author.
Jinyi Yao: State Key Laboratory of Resources and Environmental Information System, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; School of Civil and Architectural Engineering, Shandong University of Technology, Zibo 255049, China

Journal volume & issue: Vol. 103
p. 102497

Abstract

Read online

The carrying capacity of water resources is key to the sustainable development of arid and semi-arid regions. There are important challenges related to the detection of discontinuous and crooked water bodies in the vast Mongolian Plateau, despite the availability of remote sensing technology which has the advantage of facilitating water observations over large areas and timelines. Given the high cost and low coverage of high-resolution images and the low resolution of images with high coverage, this study proposes a pixel-based convolutional neural network (CNN) method for the application of water extracted from the region of interest (ROI) to medium-resolution Landsat images. The pixel-based CNN method combines the texture and spectral features of the ground object by connecting the center pixels of the images to the surrounding pixels. ROI is used instead of full-label datasets, reduce the difficulty of building labels in low-to-medium-resolution images. Taking the Tuul River in Mongolia as a case, the pixel-based CNN method, the normalized difference water index threshold (NDWI) method, the modified normalized difference water index (MNDWI) threshold method, U-net model in deep learning, and the pixel-based deep neural network (DNN) method were used with medium-resolution Landsat 8 images with ROI labels. The pixel-based CNN method shows better water extraction results for the cloud, cloud shadows, and building areas, compared with other methods. The method proposed in this study had the highest verification accuracy (92.07%). It also has the advantages of fewer training parameters and shorter training time. The training accuracies of the pixel-based CNN, pixel-based DNN, and U-net were 99.90%, 96.98%, and 93.70%, respectively. All training models and calling methods were uploaded to GitHub (https://github.com/CaryLee17/Pixel-based-CNN).

Published in International Journal of Applied Earth Observations and Geoinformation

ISSN: 1569-8432 (Print); 1872-826X (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Geography. Anthropology. Recreation: Physical geography; Geography. Anthropology. Recreation: Environmental sciences
Website: https://www.journals.elsevier.com/international-journal-of-applied-earth-observation-and-geoinformation

About the journal

Abstract

Keywords