An ensemble machine learning model to uncover potential sites of hazardous waste illegal dumping based on limited supervision experience

Jinghua Geng; Yimeng Ding; Wenjun Xie; Wen Fang; Miaomiao Liu; Zongwei Ma; Jianxun Yang; Jun Bi

Fundamental Research (Jul 2024)

An ensemble machine learning model to uncover potential sites of hazardous waste illegal dumping based on limited supervision experience

Jinghua Geng,
Yimeng Ding,
Wenjun Xie,
Wen Fang,
Miaomiao Liu,
Zongwei Ma,
Jianxun Yang,
Jun Bi

Affiliations

Jinghua Geng: State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023 China
Yimeng Ding: State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023 China
Wenjun Xie: State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023 China
Wen Fang: Corresponding author.; State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023 China
Miaomiao Liu: State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023 China
Zongwei Ma: State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023 China
Jianxun Yang: State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023 China
Jun Bi: State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023 China

Journal volume & issue: Vol. 4, no. 4
pp. 972 – 978

Abstract

Read online

With the soaring generation of hazardous waste (HW) during industrialization and urbanization, HW illegal dumping continues to be an intractable global issue. Particularly in developing regions with lax regulations, it has become a major source of soil and groundwater contamination. One dominant challenge for HW illegal dumping supervision is the invisibility of dumping sites, which makes HW illegal dumping difficult to be found, thereby causing a long-term adverse impact on the environment. How to utilize the limited historic supervision records to screen the potential dumping sites in the whole region is a key challenge to be addressed. In this study, a novel machine learning model based on the positive-unlabeled (PU) learning algorithm was proposed to resolve this problem through the ensemble method which could iteratively mine the features of limited historic cases. Validation of the random forest-based PU model showed that the predicted top 30% of high-risk areas could cover 68.1% of newly reported cases in the studied region, indicating the reliability of the model prediction. This novel framework will also be promising in other environmental management scenarios to deal with numerous unknown samples based on limited prior experience.

Published in Fundamental Research

ISSN: 2667-3258 (Online)
Publisher: KeAi Communications Co. Ltd.
Country of publisher: China
LCC subjects: Science: Science (General)
Website: https://www.keaipublishing.com/en/journals/fundamental-research/

About the journal

Abstract

Keywords