Categorization of tweets for damages: infrastructure and human damage assessment using fine-tuned BERT model

Muhammad Shahid Iqbal Malik; Muhammad Zeeshan Younas; Mona Mamdouh Jamjoom; Dmitry I. Ignatov

doi:10.7717/peerj-cs.1859

PeerJ Computer Science (Feb 2024)

Categorization of tweets for damages: infrastructure and human damage assessment using fine-tuned BERT model

Muhammad Shahid Iqbal Malik,
Muhammad Zeeshan Younas,
Mona Mamdouh Jamjoom,
Dmitry I. Ignatov

Affiliations

Muhammad Shahid Iqbal Malik: Department of Computer Science, National Research University Higher School of Economics, Moscow, Russia
Muhammad Zeeshan Younas: Department of Computer Science, Capital University of Science and Technology, Islamabad, Pakistan
Mona Mamdouh Jamjoom: Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
Dmitry I. Ignatov: Department of Computer Science, National Research University Higher School of Economics, Moscow, Russia

DOI: https://doi.org/10.7717/peerj-cs.1859
Journal volume & issue: Vol. 10
p. e1859

Abstract

Read online Read online

Identification of infrastructure and human damage assessment tweets is beneficial to disaster management organizations as well as victims during a disaster. Most of the prior works focused on the detection of informative/situational tweets, and infrastructure damage, only one focused on human damage. This study presents a novel approach for detecting damage assessment tweets involving infrastructure and human damages. We investigated the potential of the Bidirectional Encoder Representations from Transformer (BERT) model to learn universal contextualized representations targeting to demonstrate its effectiveness for binary and multi-class classification of disaster damage assessment tweets. The objective is to exploit a pre-trained BERT as a transfer learning mechanism after fine-tuning important hyper-parameters on the CrisisMMD dataset containing seven disasters. The effectiveness of fine-tuned BERT is compared with five benchmarks and nine comparable models by conducting exhaustive experiments. The findings show that the fine-tuned BERT outperformed all benchmarks and comparable models and achieved state-of-the-art performance by demonstrating up to 95.12% macro-f1-score, and 88% macro-f1-score for binary and multi-class classification. Specifically, the improvement in the classification of human damage is promising.

Published in PeerJ Computer Science

ISSN: 2376-5992 (Online)
Publisher: PeerJ Inc.
Country of publisher: United States
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://peerj.com/computer-science/

About the journal

Abstract

Keywords