PeerJ Computer Science (Feb 2024)

Categorization of tweets for damages: infrastructure and human damage assessment using fine-tuned BERT model

  • Muhammad Shahid Iqbal Malik,
  • Muhammad Zeeshan Younas,
  • Mona Mamdouh Jamjoom,
  • Dmitry I. Ignatov

DOI
https://doi.org/10.7717/peerj-cs.1859
Journal volume & issue
Vol. 10
p. e1859

Abstract

Read online Read online

Identification of infrastructure and human damage assessment tweets is beneficial to disaster management organizations as well as victims during a disaster. Most of the prior works focused on the detection of informative/situational tweets, and infrastructure damage, only one focused on human damage. This study presents a novel approach for detecting damage assessment tweets involving infrastructure and human damages. We investigated the potential of the Bidirectional Encoder Representations from Transformer (BERT) model to learn universal contextualized representations targeting to demonstrate its effectiveness for binary and multi-class classification of disaster damage assessment tweets. The objective is to exploit a pre-trained BERT as a transfer learning mechanism after fine-tuning important hyper-parameters on the CrisisMMD dataset containing seven disasters. The effectiveness of fine-tuned BERT is compared with five benchmarks and nine comparable models by conducting exhaustive experiments. The findings show that the fine-tuned BERT outperformed all benchmarks and comparable models and achieved state-of-the-art performance by demonstrating up to 95.12% macro-f1-score, and 88% macro-f1-score for binary and multi-class classification. Specifically, the improvement in the classification of human damage is promising.

Keywords