Toward Tweet-Mining Framework for Extracting Terrorist Attack-Related Information and Reporting

Farkhund Iqbal; Rabia Batool; Benjamin C. M. Fung; Saiqa Aleem; Ahmed Abbasi; Abdul Rehman Javed

doi:10.1109/ACCESS.2021.3102040

IEEE Access (Jan 2021)

Toward Tweet-Mining Framework for Extracting Terrorist Attack-Related Information and Reporting

Farkhund Iqbal,
Rabia Batool,
Benjamin C. M. Fung,
Saiqa Aleem,
Ahmed Abbasi,
Abdul Rehman Javed

Affiliations

Farkhund Iqbal: ORCiD; College of Technological Innovation, Zayed University, Abu Dhabi, United Arab Emirates
Rabia Batool: College of Technological Innovation, Zayed University, Abu Dhabi, United Arab Emirates
Benjamin C. M. Fung: ORCiD; School of Information Studies, McGill University, Montreal, Canada
Saiqa Aleem: ORCiD; College of Technological Innovation, Zayed University, Abu Dhabi, United Arab Emirates
Ahmed Abbasi: Department of Cyber Security, Air University, Islamabad, Pakistan
Abdul Rehman Javed: ORCiD; Department of Cyber Security, Air University, Islamabad, Pakistan

DOI: https://doi.org/10.1109/ACCESS.2021.3102040
Journal volume & issue: Vol. 9
pp. 115535 – 115547

Abstract

Read online

The widespread popularity of social networking is leading to the adoption of Twitter as an information dissemination tool. Existing research has shown that information dissemination over Twitter has a much broader reach than traditional media and can be used for effective post-incident measures. People use informal language on Twitter, including acronyms, misspelled words, synonyms, transliteration, and ambiguous terms. This makes incident-related information extraction a non-trivial task. However, this information can be valuable for public safety organizations that need to respond in an emergency. This paper proposes an early event-related information extraction and reporting framework that monitors Twitter streams synthesizes event-specific information, e.g., a terrorist attack, and alerts law enforcement, emergency services, and media outlets. Specifically, the proposed framework, Tweet-to-Act (T2A), employs word embedding to transform tweets into a vector space model and then utilizes the Word Mover’s Distance (WMD) to cluster tweets for the identification of incidents. To extract reliable and valuable information from a large dataset of short and informal tweets, the proposed framework employs sequence labeling with bidirectional Long Short-Term Memory based Recurrent Neural Networks (bLSTM-RNN). Extensive experimental results suggest that our proposed framework, T2A, outperforms other state-of-the-art methods that use vector space modeling and distance calculation techniques, e.g., Euclidean and Cosine distance. T2A achieves an accuracy of 96% and an F1-score of 86.2% on real-life datasets.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords