Predictive Modelling for Sensitive Social Media Contents Using Entropy-FlowSort and Artificial Neural Networks Initialized by Large Language Models

Narcisan Galamiton; Suzette Bacus; Noreen Fuentes; Janeth Ugang; Rica Villarosa; Charldy Wenceslao; Lanndon Ocampo

doi:10.1007/s44196-024-00668-5

International Journal of Computational Intelligence Systems (Oct 2024)

Predictive Modelling for Sensitive Social Media Contents Using Entropy-FlowSort and Artificial Neural Networks Initialized by Large Language Models

Narcisan Galamiton,
Suzette Bacus,
Noreen Fuentes,
Janeth Ugang,
Rica Villarosa,
Charldy Wenceslao,
Lanndon Ocampo

Affiliations

Narcisan Galamiton: College of Computer, Information and Communications Technology, Cebu Technological University
Suzette Bacus: College of Computer, Information and Communications Technology, Cebu Technological University
Noreen Fuentes: College of Computer, Information and Communications Technology, Cebu Technological University
Janeth Ugang: College of Computer, Information and Communications Technology, Cebu Technological University
Rica Villarosa: Center for Applied Mathematics and Operations Research, Cebu Technological University
Charldy Wenceslao: Center for Applied Mathematics and Operations Research, Cebu Technological University
Lanndon Ocampo: Center for Applied Mathematics and Operations Research, Cebu Technological University

DOI: https://doi.org/10.1007/s44196-024-00668-5
Journal volume & issue: Vol. 17, no. 1
pp. 1 – 18

Abstract

Read online

Abstract This work offers an integrated methodological framework that integrates the capabilities of large language models (LLMs), rules-based reasoning, multi-criteria sorting, and artificial neural networks (ANN) in developing a predictive model for classifying the intensity of sensitive social media contents. The current literature lacks a holistic consideration of multiple attributes in evaluating social media contents, and the proposed framework intends to bridge such a gap. Three actions constitute the development of the framework. First, LLMs (i.e., GPT4) evaluate the social media contents under a predefined set of attributes, leveraging the power of LLMs in content analytics. Second, rules-based reasoning and multi-criteria sorting (i.e., entropy-FlowSort) determine the categories of social media contents. Lastly, the two previous actions produced a complete dataset that can be used to train a predictive model using ANN to classify sensitive social media contents. With 1100 randomly extracted social media contents and the predefined categories of violations against community standards set by Facebook, the proposed integrated methodology produces an ANN-based classification model with 86.36% prediction accuracy. Comparative analysis using Decision Trees, k-nearest neighbors, Linear Discriminant Analysis, Random Forest, and Naive Bayes classification yields the highest performance of ANN. The predictive model can be used as a decision-support tool to design moderation actions on social media contents.

Published in International Journal of Computational Intelligence Systems

ISSN: 1875-6891 (Print); 1875-6883 (Online)
Publisher: Springer
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.springer.com/journal/44196

About the journal

Abstract

Keywords