PeerJ Computer Science (Dec 2024)
Label dependency modeling in Multi-Label Naïve Bayes through input space expansion
Abstract
In multi-label learning, each instance is associated with several labels simultaneously, in contrast to the single-label setting of conventional datasets. Multi-label techniques often build the classification model for every label from the same feature space. However, labels typically convey distinct semantic information and should therefore have their own distinguishing attributes, and several approaches have been proposed to identify label-specific features for building separate classification models. Our proposed method instead captures and explicitly represents label correlations within the learning framework. The novelty of improved multi-label Naïve Bayes (iMLNB) lies in its expansion of the input space with meta-information derived from the label space, yielding a composite input domain that contains both continuous and categorical variables. To accommodate the heterogeneity of the expanded input space, we refine the likelihood parameters of iMLNB using a joint density function suited to this mixture of data types. We evaluate iMLNB empirically on six benchmark datasets, comparing it against the traditional multi-label Naïve Bayes (MLNB) algorithm across a suite of evaluation metrics. The results show that the proposed method outperforms conventional MLNB on these metrics, underscoring the value of modeling label dependencies in multi-label learning and establishing the approach as a useful contribution to the field.
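The construction can be illustrated with a minimal sketch (an assumption-laden illustration, not the paper's implementation): the input space is expanded by concatenating binary meta-features derived from the label space onto the continuous features, and the Naïve Bayes likelihood factorizes into a Gaussian term for the continuous part and a Bernoulli term for the categorical part. The names here (expand_inputs, MixedNaiveBayes) and the specific meta-feature construction are hypothetical.

```python
import numpy as np

def expand_inputs(X, Y_meta):
    """Concatenate continuous features X with binary label meta-features Y_meta
    (e.g., indicators or statistics of the other labels). Illustrative only."""
    return np.hstack([X, Y_meta])

class MixedNaiveBayes:
    """Naive Bayes with a joint likelihood over mixed data types:
    Gaussian for the first n_continuous columns, Bernoulli for the rest."""

    def __init__(self, n_continuous, eps=1e-9):
        self.n_cont = n_continuous
        self.eps = eps  # variance floor to avoid division by zero

    def fit(self, Z, y):
        self.classes_ = np.unique(y)
        self.prior_, self.mu_, self.var_, self.theta_ = {}, {}, {}, {}
        for c in self.classes_:
            Zc = Z[y == c]
            cont, cat = Zc[:, :self.n_cont], Zc[:, self.n_cont:]
            self.prior_[c] = len(Zc) / len(Z)
            self.mu_[c] = cont.mean(axis=0)             # Gaussian means
            self.var_[c] = cont.var(axis=0) + self.eps  # Gaussian variances
            # Laplace-smoothed Bernoulli parameters for the categorical part
            self.theta_[c] = (cat.sum(axis=0) + 1.0) / (len(Zc) + 2.0)
        return self

    def predict(self, Z):
        cont, cat = Z[:, :self.n_cont], Z[:, self.n_cont:]
        scores = []
        for c in self.classes_:
            # log joint density = log prior + Gaussian log-likelihood
            # (continuous part) + Bernoulli log-likelihood (categorical part)
            log_gauss = -0.5 * (np.log(2.0 * np.pi * self.var_[c])
                                + (cont - self.mu_[c]) ** 2 / self.var_[c]).sum(axis=1)
            log_bern = (cat * np.log(self.theta_[c])
                        + (1.0 - cat) * np.log(1.0 - self.theta_[c])).sum(axis=1)
            scores.append(np.log(self.prior_[c]) + log_gauss + log_bern)
        return self.classes_[np.argmax(np.stack(scores, axis=1), axis=1)]
```

In the multi-label setting sketched here, one such binary classifier would be trained per label on the expanded input Z = expand_inputs(X, Y_meta), with each label's meta-features computed from the remaining labels, so that label dependencies enter the model through the input space rather than through the classifier itself.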
Keywords