THE EFFICIENCY OF QUALITATIVE DATA CLUSTERING IN MEDICAL AND SOCIO-ECONOMIC SURVEYS – COMPARATIVE STUDY

Cristina Gena Dascălu; Magda Ecaterina Antohe; Doriana Agop-Forna; Petruta Siminiuc; Norina Forna

doi:10.6261/RJOR.2024.2.16.47

Romanian Journal of Oral Rehabilitation (Jun 2024)

THE EFFICIENCY OF QUALITATIVE DATA CLUSTERING IN MEDICAL AND SOCIO-ECONOMIC SURVEYS – COMPARATIVE STUDY

Cristina Gena Dascălu,
Magda Ecaterina Antohe,
Doriana Agop-Forna,
Petruta Siminiuc,
Norina Forna

Affiliations

Cristina Gena Dascălu: ”Grigore T. Popa” U.M.Ph. - Iași, Romania, Faculty of Medicine, Medical Informatics Dept.
Magda Ecaterina Antohe: ”Grigore T. Popa” U.M.Ph. - Iași, Romania, Faculty of Dental Medicine, Implantology and Prosthetic Implant Rehabilitation Dept
Doriana Agop-Forna: ”Grigore T. Popa” U.M.Ph. - Iași, Romania, Faculty of Dental Medicine, Dento-alveolar and Maxillo-Facial Dept
Petruta Siminiuc: PhD Student, “Grigore T.Popa” University of Medicine and Pharmacy Iasi, Faculty of Dental Medicine, Romania
Norina Forna: ”Grigore T. Popa” U.M.Ph. - Iași, Romania, Faculty of Dental Medicine, Implantology and Prosthetic Implant Rehabilitation Dept

DOI: https://doi.org/10.6261/RJOR.2024.2.16.47
Journal volume & issue: Vol. 16, no. 2
pp. 518 – 526

Abstract

Read online

Clustering is a complex data mining tool, useful to identify similarities in large amount of data, the medical databases being highly suitable in this regard. Our paper aims to compare the efficacy of two well-known clustering methods, the n-means algorithm and the classical hierarchical algorithm, and to apply them in analyzing a medical-economic database on dietary habits, social economic status and oral health in a sample of 326 men, aged between 25 and 30, living in the urban area – in order to identify possible associations between dietary habits and income levels. We identified 4 clusters which correspond partially to the 4 income levels recorded in the investigated sample and reveal the associated dietary habits. The n-means clustering performed better than the Single Linkage hierarchical classification, being therefore highly suitable in the analysis of socio-economic and general health data.

Published in Romanian Journal of Oral Rehabilitation

ISSN: 2066-7000 (Print); 2601-4661 (Online)
Publisher: Romanian Society of Oral Rehabilitation
Country of publisher: Romania
LCC subjects: Medicine: Dentistry
Website: http://www.rjor.ro/

About the journal

Abstract

Keywords