A fuzzy rank-based ensemble of CNN models for classification of cervical cytology

Ankur Manna; Rohit Kundu; Dmitrii Kaplun; Aleksandr Sinitca; Ram Sarkar

doi:10.1038/s41598-021-93783-8

Scientific Reports (Jul 2021)

A fuzzy rank-based ensemble of CNN models for classification of cervical cytology

Ankur Manna,
Rohit Kundu,
Dmitrii Kaplun,
Aleksandr Sinitca,
Ram Sarkar

Affiliations

Ankur Manna: Department of Computer Science and Engineering, Jadavpur University
Rohit Kundu: Department of Electrical Engineering, Jadavpur University
Dmitrii Kaplun: Department of Automation and Control Processes, Saint Petersburg Electrotechnical University “LETI”
Aleksandr Sinitca: Department of Automation and Control Processes, Saint Petersburg Electrotechnical University “LETI”
Ram Sarkar: Department of Computer Science and Engineering, Jadavpur University

DOI: https://doi.org/10.1038/s41598-021-93783-8
Journal volume & issue: Vol. 11, no. 1
pp. 1 – 18

Abstract

Read online

Abstract Cervical cancer affects more than 0.5 million women annually causing more than 0.3 million deaths. Detection of cancer in its early stages is of prime importance for eradicating the disease from the patient’s body. However, regular population-wise screening of cancer is limited by its expensive and labour intensive detection process, where clinicians need to classify individual cells from a stained slide consisting of more than 100,000 cervical cells, for malignancy detection. Thus, Computer-Aided Diagnosis (CAD) systems are used as a viable alternative for easy and fast detection of cancer. In this paper, we develop such a method where we form an ensemble-based classification model using three Convolutional Neural Network (CNN) architectures, namely Inception v3, Xception and DenseNet-169 pre-trained on ImageNet dataset for Pap stained single cell and whole-slide image classification. The proposed ensemble scheme uses a fuzzy rank-based fusion of classifiers by considering two non-linear functions on the decision scores generated by said base learners. Unlike the simple fusion schemes that exist in the literature, the proposed ensemble technique makes the final predictions on the test samples by taking into consideration the confidence in the predictions of the base classifiers. The proposed model has been evaluated on two publicly available benchmark datasets, namely, the SIPaKMeD Pap Smear dataset and the Mendeley Liquid Based Cytology (LBC) dataset, using a 5-fold cross-validation scheme. On the SIPaKMeD Pap Smear dataset, the proposed framework achieves a classification accuracy of 98.55% and sensitivity of 98.52% in its 2-class setting, and 95.43% accuracy and 98.52% sensitivity in its 5-class setting. On the Mendeley LBC dataset, the accuracy achieved is 99.23% and sensitivity of 99.23%. The results obtained outperform many of the state-of-the-art models, thereby justifying the effectiveness of the same. The relevant codes of this proposed model are publicly available on GitHub .

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal