Journal of Engineering and Sustainable Development (Nov 2024)
Unsupervised Histopathological Sub-Image Analysis for Breast Cancer Diagnosis Using Variational Autoencoders, Clustering, and Supervised Learning
Abstract
This paper presents an integrated approach to breast cancer diagnosis that combines unsupervised and supervised learning techniques. The method involves using a pre-trained VGG19 model to process sub-images from the BreaKHis dataset, divided into nine parts for comprehensive analysis. This will be followed by a complete description of the architecture and workings of the variational Autoencoder (VAE) used for unsupervised Learning. The encoder network maps the input features to lower dimensions, capturing the most essential information. VAE learns a compressed representation of sub-images, facilitating a more profound understanding of underlying patterns and structures. For this reason, we then employ k-means clustering on the encoded representation to find naturally occurring clusters in our data set comprising a histopathological image. Every single sub-image is later fed into the VGG19-SVM model for classification purposes. During magnification at 100x, this model has attained a fantastic accuracy rate of 98.56%. Combining unsupervised analysis with VAE/k-means clustering and supervised classification with VGG19/SVM can integrate information from both methods, thereby improving the accuracy and robustness of such a task as sub-image classification in breast cancer histopathology.
Keywords