Mathematics (Oct 2022)
Persistent Homology for Breast Tumor Classification Using Mammogram Scans
Abstract
An important tool in the field of topological data analysis is persistent homology (PH), which is used to encode abstract representations of the homology of data at different resolutions in the form of persistence barcode (PB). Normally, one will obtain one PB from a digital image when using a sublevel-set filtration method. In this work, we built more than one PB representation of a single image based on a landmark selection method, known as local binary patterns (LBP), which encode different types of local texture from a digital image. Starting from the top-left corner of any 3-by-3 patch selected from an input image, the LBP process starts by subtracting the central pixel value from its eight neighboring pixel values. Then, each cell is assigned with 1 if the subtraction outcome is positive, and 0 otherwise, to obtain an 8-bit binary representation. This process will identify a set of landmark pixels to represent 0-simplices and use Vietoris–Rips filtration to obtain its corresponding PB. Using LBP, we can construct up to 56 PBs from a single image if we restrict to only using the binary codes that have two circular transitions between 1 and 0. The information within these 56 PBs contain detailed local and global topological and geometrical information, which can be used to design effective machine learning models. We used four different PB vectorizations, namely, persistence landscapes, persistence images, Betti curves (barcode binning), and PB statistics. We tested the effectiveness of the proposed landmark-based PH on two publicly available breast abnormality detection datasets using mammogram scans. The sensitivity and specificity of the landmark-based PH obtained was over 90% and 85%, respectively, in both datasets for the detection of abnormal breast scans. Finally, the experimental results provide new insights on using different PB vectorizations with sublevel set filtrations and landmark-based Vietoris–Rips filtration from digital mammogram scans.
Keywords