Informatics in Medicine Unlocked (Jan 2023)
Prediction of chronic liver disease patients using integrated projection based statistical feature extraction with machine learning algorithms
Abstract
The healthy liver plays more than 500 organic roles in the human body, while a malfunction may be dangerous or even deadly. Early diagnosis and treatment of liver disease can improve the likelihood of survival. Machine learning (ML) is a powerful tool that can assist healthcare professionals during the diagnostic process for a hepatic patient. The standard ML system includes the methods of data pre-processing, feature extraction, and classification. In the feature extraction stage, ML researchers frequently use projection-based feature extraction approaches to remove data redundancy, but this does not produce the desired results. In addition, most statistical projection methods have different purposes when projecting original features. The Indian liver patient dataset (ILPD) from the University of California, Irvin (UCI) repository is used in this study to classify chronic liver disease. The data set has 583 patient disease records; 416 patients have liver disease, and 167 do not. Using several projection methods, we proposed an integrated feature extraction approach to categorize liver patients. In the pipeline, the proposed method first imputes the missing values and outliers for pre-treatment. Then, integrated feature extraction applies the pre-processed data to extract the significant features for classification. A simulation study is also being conducted to strengthen the suggested methodology. The proposed approach incorporates several ML algorithms, including logistic regression (LR), random forest (RF), K-nearest neighbor (KNN), support vector machine (SVM), multilayer perceptron (MLP), and the ensemble voting classifier. The offered system has an accuracy of 88.10%, a precision of 85.33%, a recall of 92.30%, an F1 score of 88.68%, and an AUC score of 88.20% in predicting liver diseases. Our proposed technique yielded 0.10–18.5% better results than the latest existing studies. The findings suggest that the recommended system could be used to supplement a physician's diagnosis of liver disease.