Computer Methods and Programs in Biomedicine Update (Jan 2023)
Semi-supervised active transfer learning for fetal ECG arrhythmia detection
Abstract
Deep learning has demonstrated excellent results for ECG anomaly detection, wherein most approaches used supervised learning. The requirement of thousands of manually annotated samples is a concern for state-of-the-art anomaly detection systems, especially for fetal ECG (FECG), and currently, there is not a publicly available FECG dataset annotated for each FECG beat. In this paper, we offer a modified active learning technique based on transfer learning, calibration probability, and autoencoder-based sampling to reduce number of samples requires to annotate. In this regard, we used 25,000 s of recording from 47 patients from the MIT-BIH Arrhythmia Database to train a deep learning model to detect anomalies in non-fetus subjects. Then we used the unlabeled Non-Invasive Fetal ECG Arrhythmia Database (NIFEA DB) of 26 subjects to fine-tune the trained model to fine-tune the trained model based on active learning to detect anomalies in binary form for fetal. A variational autoencoder is trained on all data (adult and fetal ECG), and clustering is applied to latent features extracted from data after dimension reduction. Then, the sampling process of active learning selected samples from different clusters with low confidence to cover all data distribution. Moreover, a probability calibration based on mc-dropout and isotonic regression is used to calibrate confidences, helping to select reliable low-confidence samples. Various ablation studies were performed to show the influence of autoencoder-based sampling, calibration, and transfer learning, which showed that the proposed method could achieve 92% accuracy using 399 training samples. In contrast, other methods required more training samples to reach the same level of accuracy without calibration or an autoencoder and clustering approach or training without active learning. The study also found that transfer learning significantly impacted faster convergence and that the proposed active learning approach was more effective than traditional methods.