Scientific Reports (Dec 2024)
A multi modal fusion coal gangue recognition method based on IBWO-CNN-LSTM
Abstract
Abstract Accurate identification of coal and gangue is a crucial guarantee for efficient and safe mining of top coal caving face. This article proposes a coal-gangue recognition method based on an improved beluga whale optimization algorithm (IBWO), convolutional neural network, and long short-term memory network (CNN-LSTM) multi-modal fusion model. First, the mutation and memory library mechanisms are introduced into the beluga whale optimization to explore the solution space fully, prevent falling into local optimum, and accelerate the convergence process. Subsequently, the image mapping of the audio signal and vibration signal is performed to extract Mel-Frequency Cepstral Coefficients (MFCC) features, generating rich sample data for CNN-LSTM. Then the multi-head attention mechanism is introduced into CNN-LSTM to speed up the training speed and improve the classification accuracy. Finally, the IBWO-CNN-LSTM coal-gangue recognition model is constructed by the optimal hyperparameter combination obtained by IBWO to realize the automatic recognition of coal-gangue. The benchmark function proves that IBWO is superior to other optimization algorithms. By building an experimental platform for the impact of coal and gangue falling on the tail beam of hydraulic support, multiple experimental data collection is carried out. The experimental results show that the proposed coal-gangue recognition model has better performance than other recognition models, and the accuracy rate reaches 95.238%. The multi-modal fusion strategy helps to improve the accuracy and robustness of coal-gangue recognition.
Keywords