Integrating Audio Signal Processing and Deep Learning Algorithms for Gait Pattern Classification in Brazilian Gaited Horses

Anderson Antonio Carvalho Alves; Lucas Tassoni Andrietta; Rafael Zinni Lopes; Fernando Oliveira Bussiman; Fabyano Fonseca e Silva; Roberto Carvalheiro; Roberto Carvalheiro; Luiz Fernando Brito; Júlio César de Carvalho Balieiro; Lucia Galvão Albuquerque; Lucia Galvão Albuquerque; Ricardo Vieira Ventura

doi:10.3389/fanim.2021.681557

Frontiers in Animal Science (Aug 2021)

Integrating Audio Signal Processing and Deep Learning Algorithms for Gait Pattern Classification in Brazilian Gaited Horses

Anderson Antonio Carvalho Alves,
Lucas Tassoni Andrietta,
Rafael Zinni Lopes,
Fernando Oliveira Bussiman,
Fabyano Fonseca e Silva,
Roberto Carvalheiro,
Roberto Carvalheiro,
Luiz Fernando Brito,
Júlio César de Carvalho Balieiro,
Lucia Galvão Albuquerque,
Lucia Galvão Albuquerque,
Ricardo Vieira Ventura

Affiliations

Anderson Antonio Carvalho Alves: Department of Education, Federal Institute of Education, Science and Technology of Maranhão (IFMA), São Raimundo das Mangabeiras, Brazil
Lucas Tassoni Andrietta: Department of Animal Nutrition and Production, School of Veterinary Medicine and Animal Science, University of São Paulo, Pirassununga, Brazil
Rafael Zinni Lopes: Department of Animal Nutrition and Production, School of Veterinary Medicine and Animal Science, University of São Paulo, Pirassununga, Brazil
Fernando Oliveira Bussiman: Department of Animal Nutrition and Production, School of Veterinary Medicine and Animal Science, University of São Paulo, Pirassununga, Brazil
Fabyano Fonseca e Silva: Department of Animal Science, Federal University of Viçosa, Viçosa, Brazil
Roberto Carvalheiro: Department of Animal Science, School of Agricultural and Veterinary Sciences, Säo Paulo State University (UNESP), Jaboticabal, Brazil
Roberto Carvalheiro: National Council for Scientific and Technological Development (CNPq), Brasilia, Brazil
Luiz Fernando Brito: Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
Júlio César de Carvalho Balieiro: Department of Animal Nutrition and Production, School of Veterinary Medicine and Animal Science, University of São Paulo, Pirassununga, Brazil
Lucia Galvão Albuquerque: Department of Animal Science, School of Agricultural and Veterinary Sciences, Säo Paulo State University (UNESP), Jaboticabal, Brazil
Lucia Galvão Albuquerque: National Council for Scientific and Technological Development (CNPq), Brasilia, Brazil
Ricardo Vieira Ventura: Department of Animal Nutrition and Production, School of Veterinary Medicine and Animal Science, University of São Paulo, Pirassununga, Brazil

DOI: https://doi.org/10.3389/fanim.2021.681557
Journal volume & issue: Vol. 2

Abstract

Read online

This study focused on assessing the usefulness of using audio signal processing in the gaited horse industry. A total of 196 short-time audio files (4 s) were collected from video recordings of Brazilian gaited horses. These files were converted into waveform signals (196 samples by 80,000 columns) and divided into training (N = 164) and validation (N = 32) datasets. Twelve single-valued audio features were initially extracted to summarize the training data according to the gait patterns (Marcha Batida—MB and Marcha Picada—MP). After preliminary analyses, high-dimensional arrays of the Mel Frequency Cepstral Coefficients (MFCC), Onset Strength (OS), and Tempogram (TEMP) were extracted and used as input information in the classification algorithms. A principal component analysis (PCA) was performed using the 12 single-valued features set and each audio-feature dataset—AFD (MFCC, OS, and TEMP) for prior data visualization. Machine learning (random forest, RF; support vector machine, SVM) and deep learning (multilayer perceptron neural networks, MLP; convolution neural networks, CNN) algorithms were used to classify the gait types. A five-fold cross-validation scheme with 10 repetitions was employed for assessing the models' predictive performance. The classification performance across models and AFD was also validated with independent observations. The models and AFD were compared based on the classification accuracy (ACC), specificity (SPEC), sensitivity (SEN), and area under the curve (AUC). In the logistic regression analysis, five out of the 12 audio features extracted were significant (p < 0.05) between the gait types. ACC averages ranged from 0.806 to 0.932 for MFCC, from 0.758 to 0.948 for OS and, from 0.936 to 0.968 for TEMP. Overall, the TEMP dataset provided the best classification accuracies for all models. The most suitable method for audio-based horse gait pattern classification was CNN. Both cross and independent validation schemes confirmed that high values of ACC, SPEC, SEN, and AUC are expected for yet-to-be-observed labels, except for MFCC-based models, in which clear overfitting was observed. Using audio-generated data for describing gait phenotypes in Brazilian horses is a promising approach, as the two gait patterns were correctly distinguished. The highest classification performance was achieved by combining CNN and the rhythmic-descriptive AFD.

Published in Frontiers in Animal Science

ISSN: 2673-6225 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Agriculture: Animal culture: Veterinary medicine
Website: https://www.frontiersin.org/journals/animal-science

About the journal

Abstract

Keywords