Use of Machine Learning and Deep Learning to Predict the Outcomes of Major League Baseball Matches

Mei-Ling Huang; Yun-Zhi Li

doi:10.3390/app11104499

Applied Sciences (May 2021)

Use of Machine Learning and Deep Learning to Predict the Outcomes of Major League Baseball Matches

Mei-Ling Huang,
Yun-Zhi Li

Affiliations

Mei-Ling Huang: Department of Industrial Engineering and Management, National Chin-Yi University of Technology, Taichung 41170, Taiwan
Yun-Zhi Li: Department of Industrial Engineering and Management, National Chin-Yi University of Technology, Taichung 41170, Taiwan

DOI: https://doi.org/10.3390/app11104499
Journal volume & issue: Vol. 11, no. 10
p. 4499

Abstract

Read online

Major League Baseball (MLB) is the highest level of professional baseball in the world and accounts for some of the most popular international sporting events. Many scholars have conducted research on predicting the outcome of MLB matches. The accuracy in predicting the results of baseball games is low. Therefore, deep learning and machine learning methods were used to build models for predicting the outcomes (win/loss) of MLB matches and investigate the differences between the models in terms of their performance. The match data of 30 teams during the 2019 MLB season with only the starting pitcher or with all pitchers in the pitcher category were collected to compare the prediction accuracy. A one-dimensional convolutional neural network (1DCNN), a traditional machine learning artificial neural network (ANN), and a support vector machine (SVM) were used to predict match outcomes with fivefold cross-validation to evaluate model performance. The highest prediction accuracies were 93.4%, 93.91%, and 93.90% with the 1DCNN, ANN, SVM models, respectively, before feature selection; after feature selection, the highest accuracies obtained were 94.18% and 94.16% with the ANN and SVM models, respectively. The prediction results obtained with the three models were similar, and the prediction accuracies were much higher than those obtained in related studies. Moreover, a 1DCNN was used for the first time for predicting the outcome of MLB matches, and it achieved a prediction accuracy similar to that achieved by machine learning methods.

Published in Applied Sciences

ISSN: 2076-3417 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Engineering (General). Civil engineering (General); Science: Biology (General); Science: Physics; Science: Chemistry
Website: http://www.mdpi.com/journal/applsci

About the journal

Abstract

Keywords