Machine Learning-Based Identification of the Strongest Predictive Variables of Winning and Losing in Belgian Professional Soccer

This study aimed to identify the strongest predictive variables of winning and losing in the highest Belgian soccer division. A predictive machine learning model based on a broad range of variables (n = 100) was constructed, using a dataset consisting of 576 games. To avoid multicollinearity and reduce dimensionality, Variance Inflation Factor (threshold of 5) and BorutaShap were respectively applied. A total of 13 variables remained and were used to predict winning or losing using Extreme Gradient Boosting. TreeExplainer was applied to determine feature importance on a global and local level.... Mehr ...

Verfasser: Youri Geurkink
Jan Boone
Steven Verstockt
Jan G. Bourgois
Dokumenttyp: Artikel
Erscheinungsdatum: 2021
Reihe/Periodikum: Applied Sciences, Vol 11, Iss 5, p 2378 (2021)
Verlag/Hrsg.: MDPI AG
Schlagwörter: association football / performance / performance analysis / KPI / game result / Technology / T / Engineering (General). Civil engineering (General) / TA1-2040 / Biology (General) / QH301-705.5 / Physics / QC1-999 / Chemistry / QD1-999
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-26523998
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://doi.org/10.3390/app11052378