SECTION III - SPORTS AND PHYSICAL ACTIVITY / RESEARCH PAPER
Predicting the Match Outcome in the 2023 FIFA Women’s World Cup and Analysis of Influential Features
 
1 United States Soccer Federation, Chicago, IL, United States.
2 CIDESD, Research Center in Sports Sciences, Health Sciences and Human Development, Department of Sport Sciences, University of Beira Interior, Covilhã, Portugal.
3 Gabbett Performance Solutions, Brisbane, QLD, Australia.
 
Submission date: 2024-04-23
 
 
Final revision date: 2024-06-25
 
 
Acceptance date: 2024-11-04
 
 
Online publication date: 2025-05-29
 
 
Corresponding author
José M. Oliva Lozano   

High Performance, United States Soccer Federation, Chicago, IL, United States
 
 
 
ABSTRACT
The aim of this study was to build an XGBoost model to predict the match outcome and to analyze the match-related technical, tactical and physical performance features that may influence the predicted outcome. This observational study followed a retrospective design. The FIFA post-match summary reports were downloaded at the end of the 2023 Women’s World Cup and used to create a dataset of match-related technical, tactical and physical performance variables. An XGBoost model was then built to predict the match outcome and to investigate which performance features might influence the prediction. Overall, the model achieved an accuracy of 0.58 ± 0.05. Losses and wins were predicted with similar accuracy (0.67 ± 0.06 and 0.67 ± 0.08, respectively), but the prediction of draws was significantly worse, with an accuracy of 0.32 ± 0.16. The top ten features for predicting wins were: (1) out to in actions by the opponent, (2) attempts at the goal, (3) in-behind actions, (4) interceptions by the opponent, (5) loose ball receptions, (6) sprinting per minute by the opponent, (7) offers received by the opponent, (8) in-front opponent, (9) interceptions, and (10) total distance per minute. The top ten features for predicting losses were: (1) attempts at the goal by the opponent, (2) interceptions, (3) out to in actions, (4) possessions interrupted, (5) loose ball receptions by the opponent, (6) in front movements, (7) distance covered by the opponent, (8) in-behind actions by the opponent, (9) total distance, and (10) sprinting per minute. In conclusion, this is the first study to use an XGBoost model both to successfully predict wins and losses at the FIFA Women’s World Cup and to explain which features significantly influence the prediction. This study may serve as a guide for practitioners regarding the use and application of XGBoost models in high performance.
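The overall and per-outcome accuracies above are reported as mean ± standard deviation. The following minimal sketch shows one way such figures could be computed. It assumes, since the abstract does not state it, that the values come from repeated evaluation folds and that per-outcome accuracy means the share of matches with a given true outcome that were predicted correctly. The function names and the toy labels are hypothetical and are not the study's data.

```python
from statistics import mean, stdev

LABELS = ("loss", "draw", "win")


def overall_accuracy(y_true, y_pred):
    """Share of all matches whose outcome was predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_accuracy(y_true, y_pred, labels=LABELS):
    """For each true outcome, the share of those matches predicted correctly."""
    result = {}
    for label in labels:
        idx = [i for i, t in enumerate(y_true) if t == label]
        if idx:  # skip outcomes that do not occur in this fold
            result[label] = sum(y_pred[i] == label for i in idx) / len(idx)
    return result


def summarize(folds):
    """Return {name: (mean, sd)} over a list of (y_true, y_pred) folds."""
    scores = {"overall": []}
    for y_true, y_pred in folds:
        scores["overall"].append(overall_accuracy(y_true, y_pred))
        for label, acc in per_class_accuracy(y_true, y_pred).items():
            scores.setdefault(label, []).append(acc)
    return {
        name: (mean(vals), stdev(vals) if len(vals) > 1 else 0.0)
        for name, vals in scores.items()
    }


# Toy labels for illustration only (hypothetical, not from the study).
truth = ["win", "win", "loss", "loss", "draw", "draw"]
folds = [
    (truth, ["win", "win", "loss", "win", "loss", "draw"]),
    (truth, ["win", "loss", "loss", "loss", "win", "loss"]),
]

summary = summarize(folds)
for name, (m, sd) in summary.items():
    print(f"{name}: {m:.2f} ± {sd:.2f}")
```

Reporting accuracy per outcome in this way makes it visible when one class, here draws, is predicted much less reliably than the others, which a single overall accuracy would hide.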
eISSN:1899-7562
ISSN:1640-5544