TY - JOUR
T1 - Classifying Soccer Players Based on Physical Capacities and Match-Specific Running Performance Using Machine Learning
AU - de Haan, Michel
AU - van der Zwaard, Stephan
AU - Sanders, Jurrit
AU - Beek, Peter J.
AU - Jaspers, Richard T.
N1 - Publisher Copyright:
© Journal of Sports Science and Medicine.
PY - 2025
Y1 - 2025
N2 - Sprint and endurance capacities seem to be mutually exclusive or at least at odds with each other. However, this relationship has not been investigated in soccer, which appeals to both well-developed sprint and endurance capacities. This study explores the potential of machine learning to identify soccer players based on their unique combinations of sprint and endurance capacities and sprint and endurance match-specific running performance. In this context, the relationships between sprint and endurance capacities and between physical capacities and match-specific running performance are examined in detail. Match-specific running data were collected from 31 young elite male soccer players over two consecutive seasons. Additionally, these participants underwent exercise testing, consisting of a 20-meter sprint test and an incremental treadmill test to measure maximal oxygen uptake (V̇ O2max). Subgroups were identified using k-means clustering and subgroup discovery, based on players’ sprint and endurance capacities, sprint and endurance match-specific running performance, and playing position. Three distinct subgroups were identified using machine learning: players with high sprint capacity and sprinted meters (n = 4), players with high endurance capacity and meters ran at moderate and high intensities (n = 6), and players without high physical capacities or matching match-specific running performance (n = 14). Across all players, there was no significant relationship between 20-meter sprint speed and normalized V̇ O2max (R2 = 0.085, P = 0.17), although 20-meter sprint speed was positively related to average match sprint distance (R2 = 0.168, P = 0.03) and normalized V̇ O2max to average match distance at moderate and high intensities (R2 = 0.151, P = 0.04). In young elite soccer players, sprint and endurance capacities show positive, moderate, relationships with corresponding match-specific running performance, but those capacities do not appear to be mutually exclusive or opposing. Clustering allows for identification of players who may benefit from alternative strategic roles during matches, are at risk of overuse, or could benefit from individualized training. This method can assist coaches in designing tailored training programs and optimizing overall match strategy.
AB - Sprint and endurance capacities seem to be mutually exclusive or at least at odds with each other. However, this relationship has not been investigated in soccer, which appeals to both well-developed sprint and endurance capacities. This study explores the potential of machine learning to identify soccer players based on their unique combinations of sprint and endurance capacities and sprint and endurance match-specific running performance. In this context, the relationships between sprint and endurance capacities and between physical capacities and match-specific running performance are examined in detail. Match-specific running data were collected from 31 young elite male soccer players over two consecutive seasons. Additionally, these participants underwent exercise testing, consisting of a 20-meter sprint test and an incremental treadmill test to measure maximal oxygen uptake (V̇ O2max). Subgroups were identified using k-means clustering and subgroup discovery, based on players’ sprint and endurance capacities, sprint and endurance match-specific running performance, and playing position. Three distinct subgroups were identified using machine learning: players with high sprint capacity and sprinted meters (n = 4), players with high endurance capacity and meters ran at moderate and high intensities (n = 6), and players without high physical capacities or matching match-specific running performance (n = 14). Across all players, there was no significant relationship between 20-meter sprint speed and normalized V̇ O2max (R2 = 0.085, P = 0.17), although 20-meter sprint speed was positively related to average match sprint distance (R2 = 0.168, P = 0.03) and normalized V̇ O2max to average match distance at moderate and high intensities (R2 = 0.151, P = 0.04). In young elite soccer players, sprint and endurance capacities show positive, moderate, relationships with corresponding match-specific running performance, but those capacities do not appear to be mutually exclusive or opposing. Clustering allows for identification of players who may benefit from alternative strategic roles during matches, are at risk of overuse, or could benefit from individualized training. This method can assist coaches in designing tailored training programs and optimizing overall match strategy.
KW - Clustering
KW - football
KW - MAS
KW - MSS
KW - sprint speed
KW - V̇ O2max
UR - https://www.scopus.com/pages/publications/105021200334
UR - https://www.scopus.com/inward/citedby.url?scp=105021200334&partnerID=8YFLogxK
U2 - 10.52082/jssm.2025.764
DO - 10.52082/jssm.2025.764
M3 - Article
C2 - 41210088
AN - SCOPUS:105021200334
SN - 1303-2968
VL - 24
SP - 764
EP - 778
JO - Journal of Sports Science & Medicine
JF - Journal of Sports Science & Medicine
IS - 4
ER -