The Impact of Formations on Football Matches Using Double Machine Learning. Is it worth parking the bus?
This study addresses a central tactical dilemma for football coaches: whether to employ a defensive strategy, colloquially known as "parking the bus", or a more offensive one. Using an advanced Double Machine Learning (DML) framework, this project pr…
Authors: Refer to original PDF
The Impact of F ormations on F o otball Matc hes Using Double Mac hine Learning. Is it w orth parking the bus? Gen ´ ıs Ruiz-Men´ arguez 1 and Lloren¸ c Badiella 1 1 Dep artament de Matem` atiques, Universitat A ut` onoma de Bar c elona, Sp ain Abstract This study addresses a cen tral tactical dilemma for fo otball coaches: whether to emplo y a defensiv e strategy , collo quially known as ”parking the bus,” or a more offensive one. Using an adv anced Double Mac hine Learning (DML) framework, this pro ject provides a robust and in ter- pretable to ol to estimate the causal impact of different formations on key match outcomes such as goal difference, p ossession, corners, and disciplinary actions. Lev eraging a dataset of ov er 22,000 matc hes from top Europ ean leagues, formations were categorized in to six represen tative t yp es based on tactical structure and exp ert consultation. A ma jor metho dological contribution lies in the adaptation of DML to handle categorical treatments, sp ecifically formation com bina- tions, through a no vel matrix-based residualization process—allowing for a detailed estimation of formation-v ersus-formation effects that can inform a coac h’s tactical decision-making. Results sho w that while offensive formations lik e 4-3-3 and 4-2-3-1 offer modest statistical adv antages in p ossession and corners, their impact on goals is limited. F urthermore, no evidence supp orts the idea that defensiv e formations—commonly asso ciated with parking the bus—increase a team’s winning potential. Additionally , red cards app ear unaffected b y formation choice, suggest- ing other b eha vioral factors dominate. Although this approach do es not fully capture all asp ects of pla ying st yle or team strength, it provides a v aluable framework for coaches to analyze tactical efficiency and sets a precedent for future research in sp orts analytics. Keyw ords: F ootball, F ormation, Double Mac hine Learning, Causal Inference, Confounding, XG- Bo ost 1 In tro duction F ootball coac hes face a critical decision b efore each matc h: which formation to implemen t to maximize their team’s performance. The organization of the eleven starting play ers into sp ecific structural patterns is a foundational element of tactical strategy . This pro ject aims to pro vide data-driv en insigh ts for this decision b y analyzing whether certain defensiv e formations, commonly referred to as ”parking the bus” (1), are truly effective in specific scenarios. Jos´ e Mourinho, former Chelsea F C manager, p opularized this term to describ e a riv al’s excessively defensiv e approach. 1 These defensive strategies are often employ ed against stronger opp onen ts (2, 3); yet their actual effectiv eness remains a sub ject of debate. This study addresses this debate directly for the benefit of coac hes and tactical analysts. W e examine the direct relationship b et w een formations and several k ey matc h outcomes, mo ving b ey ond anecdotal evidence to provide a robust statistical analysis. After generalizing and grouping formations based on tactical similarity and exp ert criteria, a Double Mac hine Learning framework is applied to estimate the isolated impact of each formation com bination on v arious v ariables, such as goal difference. While formations ma y not b e the sole predictor of match statistics, understanding whether sp ecific systems consisten tly enhance or hinder k ey p erformance outcomes is in v aluable for strategic planning. Ultimately , the ob jectiv e is to assess the isolated impact of formations themselves, providing coac hes with a clear, un biased lo ok at how tactical systems, indep enden t of individual team quality , influence matc h results. 2 Data Pro cessing Matc h data for this study were obtained from the Sp ortmonks API (4), a commercial pro vider of comprehensiv e fo otball data. The dataset comprises o ver 22,000 professional league fixtures and includes a wide range of v ariables critical for our analysis, suc h as match statistics, team formations, and con textual information. T o ensure the dataset w as robust and represen tative of mo dern football tactics, sp ecific criteria w ere applied for data inclusion. The selection of seasons and leagues was guided by tw o primary considerations: regulatory consistency and tactical diversit y . 1. Season Selection: The analysis includes seasons from 2018–2019 to 2024–2025. This broad temp oral scop e w as chosen to ensure a sufficient data volume for robust statistical mo del- ing. While the ma jority of this p eriod reflects the mo dern tactical environmen t shap ed by the ”fiv e substitutions” rule, seasons prior to this regulatory c hange (2018–2019, 2019–2020, and 2020–2021) were also included to significantly expand the dataset’s size and statistical p o wer. The effects of this regulatory change are implicitly captured by the inclusion of b oth league and season within the confounder set, whic h accoun ts for temp oral and league-sp ecific v ariations that ma y b e correlated with the rule change. 2. League Selection: T o create a geographically and tactically div erse dataset, matc hes from the seven highest-ranked Europ ean men’s fo otball leagues according to UEF A co efficien ts w ere included: the first divisions of England, Italy , Spain, German y , F rance, the Netherlands, and P ortugal. T o further enhance the statistical p o wer of the mo dels, additional leagues w ere selected based on tw o criteria: a minim um of 32 regular-season matc hes and sufficient data a v ailabilit y from the API. This led to the inclusion of the top divisions of T urk ey , Belgium, and P oland, as well as the second divisions of Spain, Italy , and England. 2 2.1 Confounding V ariables The primary ob jective of this analysis is to isolate the in trinsic relationship b et ween a team’s formation and a riv al’s formation with resp ect to v arious match outcomes. T o achiev e this, a set of confounding v ariables has b een selected. These v ariables are chosen to control for factors that influence b oth team strength and match outcomes but are not a direct result of the c hosen formation. Consequen tly , mediator v ariables such as accumulated goals or possession, whic h are themselves outcome s of a team’s pla ying style, will b e inten tionally excluded from the mo del. The follo wing v ariables are used as confounders: • Matc h Con text: The season, league, or day of the week of the fixture. • T eam Side: A binary indicator for whether the main team pla yed at home or aw a y . • T eam Strength Metrics: – The rate of accum ulated p oin ts for b oth teams prior to the match. – A binary flag indicating whether eac h team w as sim ultaneously comp eting in the UEF A Champions League. – The league ranking of eac h team at the time of the fixture. – The winning streak from last matc hes. • En vironmen tal F actors: W eather data for the match v enue. The final dataset included additional k ey v ariables for each fixture which are used as mediators, treatmen t v ariables or target v ariables at some p oin t of the analysis: • P erformance Metrics: Goals, corners, p ossession, and disciplinary actions (red and yello w cards) for b oth teams. • T actical Information: The formation system used b y b oth teams (e.g., 4-3-3). • Con textual V ariables: In-matc h even ts (e.g., p enalties, substitutions), lineups, and weather data (e.g., temp erature, humidit y). V ariables that could act as mediators b et ween a formation c hoice and the outcome—such as in- game substitutions or team fouls—were inten tionally excluded from the set of confounders. This distinction is crucial to ensure that statistical mo del isolates the pure direct effect of the formation, rather than capturing indirect effects. While these v ariables are not used as confounders, some, suc h as p ossession and disciplinary actions, serve as secondary target v ariables for our analysis. The ra w dataset underw ent several cleaning and filtering steps to ensure data qualit y and method- ological consistency . Specific seasons from the Belgian Pro League and T urkish S¨ up er Lig w ere ex- cluded from the analysis due to incomplete matc h data, which w ould hav e compromised the integrit y of feature engineering. 3 T o fo cus on a consistent comp etitiv e en vironment, non-regular league fixtures, such as pla y-offs and play-outs, w ere remo ved from the dataset. F urthermore, to mitigate p oten tial biases from early- season instability and late-season goal achiev emen t effects, matc hes from the first t wo and final four rounds of each season were also discarded. This ensures that the analysis is based on matc hes where teams are at a comparable lev el of statistical p erformance and motiv ation. T o prop erly con trol for team quality and contextual factors, several v ariables w ere engineered to quan tify team strength at the time of eac h fixture. These v ariables served as potential confounders in the relationship b et w een treatmen t and outcome v ariables. Betting o dds data w ere inten tionally excluded from this set, as suc h information ma y implicitly incorp orate factors related to formation c hoice, p oten tially introducing a source of endogeneity . The engineered team strength v ariables include: • Accum ulated P oints: A numerical v ariable representing the ratio of p oin ts earned out of the total p ossible in prior matches, computed separately for home and aw a y p erformances. • Champions League Flag: A binary v ariable indicating whether the home or a wa y team was comp eting in the UEF A Champions League at the time of the fixture. • T eam Ranking: An in teger v ariable reflecting the league p osition of each team, with the same rank assigned to teams with equal p oin ts. Consisten t with the study’s ob jective to isolate the pure causal effect of formations, v ariables that are structurally related to formations which are considered mediators -suc h as accumulated goals or fouls- were not included in the confounder set. This careful v ariable selection is crucial to preven t the mo del from capturing indirect effects and ensures that the estimated causal impact is a true reflection of formation c hoice itself. 3 Metho dology In order to estimate formations’ impact on fo otball match outcomes w e propose to use a Double Mac hine Learning approach (5), that will remov e the effect of confounders. F or the DML 1st step consider the following linear mo del: Y = D β + X γ + ε • Y : outcome v ariable (e.g., goal difference) • D : treatmen t v ariable (e.g., formation combination of b oth teams) • X : matrix of control v ariables (confounders), • β : causal parameter of interest, 4 • γ : nuisance parameters, • ε : error term. 3.1 Double Machine Learning Double Machine Learning states that the effect of a treatmen t v ariable D on an outcome Y , after accoun ting for con trol v ariables X , can b e obtained by first removing from b oth the treatment and the outcome the v ariation explained by the confounding v ariables, and then regressing the residuals of the outcome on the residuals of the treatment. The resulting co efficien t corresp onds to the partial effect β of the treatment that is orthogonal to the confounding v ariables. F unction ˆ f ( X ) is defined as the Mac hine Learning estimator of Y on X , and ˆ g ( X ) as the estimator of D on X . Both ˆ f ( X ) and ˆ g ( X ) can b e adaptive to high dimensional spaces, and capture nonlinearities, in teractions, and complex dep endencies automatically . The final stage is obtained isolating the ˆ β estimator from the 3rd step residualization pro cess. Through sample splitting and cross-fitting, this mo dification main tains orthogonality and robust- ness while allo wing the estimation of treatmen t effects in high-dimensional or nonlinear en vironments. Some examples of this extension include using b oosting or decision trees mo dels as a substitution of traditional linear mo dels. 3.2 New Approach for DML with categorical treatment Applying categorical treatment in the second DML step enables us to fully capture the partial effect of the target v ariable on eac h differen t treatment v ariable. This approac h is highly effectiv e when dealing with m ulticategorical and indep enden t treatment v ariables. T o consider eac h formation com bination in the v ariable D , dummy v ariables for k 2 − 1 unique formation com binations will b e generated, where k is the num b er of differen t formations and it is equal for b oth main and riv al team. The goal is to iden tify which formations are prone to contribute to a higher v alue in the target v ariable (e.g. goals difference), meaning winning by a larger margin in the case of goals. T o do so, each treatmen t v ariable vector D i,j corresp onds to the combination of the main team formation i and riv al team formation j . Therefore, the resulting mo dels can b e expressed in a t wo-dimensional matrix. D i,j = α i,j X + ε D i,j Where, • D i,j : is a dummy vector which con tains 1 for the combination of main team formation i and riv al team formation j row, -1 for the unused formation row ( D k 2 ) and 0 otherwise. • α i,j is a v ector of co efficien ts corresp onding to each v ariable in the X matrix. 5 Note that the D k,k formation com bination is ommitted, as it is not used to av oid m ulticollinearit y . Nev ertheless, this last formation data can b e deduced from the other ones . This enco ding scheme enables us to gather the true effect of each formation combination D i,j on the target v ariable Y . (See effect co ding (6)). r D i,j = D i,j − ˆ D i,j Where r D i,j is the residual of the treatmen t v ariable D i,j prediction. Consequen tly , the third mo del is computed: r Y = β 1 , 1 r D 1 , 1 + · · · + β 1 ,k r D 1 ,k + · · · + β k,k − 1 r D k,k − 1 + ϵ The greater the β i,j is, the greater p ositiv e impact it will hav e on the target v ariable Y . The estimator ˆ β k,k can b e obtained as: ˆ β k,k = − ( ˆ β 1 , 1 + · · · + ˆ β 1 ,k + · · · + ˆ β k,k − 1 ) Since β k,k b elongs to the diagonal estimators, β k,k = 0. 3.2.1 Categorical T reatment Prop erties By applying this new approac h, the treatment D is orthogonalized on the confounders X . After regressing each dumm y D i,j on X and taking residuals r D i,j , eac h residual r D i is orthogonal to the con trols X . As any correlation b et ween D i,j and Y that was due to X has b een purged, the remaining correlation b et ween r D i,j and r Y also captures the true causal effect. F or every dumm y v ariable D i,j , a separate orthogonal moment condition is defined after residualization. Therefore, the orthogonalit y condition required by DML holds for every formation combination separately and the original framew ork prop erties remain v alid. Ho wev er, this new approach leads to a differen t in terpretation of the parameter β . Eac h coefficient β i,j estimates the causal effect on goal difference of choosing the main team formation i given the riv al team formation j in comparison to the omitted combination D k,k . 3.3 Pro ject Metho dology T o address the formation impact analysis, the standard DML framework w as adapted. The core of our approac h inv olv es a tw o-stage residualization pro cess, which can b e summarized as follows: 1. First-Stage Residualization: A machine learning mo del, sp ecifically an X GBo ost Regressor (7), is used to predict the target v ariable (e.g., goal difference) based on the set of confounding v ariables. The resulting residual, denoted as r Y , represen ts the component of the target v ariable not explained b y these confounders. 6 2. T reatmen t Model Residualization: Similarly , an XGBoost Regressor is trained to predict the treatmen t v ariable (formation com bination) using the same set of confounders. T o apply this to our categorical treatment, a separate regression mo del is computed for each formation com bination. The resulting residuals, r D , represen t the v ariation in the treatmen t not explained b y the confounders. 3. Causal Parameter Estimation: Finally , the causal effect of formations is estimated by a simple linear regression of the target residuals ( r Y ) on the treatment residuals ( r D ). The co efficien ts from this final regression, ˆ β i,j , provide the unbiased causal estimates of the effect of eac h formation combination on the target v ariable. 4 F ormation Analysis Results This approach consists of analyzing the impact of differen t tactical strategies on match outcomes. T o do so, formations from the dataset w ere group ed into six distinct categories, sorted from most defensiv e to most offensive. 4.1 F ormation T reatment F ormations are represen ted by a sequence of n umbers, sp ecifying the distribution of outfield play ers across the defensiv e, midfield, and offensive lines. Since the goalkeeper is not counted, the num b ers sum to ten. Initially , the dataset contained 28 distinct formations. T o enable robust statistical analysis and mitigate the issues of data sparsity and lack of statistical p ow er that would result from an exces- siv ely large treatment matrix, these formations were group ed into six representativ e categories. This pro cess was conducted based on exp ert consultation and a tactical similarity criterion, ensuring that formations with a similar structural foundation w ere aggregated. F or example, formations such as 4-3-3 and its v ariations (e.g., 4-3-1-2) w ere categorized together as they share a similar core structure. This grouping approach addresses the issue of differen t represen tations of the same underlying tactical system b eing treated as distinct entities. The six resulting formations, sorted from the most defensive to the most offensive one, and their prop ortion in the dataset are shown in T able 1. T able 1. F ormation p ercen tages across different leagues League 5-4-1 4-4-2 3-5-2 4-2-3-1 4-3-3 3-4-3 La Liga 8.5% 32.57% 10.79% 22.72% 20.64% 4.78% Premier League 7.75% 15.32% 9.26% 31.23% 24.43% 12.01% Serie A 1.38% 6.16% 33.24% 18.54% 25.45% 15.23% Bundesliga 4.48% 13.9% 21.73% 26.68% 13.41% 19.81% Ligue 1 8.72% 14.32% 15.65% 26.96% 20.17% 14.18% 7 The most frequently used formations across all leagues are the 4-2-3-1 and 4-3-3. F urthermore, analysis confirmed that the prop ortion of formations used by home and aw a y teams is nearly identical, whic h justifies a simplified mo deling approach for the formation treatment. Find b elo w some fixture statistics on a v erage group ed by formation and still ordered from most defensiv e to most offensive ones. T able 2. Av erage Match Statistics by F ormation across All Leagues F ormation 5-4-1 4-4-2 3-5-2 4-2-3-1 4-3-3 3-4-3 Goals 1.236 1.249 1.268 1.392 1.437 1.341 Red Cards 0.110 0.114 0.104 0.106 0.103 0.098 Y ellow Cards 2.169 2.246 2.215 2.110 2.075 2.188 P ossession 48.98% 48.62% 48.24% 50.72% 51.63% 49.80% Corners 4.751 4.719 4.754 5.004 5.107 4.854 Although some formations, such as the 4-2-3-1 and 4-3-3, are asso ciated with higher av erage goals and p ossession, these preliminary correlations are not sufficient for a v alid causal inference. The ob- serv ed statistical relationships are lik ely a result of confounding factors and complex in teractions. The nominal formation used is a simplification of a team’s dynamic tactical structure, which can change m ultiple times during a matc h due to in-game adaptations, substitutions, and opponent resp onses. F urthermore, a single formation can accommo date a wide v ariety of playing styles, which are not captured b y a simple numerical represen tation. Therefore, it is crucial to emplo y a robust causal inference framew ork to isolate the true effect of a formation from these confounding v ariables. 4.2 Mo del P erformance Fiv e separate XGBoost Regressor mo dels were trained, each corresp onding to a different target v ari- able. The hyperparameter tuning for each mo del was p erformed aiming to maximize the negative mean squared error (-MSE). The MSE is the primary metric for ev aluating the predictive p erformance of the first-stage regressions, as a v alue of zero corresp onds to a p erfect fit. A critical hyperparameter choice was setting the upp er limit for max depth to 5. This constraint limits the complexity of each individual tree, which reduces the model’s susceptibility to o verfit- ting and helps ensure a low er v ariance in the predictions. Consequently , this approach also helps main tain symmetry in the final ˆ β estimations by preven ting the mo del from making o verly sp ecific appro ximations. The predictive p erformance of each mo del, ev aluated on a held-out test set, is summarized in the table b elo w. The R 2 metric is pro vided to quantify the prop ortion of v ariance in each target v ariable explained b y the set of confounders. 8 T able 3. Predictiv e Performance Metrics for First-Stage XGBoost Mo dels Statistic MSE R 2 Goals 2.70 0.118 Red Cards 0.19 0.003 Y ellow Cards 3.18 0.030 P ossession 343.35 0.298 Corners 7.98 0.781 As the table shows, the mean squared error v alues are not directly comparable across mo dels due to the differing units and scales of the target v ariables (e.g., possession is a p ercen tage, while red cards are discrete coun ts). The mo del predicting corners exhibits the highest R 2 v alue, indicating it captures the largest prop ortion of v ariance among the five targets. It is imp ortant to note that maximizing predictiv e accuracy was not the primary goal of this study . The first-stage mo dels serv e as a means to an end, sp ecifically to p erform the residualization necessary for obtaining the DML-estimated ˆ β i,j co efficien ts. Giv en the high degree of unpredictability inheren t in fo otball, R 2 v alues for many match outcomes are exp ected to b e low. 4.3 F ormation Matrix This DML approac h enabled the computation of sev eral causal analyses regarding the impact of formation com binations on v arious matc h outcomes. T o interpret the resulting matrices of estimators, it is imp ortan t to recall some k ey theoretical prop erties of the framework: • The diagonal of the matrix represents a formation’s effect against itself, and th us, its v alue is defined as zero. • The matrix is theoretically symmetrical, with ˆ β i,j = − ˆ β j,i . Any minor deviations from p erfect symmetry are attributed to the appro ximations made by the machine learning mo dels. The statistical significance of the estimators is determined b y p-v alues, { ∗∗∗ p < 0 . 001 , ∗∗ p < 0 . 01 , ∗ p < 0 . 05 , ns p > = 0 . 05 } . Cells with a p-v alue b elow 0.05 are considered statistically significan t, indicating a rejection of the n ull hypothesis that the ˆ β i,j estimator is equal to zero. The following matrices presen t the pure causal effect betw een formation com binations for a sp ecific target v ariable. T o provide a more complete tactical context, these estimations can b e adjusted for the effects of pla ying home or aw ay . The av erage effect of the target v ariable for a home team is estimated and added to the causal co efficien t, providing a close approximation of the total side effect. This calculation is represen ted as: ˆ β side i,j = ˆ β i,j + E ( ˆ Y home ) where E ( ˆ Y home ) is the estimated av erage effect on the target v ariable for the home team (e.g., goals difference). 9 4.3.1 T arget V ariable 1: Goals Difference The resulting ˆ β i,j estimators for goals difference as the target v ariable are: Figure 1. ˆ β i,j co efficien ts and p-v alues for goals difference The analysis of the estimators rev eals three statistically significant ˆ β i,j co efficien ts, eac h with a p-v alue b elo w the 0.05 threshold. These significant estimators are: • 4-2-3-1 vs. 3-5-2: This combination pro vides an estimated adv an tage of 0.16 goals for the 4-2-3-1 formation ( p < 0 . 001), suggesting strong statistical significance. • 4-3-3 vs. 5-4-1: This combination sho ws an adv antage of 0.17 goals for the 4-3-3 formation ( p < 0 . 05). • 4-3-3 vs. 4-4-2: This combination results in a 0.11 goal adv antage for the 4-3-3 formation ( p < 0 . 05). All other estimated co efficien ts for goal difference are not statistically significan t and, thus, can b e considered zero. T o provide a more comprehensive tactical con text, the pure formation effect can be combined with the well-established home-field adv an tage. The av erage home-team effect, denoted as E ( ˆ Y home ), is estimated to b e 0.285 goals when the target v ariable is goals difference. Consequently , the total 10 estimated adv antage of a 4-2-3-1 formation playing at home against a 3-5-2 would b e approximately +0.445 goals (0 . 16 + 0 . 285). The finding that only a small num b er of estimators are significan t suggests that formation c hoice alone do es not hav e a ma jor isolated impact on goal difference. 4.3.2 T arget v ariable 2: Red Cards difference The DML mo del was also applied to the difference in red cards betw een the main and riv al teams, considering b oth direct red cards and those resulting from a second y ellow card. In this analysis, the data w ere not stratified by home or aw ay team, as the goal was to assess the pure, unadjusted causal effect of eac h formation combination on the num b er of red cards. Figure 2. ˆ β i,j co efficien ts and p-v alues for red cards difference The results indicate that there are no statistically significant estimators for the red cards difference, as all p-v alues are greater than the 0.05 threshold. Consequen tly , w e cannot reject the n ull h yp othesis, and the estimated co efficien ts can b e considered zero ( ˆ β i,j = 0 ∀ i, j ). This lack of significance suggests that red cards are not causally influenced by the sp ecific formation com binations, particularly after controlling for confounding factors suc h as team strength and general matc h conditions. This finding aligns with the understanding that red cards are often a result of random, unpredictable match ev en ts and are more closely linked to a pla y er’s individual intensit y and b eha vior, whic h are difficult to quantify . 11 4.3.3 T arget V ariable 3: Y ellow Cards Difference The DML mo del was also applied to the difference in yello w cards b et ween the main and riv al teams, with the goal of assessing the pure causal effect of each formation com bination on this outcome. This analysis did not stratify the data b y home or aw a y team. Figure 3. ˆ β i,j co efficien ts and p-v alues for yello w cards difference The results rev eal three statistically significan t pairs of co efficien ts. As the matrix is symmetrical ( ˆ β i,j = − ˆ β j,i ), these represen t three unique formation combinations: • 5-4-1 vs. 4-2-3-1: The 5-4-1 formation is asso ciated with an estimated increase of 0.16 yello w cards on av erage ( p < 0 . 01). This suggests that a team in a 5-4-1 formation would receiv e appro ximately one additional yello w card ev ery six matc hes when facing a team in a 4-2-3-1 formation. • 3-5-2 vs. 4-2-3-1: The 3-5-2 formation is asso ciated with a larger estimated increase of 0.17 y ellow cards ( p < 0 . 001), indicating a strong statistical significance. • 3-5-2 vs. 4-3-3: This com bination results in a 0.16 y ello w card increase for the 3-5-2 formation ( p < 0 . 01). 12 These findings suggest that defensive formations incur a subtle increase in y ello w card penalties when compared to more offensiv e ones. How ev er, as noted b y Badiella Busquets (8), disciplinary actions suc h as y ellow cards are also heavily influenced by individual play er in tensity , which is not captured b y these mo dels. 4.3.4 T arget v ariable 4: Possession The DML mo del was also applied to analyze the causal impact of formation combinations on p osses- sion. The h yp othesis is that a team’s tactical structure significan tly influences its ability to control the ball during a matc h. Figure 4. ˆ β i,j co efficien ts and p-v alues for p ossession difference The results pro vide v aluable insights into the intrinsic b eha vior of formations with respect to p ossession. The matrix rev eals that 13 of the 18 unique formation com binations show statistically significan t co efficien ts, with most having a p-v alue of < 0 . 001. The matrix can b e divided into t wo key regions: a top-righ t blue region and a bottom-left red region, whic h represent offensive-defensiv e match ups. • Offensiv e vs. Defensive Match ups (T op-righ t): This region clearly sho ws that defen- siv e formations tend to hav e less con trol o ver p ossession. The estimated p ossession difference 13 b et w een defensive formations (5-4-1, 4-4-2, 3-5-2) and offensiv e ones (4-2-3-1, 4-3-3, 3-4-3) is consisten tly significan t, with all p-v alues less than 0.001. This indicates that formations with more midfielders and few er defenders are more effective at retaining p ossession. • Defensiv e vs. Offensiv e Match ups (Bottom-left): This region is a symmetrical reflection of the top-righ t, confirming that offensive formations hav e greater ball con trol when facing defensiv e ones. Ov erall, the 4-3-3 formation demonstrates the strongest p ositiv e effect on ball control when com- pared to other offensive formations. Interestingly , the 3-4-3, despite b eing highly offensive, shows less ball dominance than the 4-2-3-1, whic h could be due to a more direct, v ertical playing style with few er play ers in midfield to calmly circulate the ball. 4.3.5 T arget v ariable 5: Corners Finally , the influence of formations on corner kicks was also analyzed. The h yp othesis is that forma- tions with higher p ossession and a more offensiv e structure will generate more scoring opp ortunities, consequen tly leading to a higher num b er of corners. Figure 5. ˆ β i,j co efficien ts and p-v alues for corners difference The matrix of co efficien ts reveals that offensive formations hav e a statistically significant p ositiv e effect on corner kicks when facing defensive ones. This aligns with the findings from the possession 14 analysis, as teams with more ball con trol and a more aggressiv e tactical setup create more attacking opp ortunities. F or example, the 4-2-3-1, the most common formation in our dataset, shows a p ositiv e estimated effect on corners when paired against defensiv e systems lik e the 5-4-1 (+0.19), 4-4-2 (+0.18), and 3-5-2 (+0.23). While these effects are statistically significant, their practical magnitude is small. Giv en that the a verage num b er of corners p er team is around five p er match, the estimated c hanges are not substan tial enough to b e considered a ma jor tactical adv an tage. 5 Discussion This study rev eals that while fo otball formations provide a baseline tactical framew ork, they are not sufficien t to define or predict a team’s playing style or matc h outcome. Each nominal formation—suc h as 4-3-3 or 4-2-3-1 can embo dy a v ariety of tactical b eha viors, and the assumption that the starting formation remains static throughout the match o v erlo oks the dynamic nature of in-game adaptations. F urthermore, some teams emplo y differen t formations when attac king compared to when they are defending. Through the application of Double Machine Learning—redefined in this pro ject to handle the categorical treatmen t through a nov el matrix-based residualization of formation combinations—it b ecomes eviden t that defensiv e formations, p opularly known as ”parking the bus”, generally serv e to neutralize offensive threats rather than produce statistically adv antageous outcomes in terms of goals, p ossession, corners, or disciplinary measures. Ho wev er, ”parking the bus” often implies other strategies not considered in this analysis, such as time-wasting, tactical fouls, and slo wing do wn the pace of the game. Con versely , offensiv e formations lik e 4-2-3-1, 4-3-3, and 3-4-3 sho w a mo dest but statistically significan t p ositiv e impact on p ossession and corner differences when facing more defensive systems. Ho wev er, formation choice alone has minimal impact on red cards, which appear to b e driv en b y more random or b ehavioral factors. Imp ortan tly , the causal effect of formations on goal difference is limited, with only three formation combinations yielding significant estimators—underscoring that formations do not fully accoun t for scoring dynamics. Therefore, there is no statistical evidence that ”parking the bus” offers an y scoring or winning adv an tage. While these results provide insightful patterns, they come with certain limitations, including the criteria used for defining team strength and the simplification inv olv ed in formation grouping, which ma y not fully explain all asp ects of playing b ehavior. Nonetheless, this serves as a v aluable starting p oin t for deep er formation-based p erformance analysis and provides a clear, data-driv en framework that can help coac hes make more informed tactical decisions. 15 6 Ac kno wledgemen ts W e would lik e to express our sincere gratitude to Juan Jes ´ us Ro dr ´ ıguez and Juan Camilo V´ azquez for their in v aluable supp ort throughout this pro ject. Their deep fo otball kno wledge, esp ecially in helping group formations meaningfully and providing tactical insights, was essential to in terpreting the data accurately and grounding the analysis in real-w orld fo otball logic. 7 Author con tributions GR and LB conceived the study and outlined the article conten t. GR constructed the dataset, p erformed the statistical analyses and wrote the first draft of the article. GR and LB contributed in the methodological approac h and review ed of the man uscript. All authors ha v e read and approv ed the final v ersion of the article. 8 F unding This w ork was partially funded by the gran t R TI2018-096072-B-I00 from the Spanish Ministry of Science, Inno v ation and Univ ersities References [1] Tian yu Guan, Jiguo Cao, and Tim B Sw artz. P arking the bus. Journal of Quantitative Analysis in Sp orts , 19(4):263–272, 2023. [2] Mohamad Nizam Nazarudin, Ardo Okilanda, Y o vhandra Oc kta, Reshandi Nugraha, and Regi Dwi Septian. Ev aluating defensive strategies in fo otball: analysing the impact of defensiv e metrics on matc h outcomes. Journal of Physic al Educ ation and Sp ort , 25(5):1051–1059, 2025. [3] Rui F reitas, Anna V olossovitc h, Carlos H Almeida, and V eronica Vleck. Elite-level defensive p erformance in fo otball: A systematic review. German Journal of Exer cise and Sp ort R ese ar ch , 53(4):458–470, 2023. [4] Sp ortmonks. Sp ortmonks—sports data solutions for dev elop ers. https://www.sportmonks.com/ , 2025. Accessed on 1 July 2025. [5] Victor Chernozhuk ov, Denis Chetv eriko v, Mert Demirer, Esther Duflo, Christian Hansen, Whit- ney Newey , and James Robins. Double/debiased mac hine learning for treatment and structural parameters, 2018. [6] Manfred T e Grotenh uis, Ben Pelzer, Rob Eisinga, Rense Nieuw enh uis, Alexander Sc hmidt-Catran, and Rub en Konig. When size matters: adv an tages of w eighted effect coding in observ ational studies. International Journal of Public He alth , 62(1):163–167, 2017. 16 [7] Tianqi Chen and Carlos Guestrin. Xgb o ost: A scalable tree b oosting system. In Pr o c e e dings of the 22nd acm sigkdd international c onfer enc e on know le dge disc overy and data mining , pages 785–794, 2016. [8] Lloren¸ c Badiella, P edro Puig, Carlos Lago-P e ˜ nas, and Mart ´ ı Casals. Influence of red and y ellow cards on team p erformance in elite so ccer. A nnals of Op er ations R ese ar ch , 325(1):149–165, 2023. 17
Original Paper
Loading high-quality paper...
Comments & Academic Discussion
Loading comments...
Leave a Comment