xT - and winning?
Expected threat - a tool for evaluating performance
In football, we often talk about goals and shots, but what if the most important actions happen long before the ball reaches the net? With expected threat we can quantify how teams and players create dangerous situations – and perhaps even predict who will win. During EURO last summer, 45,630 passes and 44,139 carries were completed. This article explores which of these actions add the most value and identifies the players and teams excelling at them. Using position-based expected threat (xT), I will explore questions such as:
- Is there a connection between xT and winning?
- Which teams and players created most xT per game during EURO 2024?
- Which player positions are most efficient at creating xT?
Position-based expected threat
Position-based expected threat assigns a value to each point on the pitch. These values is probabilities for their respective areas and used for finding expected threat, or xT. Each probability is the chance for scoring from that particular position, either directly or after moving the ball elsewhere before ball is lost or out of play. The grid below is an example with probabilities split into 12x8 bins, where each bin containing these probabilities. These probabilities are based on multiple seasons of football data. They are made with the use of a mathematical concept called markow chains, which calculate probabilities for sequence of events. This thought were first used in football by Sarah Rudd, and later Karun Singh used the same idea and called it “expected threat” in an article in 2018 (you can read here).
Figure 1. Grid with probabilities
A value of 1 means every sequence of play from that location would result in a goal, either directly from a shot, or indirectly after moving the ball elsewhere. The values obviously increase the closer we get to the opposition goal. The difference in probabilities between the end and start location give the xT-value. Lets use an example to illustrate.
Location (A) has a prability of 0.019. This means a 1,9% chance of scoring either directly from A or after moving the ball to another location. If the ball is moved from (A) to (B), we can calculate xT for that movement with subtracting p(B)-p(A). The player passing the ball (Calhanoglu in this case) is credited with 0.238 xT, increasing his team’s chance of scoring by 23,8 %. The numbers in the grid above are from McKay Johns’ github (see link at bottom). Credit to him for his valuable sharing and contributions. There is multiple versions of this type of grid. Tom Worville wrote an article where he covers it more in-depth: here. With a foundational understanding of xT, let’s move into real-world applications.
xT and Winning: Is There a Connection?
Using data from the Premier League 2017/2018 season, I investigated whether teams with higher xT values consistently performed better, measured with leaguerank. Here, I only measure passes, as carries were not a part of the Statsbomb data at that point.
The results indicate top teams generate significantly more xT. The old “top six” teams stand apart from the rest, underscoring the influence of xT on league outcomes. Carrying out a correlation test give more information about the size of connection between the two.
Figure 3. Pearson correlation for xT and leaguerank PL 2017/2018
The correlation coefficient measures the linear relationship between xT and leaguerank. Pearson correlation is measured from -1 to 1, where 1 meaning xT value explaining all league ranks, 0 meaning there is no connection, and -1 is negative correlation. Here, leaguerank is flipped to avoid negative correlation (1st place would originally mean low leaguerank). There is a strong relationship (0.785) between xT and leaguerank. Teams that accumulated more xT were more likely to finish higher in the table. There are definately more factors affecting the outcome, but the high correlation suggest xT as a relevant metric. After establishing xT’s relevance for 2017/2018, let’s apply this to present time and EURO 2024.
EURO 2024
Here are three examples of different passes and their xT values during EURO 2024. These passes had some of the highest xT values in the tournament.
A lot of experienced players are on top for passing. One observation from this figure is the much higher number of xT created by passes, compared to carries. Passing is an easier and normally quicker way to get the ball into dangerous areas, unless your name is Jeremy Doku. Doku is a clear example of a player tasked with carrying the ball and creating chances. A comparison between players make more sense for players in similar position and roles (original positions is taken from Hudl/Statsbomb lineups). From those positions, I then made six groups.
Wingers were top for carries in the position group, and high for passes as well. This visual confirms Doku's special ability of running with the ball. It also shows top passers like Eriksen and Tadic, as well as players who excell at doing both, like Conceição and Williams.