We're now thirteen games into the 2011/12 English Premier League season, which means we're just over one third of the way towards crowning the Premier League champion. So far the season has looked like it's Manchester City's championship to lose, with the remaining teams left to battle for one of the three remaining Champions League spots. City's start to the season is certainly hot - 35 points from an available 39 (11-2-0 for a 2.69 PPM) and a +31 GD (2.38 GD per match) - and is tops at the 13 match mark in the league's 20 team/38 match history by besting the 05/06 Chelsea and 06/07 Manchester United totals for points (34 for a 2.62 PPM) and the 08/09 Chelsea total for GD (28 for a GD per match of 2.15). However, as that 08/09 Chelsea squad can attest, starting hot does not guarantee finishing at the top. In fact, of the 16 teams that led the league after thirteen matches during the league's 20 team era, only 6 of them have gone on to finish in that position at the end of the season. Five dropped to second, three dropped to third, one (2002/03 Liverpool) dropped to fifth, and one (1998/99 Aston Villa) dropped to sixth by season's end.
So just how hot has been Manchester's City's start when considering the variation in teams' performance that sit at the top of the table after match day 13, what does it mean as to the odds of them maintaining their torrid pace for points and goal differential the rest of the year, and what does it mean for their chances of finishing top of the table come May?
Quantifying City's Hot Start
Combing through Statto.com's historical archive of tables by match day, a comprehensive list of tables for match day 13 were compiled for the 1995/96 through 2010/11 seasons. The tables were then isolated for the teams at the top of the table for each season after the 13th match day. Luckily, the results for goals per match and points per match for these top-of-the-table clubs produce a normal distribution of data for each statistic, so a Z-score approach can be taken to quantify just how rare City's start is to this season.
Looking at goal differential per game, the next closest team to City's 2.38 GD per match after 13 matches is the 2008/09 Chelsea average of 2.15. The average GD of clubs at the top of the table after 13 matches is 1.45, while the standard deviation is 0.4362. This means that Manchester City's start correlates to a Z-score of 2.17, while Chelsea's 2008/09 start translates to a Z-score of 1.64. Those Z-scores can then be translated to percentiles, which communicate what percentage of teams in first place after 13 matches would finish with a lower goal differential per match given the variation we've seen in the first 16 seasons. Manchester City's percentile is 98.5, while the 2008/09 Chelsea performance was in the 94.9 percentile. That is to say that Manchester City's goal differential performance is better than 98.5% of the expected first place performances at the 13 match mark over time, and is 4.5% higher/better than the 2008/09 Chelsea start to the season.
A similar story can be told when it comes to PPM. Manchester City has started the season with a 2.69 PPM average, narrowly besting the 2.62 PPM start realized by the 2005/06 Manchester United and 2006/07 Chelsea squads. Taking into account historical average (2.37 PPM) and standard deviation (0.178 PPM) data, the corresponding Z-scores for City and Chelsea/United are 1.80 and 1.39, respectively. This puts City's percentile at 96.4 and Chelsea/United's percentile at 91.8. Manchester City's PPM performance is better than 96% of the expected performances by first place teams at the 13 match mark over time, and is 4% higher/better than the next closest performances on record.
Projecting City's Pace through the End of the Season
But will Manchester City be able to maintain this level of performance throughout the season? History suggests not.
Along with capturing Statto.com's data at the 13 match mark, similar data was captured for each season's final table. That final table data was then compared to the match day 13 data to examine what happens to PPM and GD per match pace by the end of the season. The results are the regression analyses captured in the graphs below (click on either graph to enlarge).
The solid lines represent the nominal regression equation based upon the black points from the previous sixteen seasons, while the dashed lines represent the bounds of the 50th percentile prediction intervals (PIs) given the variation seen in the regression model. Those bounds represent the range of the middle 50% of the data, while another way to interpret the lower bound is to state that a club only has a 25% chance of finishing with a PPM or GD per match total lower than that line given the same statistic's value after 13 matches. Manchester City's 2011/12 pace is represented by the light blue dot on each graph, with its y-value set to the projected season end value based upon the nominal regression line. Despite the lower R-squared values both data sets met the statistical tests for linear correlation, and passed the statistical tests for regression fits and residuals.
While the slope term of the PPM regression is slightly greater than one (1.0028), the intercept term (-0.3086) is larger than the resultant product of any match day 13 PPM and the slope term. This suggests there is a slight fall off to be expected by season's end when it comes to point accumulation. Based upon City's current PPM, the nominal prediction is a finish of 2.39 PPM and a 50% PI of 2.22 to 2.58 PPM. Manchester City has only a 25% chance of finishing with a PPM lower than 2.22, which is certainly championship material as it would put them right in the middle for PPM of clubs who have won a Premier League championship.
Far less sustainable is City's start in relation to GD per match. The slope term in the GD per match regression is much less than one (0.689) while the intercept term is only slightly positive (0.0598). This means there is a much bigger fall of in GD per match than in PPM. Based upon City's current GD per match, the nominal prediction is a finish of 1.70 GD per match and a 50% PI of 1.44 to 1.99. Manchester City has only a 25% chance of finishing with a GD per match lower than 1.44 - only four EPL champions have finished with a higher GD per match.
Projecting City's Odds of Finishing As Champions
So where does this put City's odds of winning the EPL Championship? Certainly more sophisticated models could look at strength of remaining schedule, odds of injury, distractions from other competitions, and many other attributes that contribute to a full season of results. Ironically, a far simpler model that looks at City's PPM can help quantify their odds of finishing at the top of the table.
Using the Statto.com data, a binary logistic regression (BLR) model was created. The outcome used in the model was "top of table" vs. "not top of table". Both the constant and variable (PPM) terms tested statistically significant. The plot below shows the nominal and 50% PI lines demonstrating the odds of winning the Premier League when the first place team at match day 13 has a range of PPM values. Again, Manchester City is represented by a light blue dot with it's y-value set to the projected odds based upon the nominal regression equation. Click on the graph to enlarge.
The BLR model's nominal projection is a 94% chance of City winning the EPL Championship. The 50% PI bounds are 85% and 98%, respectively. Manchester City's odds of winning the EPL in 2011/12 are very good based upon their start to the season.
Conclusions
While Manchester City's performance will certainly drop off as the season progresses, there is no doubting how good of a start they've had so far. They're not only off to the best start in the history of the Premier League, they're also off to a start that would be better than 96% to 98% of the all the teams that would be in first place after 13 matches. They're also highly likely to win the league. That being said, Manchester United is only 5 points behind and Tottenham is only 7 points back with game in hand. Things are far from over, and I will check back in at the 2/3's point in the season (26 matches) to see if City has kept up their fast start.












