In this first football article, we place words on the numbers of football analytics. The first part of the article gives the actual definitions, while the second shows an example for the 2012-13 Premier League season.
Eventually, I play with the numbers to understand some surprising facts about the relegated teams of last year.
When I searched for football numbers on the Web, I found lots of blogs using many abbreviations to describe the stats of the game. I found a list of definitions on the StatsBomb blog, but I wanted to do my own list to see whether I understood everything correctly. So I summed up all the metrics for which I discovered a clear explanation in blog articles.
Here are the definitions:
Metric | Abb. | Meaning |
---|---|---|
Points Per Game | PPG | actual result |
Total Shot Ratio | TSR | game dominance |
PDO | PDO | relative shot/save efficiency (regarding other teams) |
Goal Ratio | GR | goal scoring ability |
Shots On Target Ratio | SOTR | ability to create serious attempts |
Conversion Ratio | CVR | ability to score goals from attempts |
On Target Conversion Ratio | OTCVR | ability to score goals from serious attempts |
Save Ratio | SVR | ability to save serious attempts from opponent |
Cards Per Game | CPG | fair play |
Then, to compute these values we use the corresponding formulas:
Metric | Formula |
---|---|
PPG | mean(points_for) |
TSR | shots_for / (shots_for + shots_against) |
PDO | OTCVR + SVR |
GR | mean(goals_for) |
SOTR | shots_on_target_for / (shots_on_target_for + shots_wide_for) |
CVR | goals_for / (shots_on_target_for + shots_wide_for) |
OTCVR | goals_for / shots_on_target_for |
SVR | (shots_on_target_against – goals_against) / shots_on_target_against |
CPG | mean(cards_against) |
To show the real appearance of the data, let’s apply these definitions to the last Premier League season (2012-13). The dataset I use is a sample of different databases from Soccerway and Football Data UK. For more information about the process, see the programming articles about football stats extraction. Below is a table with all the values.
<tr><td>2</td>
<td>Manchester City</td>
<td>2.053</td>
<td>0.625</td>
<td>1.012</td>
<td>1.737</td>
<td>0.582</td>
<td>0.113</td>
<td>0.204</td>
<td>0.808</td>
<td>1.368</td>
</tr>
<tr><td>3</td>
<td>Chelsea</td>
<td>1.974</td>
<td>0.552</td>
<td>1.073</td>
<td>1.974</td>
<td>0.573</td>
<td>0.140</td>
<td>0.242</td>
<td>0.831</td>
<td>1.105</td>
</tr>
<tr><td>4</td>
<td>Arsenal</td>
<td>1.921</td>
<td>0.598</td>
<td>1.018</td>
<td>1.895</td>
<td>0.592</td>
<td>0.126</td>
<td>0.207</td>
<td>0.812</td>
<td>1.132</td>
</tr>
<tr><td>5</td>
<td>Tottenham Hotspur</td>
<td>1.895</td>
<td>0.634</td>
<td>0.955</td>
<td>1.737</td>
<td>0.606</td>
<td>0.122</td>
<td>0.214</td>
<td>0.741</td>
<td>1.211</td>
</tr>
<tr><td>6</td>
<td>Everton</td>
<td>1.658</td>
<td>0.572</td>
<td>0.983</td>
<td>1.447</td>
<td>0.573</td>
<td>0.098</td>
<td>0.175</td>
<td>0.809</td>
<td>1.421</td>
</tr>
<tr><td>7</td>
<td>Liverpool</td>
<td>1.605</td>
<td>0.626</td>
<td>1.025</td>
<td>1.868</td>
<td>0.578</td>
<td>0.128</td>
<td>0.215</td>
<td>0.810</td>
<td>1.132</td>
</tr>
<tr><td>8</td>
<td>West Bromwich Albion</td>
<td>1.289</td>
<td>0.462</td>
<td>1.024</td>
<td>1.395</td>
<td>0.560</td>
<td>0.126</td>
<td>0.228</td>
<td>0.796</td>
<td>1.421</td>
</tr>
<tr><td>9</td>
<td>West Ham United</td>
<td>1.211</td>
<td>0.437</td>
<td>0.974</td>
<td>1.184</td>
<td>0.558</td>
<td>0.103</td>
<td>0.164</td>
<td>0.809</td>
<td>1.711</td>
</tr>
<tr><td>10</td>
<td>Swansea City</td>
<td>1.211</td>
<td>0.482</td>
<td>0.981</td>
<td>1.237</td>
<td>0.549</td>
<td>0.102</td>
<td>0.168</td>
<td>0.813</td>
<td>1.474</td>
</tr>
<tr><td>11</td>
<td>Norwich City</td>
<td>1.158</td>
<td>0.433</td>
<td>0.974</td>
<td>1.079</td>
<td>0.552</td>
<td>0.107</td>
<td>0.184</td>
<td>0.790</td>
<td>1.289</td>
</tr>
<tr><td>12</td>
<td>Fulham</td>
<td>1.132</td>
<td>0.427</td>
<td>1.044</td>
<td>1.316</td>
<td>0.560</td>
<td>0.120</td>
<td>0.226</td>
<td>0.818</td>
<td>1.184</td>
</tr>
<tr><td>13</td>
<td>Stoke City</td>
<td>1.105</td>
<td>0.419</td>
<td>0.968</td>
<td>0.895</td>
<td>0.488</td>
<td>0.103</td>
<td>0.198</td>
<td>0.770</td>
<td>2.000</td>
</tr>
<tr><td>14</td>
<td>Southampton</td>
<td>1.079</td>
<td>0.550</td>
<td>0.951</td>
<td>1.289</td>
<td>0.529</td>
<td>0.110</td>
<td>0.229</td>
<td>0.722</td>
<td>0.974</td>
</tr>
<tr><td>15</td>
<td>Aston Villa</td>
<td>1.079</td>
<td>0.431</td>
<td>0.968</td>
<td>1.237</td>
<td>0.561</td>
<td>0.125</td>
<td>0.240</td>
<td>0.728</td>
<td>1.474</td>
</tr>
<tr><td>16</td>
<td>Newcastle United</td>
<td>1.079</td>
<td>0.511</td>
<td>0.924</td>
<td>1.184</td>
<td>0.530</td>
<td>0.102</td>
<td>0.183</td>
<td>0.741</td>
<td>1.816</td>
</tr>
<tr><td>17</td>
<td>Sunderland</td>
<td>1.026</td>
<td>0.412</td>
<td>1.005</td>
<td>1.079</td>
<td>0.596</td>
<td>0.115</td>
<td>0.189</td>
<td>0.816</td>
<td>1.579</td>
</tr>
<tr><td>18</td>
<td>Wigan Athletic</td>
<td>0.947</td>
<td>0.484</td>
<td>0.890</td>
<td>1.237</td>
<td>0.564</td>
<td>0.111</td>
<td>0.185</td>
<td>0.705</td>
<td>1.395</td>
</tr>
<tr><td>19</td>
<td>Reading</td>
<td>0.737</td>
<td>0.352</td>
<td>1.019</td>
<td>1.132</td>
<td>0.509</td>
<td>0.133</td>
<td>0.246</td>
<td>0.773</td>
<td>1.079</td>
</tr>
<tr><td>20</td>
<td>Queens Park Rangers</td>
<td>0.658</td>
<td>0.451</td>
<td>0.927</td>
<td>0.789</td>
<td>0.536</td>
<td>0.074</td>
<td>0.139</td>
<td>0.788</td>
<td>1.421</td>
</tr>
Spot | Team | PPG | TSR | PDO | GR | SOTR | CVR | OTCVR | SVR | CPG |
---|---|---|---|---|---|---|---|---|---|---|
1 | Manchester United | 2.342 | 0.542 | 1.154 | 2.263 | 0.579 | 0.177 | 0.323 | 0.831 | 1.158 |
You can play with this table by many ways, and ask yourself some questions about what happened last season:
The last two questions are open ones, and they do not have a unique answer. However, we can use the numbers of the table to find some lines of research:
The solutions I proposed are just lines of inquiry and should be verified with tactical analysis, and further statistical analysis. However, we were able to find them without watching a single game, only by having a look at the the metrics.