Righties and Lefties in the Deuce and Ad Courts

Italian translation at settesei.it

Taking the next step beyond yesterday’s post about servers’ success rates in the deuce and ad courts, along with each in-game score, let’s look at the tendencies of righties and lefties.

As I speculated yesterday, righties are more successful in the deuce court (64.0% to 62.3% of points won), while lefties are better serving to the ad court (63.0% to 62.3%).  The difference for lefties is a little more dramatic (62.8% to 61.8%) if we remove Rafael Nadal from the sample.

(In all of the numbers today, I’ll present lefties in two forms: with Nadal, and without Nadal.  While Rafa is just one player, he makes up nearly one-third of the service points played by lefties in the dataset we’re working with of 2011 grand slam men’s singles matches tracked by Pointstream.  As we’ll see, Nadal appears to have some tendencies that separate him from the lefty pack.)

This would seem to give lefties a bit of a strategic edge; more than three-quarters of break points are played in the ad court, including all of the break points (30-40, 40-AD) that bring the server back to even.  If lefties are more likely to win those points, they would seem to be more likely to fend off such threats.  Of course, it might cut both ways: A weakness in the deuce court may lead to more break points needing to be fended off.

Oddly enough, lefties do not seem to employ their advantage at the most common break point score, 30-40.  Both righties and lefties win 30-40 points at about 6% less frequently than they win points in general.  The most marked difference is at 40-AD, where righties win 10% fewer points than average, but lefties win only 3% fewer points than average.  Rafa accounts for a big part of that difference, probably attesting to his mental strength.  Without Rafa, lefties win 6% fewer points than average at 40-AD; still quite a bit better than righties.

(Remember, there’s a bias inherent in this approach.  If a server reaches 40-AD, the score itself reflects a disadvantage.  Federer and Isner don’t serve many 40-AD points, precisely because their serves are so dominant.  Verdasco, Fognini, and whoever is playingDjokovic serve more 30-40 and 40-AD points, meaning that the sample of 40-AD is disproportionately full of men serving against the odds.)

There’s much more to do here, but in the meantime, these broad differences between righties and lefties give us plenty to think about.

The table below shows all the numbers described above.  The columns with numbers between 0.85 and 1.1 indicates, for each type of server (righty, lefty, etc.), how their performance at a specific point compares to their overall rate.  That allows us to better compare righties (winners of 63.1% of service points) with lefties-minus-Nadal (winners of 62.3%).

SCORE    WIN%       RH               LH            LH-xRN          
ALL     62.9%    63.1%            62.7%             62.3%          
DC CT   63.5%    64.0%   1.01     62.3%   0.99      61.8%   0.99   
AD CT   62.3%    62.3%   0.99     63.0%   1.00      62.8%   1.01   

SCORE    WIN%       RH               LH            LH-xRN          
g0-0    63.4%    63.5%   1.01     63.5%   1.01      63.1%   1.01   
g0-15   60.6%    60.8%   0.96     59.5%   0.95      59.5%   0.96   
g0-30   62.0%    62.6%   0.99     59.4%   0.95      60.6%   0.97   
g0-40   54.8%    54.1%   0.86     56.3%   0.90      54.6%   0.88   

g15-0   63.8%    63.7%   1.01     65.3%   1.04      64.8%   1.04   
g15-15  63.4%    63.8%   1.01     59.5%   0.95      59.6%   0.96   
g15-30  60.1%    60.2%   0.95     60.6%   0.97      61.4%   0.99   
g15-40  61.2%    61.5%   0.98     58.2%   0.93      58.3%   0.94   

SCORE    WIN%       RH               LH            LH-xRN          
g30-0   64.9%    65.0%   1.03     64.1%   1.02      63.0%   1.01   
g30-15  62.7%    62.5%   0.99     65.7%   1.05      65.7%   1.05   
g30-30  64.0%    64.3%   1.02     62.5%   1.00      60.9%   0.98   
g30-40  59.3%    59.2%   0.94     58.6%   0.94      59.3%   0.95   

g40-0   67.1%    66.8%   1.06     67.0%   1.07      65.1%   1.05   
g40-15  65.6%    66.0%   1.05     62.5%   1.00      61.6%   0.99   
g40-30  63.7%    63.6%   1.01     67.0%   1.07      65.6%   1.05   
g40-40  61.6%    61.9%   0.98     61.4%   0.98      62.2%   1.00   
g40-AD  57.8%    57.0%   0.90     60.5%   0.97      58.7%   0.94   
gAD-40  62.3%    62.4%   0.99     59.7%   0.95      61.4%   0.99

Point Outcomes by Game Score

Italian translation at settesei.it

If tennis players were machines, each player would have the same probability of winning every point.  Winning the point at 40-15 would be equally likely as winning the point at 15-40.  It seems a safe bet that this isn’t the case, and today I’m going to start talking about the difference, and why it exists.

To begin with, let’s look at the outcome of every grand slam men’s singles point in 2011, sorted by the score before the point was played.  (I’ll explain some of this in a minute.)

SCORE     PTS    WON   WIN%   REL  
g0-0    10757   6820  63.4%  1.00  
g0-15    3941   2390  60.6%  0.97  
g0-30    1552    963  62.0%  0.98  
g0-40     591    324  54.8%  0.88 

g15-0    6823   4356  63.8%  1.02  
g15-15   4858   3081  63.4%  1.00  
g15-30   2741   1648  60.1%  0.97  
g15-40   1416    866  61.2%  0.96  

SCORE     PTS    WON   WIN%   REL  
g30-0    4355   2826  64.9%  1.02  
g30-15   4609   2890  62.7%  1.01  
g30-30   3366   2155  64.0%  1.01  
g30-40   2080   1234  59.3%  0.95 

g40-0    2824   1895  67.1%  1.08  
g40-15   3819   2507  65.6%  1.03  
g40-30   3466   2209  63.7%  1.02  
g40-40   4556   2806  61.6%  0.97 

g40-AD   1749   1011  57.8%  0.93  
gAD-40   2806   1748  62.3%  1.00  

SCORE     PTS    WON   WIN%        
ALL     66309  41729  62.9%        
DC CT   34679  22024  63.5%        
AD CT   31630  19705  62.3%

One thing that sticks out is that as players get closer to winning a game (30-0, 40-0), they are more likely to win the next point.  When facing (or approaching) break point, they have less success.

Much of that (and maybe all of it) is simply the bias of the sample.  If a player reaches 40-0, he’s more likely to be a player who is dominant on serve, or facing a returner who hasn’t found the range.  A disproportionate number of 40-0 points are served by players who are better-than-average servers.  Similarly, a disproportionate number of 0-40 points are served by players without dominant service games … or served against Novak Djokovic.

Deuce and ad courts

A more useful finding is that players win more points in the deuce court.  In this sample, the server won 63.5% of points in the deuce court and 62.3% of points in the ad court.  This may be because right-handers (who make up about 85% of this sample) are more successful when serving across their body, but I haven’t tested that yet.

(If it is true that players are better serving across their body, then the difference is even more stark.  Assuming that righties and lefties have the same difference in success rates, the “serve across your body” success rate–deuce for righties, ad for lefties–should be about 63.8%, while the “serve away from your body” rate–ad for righties, deuce for lefties–should be 62.1%.)

Thus, the difference between success rates at 0-0 and 0-15 isn’t as extreme as it looks at first; some of the 0-15 winning percentage is due to the difficulty of serving to the ad court.  That’s the purpose of the ‘REL’ column, which shows how the winning percentage on that point relates to the average winning percentage in the relevant court.

If this difference is universally true, it would require a change in win probability tables.  For instance, when the returner reaches break point–which is more often in the ad court, at 30-40 or 40-AD–his chance of winning the game is a percentage point or two higher than previously estimated.  As long as he’s playing a right-hander, anyway.

There’s plenty more to investigate here.  To determine whether players really raise or drop their performance levels (for instance, raising their game against break point, or taking it easy at 40-0), we’ll need to switch to a player-by-player basis, to reduce the skewing effect of dropping every player in the same bucket.

Server’s Advantage: First and Second Serves

A couple of months ago I presented some research that showed that, in the average men’s grand slam match, the server’s advantage was neutralized somewhere between the 4th and 8th shot.

That research left a major question unanswered: How do the results differ between first and second serves?  Some second serves are hardly better than rallying shots, so it stands to reason that the server’s advantage is neutralized even faster on the second serve.

Using all of the Pointstream-tracked men’s matches from this year’s grand slams, we have an enormous population of points, in which 63.2% of points were won by the server.  When the first serve went in, the percentage jumped to 71.7%.  If the first serve went out, the server’s chance of winning fell to 50.2%, then rose again to 56.1% if he landed his second serve.

On first serve points, the server’s advantage was not neutralized until at least the 8th shot of the rally, and perhaps not until the 9th or 10th.  On second serve points, however, the advantage was gone (or very nearly so) as soon as the returner got the ball back in play.

Below, find the exact percentages of service points won for rallies that reach various lengths.  In the table below, the ‘1’ row refers to points with at least one stroke (the serve) that went in.  A one-stroke rally is defined as an ace, service winner, or return error.  A two-stroke rally is defined as a point in which the return landed in but the server doesn’t get his second shot back in play.  Note that each individual percentage is biased in favor of the player (server or returner) who has the chance to put the point away; the point can be considered neutralized when the biased even out (e.g. 55, then 45, then 55, and so on).

Rally    All     1sts     2nds
0      63.2%
1      66.1%    71.7%    56.1%
2      50.3%    53.7%    45.6%
3      59.9%    63.6%    54.9%
4      46.4%    47.8%    44.6%
5      58.1%    60.3%    55.6%
6      44.9%    45.9%    43.9%
7      57.0%    58.2%    55.6%
8      44.5%    45.1%    43.9%
9      55.9%    56.4%    55.5%
10     44.1%    44.1%    44.0%
11     55.8%    55.9%    55.8%
12     43.2%    43.1%    43.3%
13     55.6%    55.6%    55.5%
14     43.5%    43.5%    43.6%
15     55.5%    55.3%    55.6%

The Effect of One More MPH

Italian translation at settesei.it

All else equal, increasing your first serve speed is a good thing … so how useful is it?  Earlier this week, I published some generic numbers, but those are far too crude to answer this question.

To get a better answer, we need to see what happens when specific players serve a little faster or slower.  Sometimes, players dramatically mix up serve speed (as with slice serves wide), but most of the time, each player stays within a fairly limited range defined by his own power and skill.

The algorithm I’ve employed is  fairly complicated, so I’ll give you the results first.

It appears that most players, if they increased their average serve speed by one mile per hour, would win 0.2% more first service points.  That’s not many–it’s not even one point in every match.  But every little bit helps, and according to my win probability models, winning 0.2% more first serve points can increase your chance of winning an even match from 50% to just short of 51%.  Except possibly at the extremes, that continues to be the case for 2 MPH, 3 MPH, and greater increases–so a 5 MPH increase takes that 50/50 match and turns it into a 54/46 contest.

(One assumption here is that all players respond to increases in serve speed the same way.  I’m sure that’s not true, but at this stage it’s a necessary assumption.)

The effect of a speed increase is even greater on ace and service winner rates.  Each additional MPH on a player’s serve increases his ace rate by about 0.4%, and his service winner rate by about 0.5%.

Now for the algorithm and some caveats.

Process

The algorithm was designed to control (to the extent possible) for different types of serving and playing styles, as well as the different average speeds to the deuce and ad court, as well as to different directions (wide, body, and T).

I used only US Open data, to avoid differences between surfaces and between the speed guns used at different events.  I used data only from the 18 players who had more than 150 first-serve points tracked by Pointstream.  For each of those players, I found their average first-serve speed for each of six directions: wide, body, and T to the deuce and ad courts.  Then, I randomly selected 150 of their first-serve points, and for each point, noted the difference between the point’s serve speed and the player’s average in the relevant court/direction.

Thus, every one of 2700 points was labeled 0 (average for that player/court/direction), or +1 (one mph above average), or -4, and so on.  That results in large pools of points with each label.  Many of the pools were too small for useful analysis, so I grouped them in sets of five: (-2, -1, 0, +1, +2), (-1, 0, +1, +2, +3), and so on.  The pools, then, were useful from about -15 to +15.

From there, I looked at  each of several stats (points won, aces, service winners) for each pool, and compared the rates from one pool to the next.  The results were somewhat erratic–in some instances, an additional mph results in aces or points won going down, but over the set of 31 pools, they generally went up.  The numbers presented above are the averages of each one-mph change.

Caveats

It’s not a very big sample, especially when separating serves into pools of 0, +1, +2, and so on.

One issue with the dataset is that the 18 servers were usually winning–that’s how they got enough first serves to merit inclusion.  Thus, the average returner in the dataset is below average.  That isn’t necessarily a bad thing–perhaps below-average returners respond to changes in serve speed the way above-average returners do–but without more data, it’s tough to know.

Another concern is what the numbers really tell us below about 5 mph slower than average.  The algorithm operates on the assumption that a 120 mph serve is the same as a 121 mph serve, only slower.  Comparing 120 and 121, that’s probably true.   But comparing 120 and 108–for the same player, serving in the same direction–it probably isn’t.  The 108 mph isn’t a simulation of what would happen if the player wasn’t as good; it’s probably a strategic choice, likely accompanied by some spin.

That said, the algorithm doesn’t directly compare 120 and 108, it compares 108 and 109, and perhaps in the aggregate, there is something useful to be gleaned from comparing a strategic spin first serve to an identical serve one mph faster.  In any event, limiting the range to between -10 and 10, or even -7 and 7, doesn’t change the results much.

Finally, the sample is completely inadequate to tell us what happens at the extremes.  The average player appears to improve his chances by adding another bit of speed, but does John Isner?  There may be a ‘sweet spot’ where a player can get maximum gains from an additional 1, 5, or 10 mph on his first serves, but beyond which, the gain is more limited.

US Open Serve Speed by Player

It’s time for more serve-speed research notes. Most of the matches at the 2011 U.S. Open were tracked by Pointstream, and serve speed was recorded for the vast majority of those points. The Open website published some serve speed numbers, but not as conveniently as I would like.

Below, find the average first and second serve speeds for every man who played three or more Pointstream-tracked matches. Oddly enough, the top and bottom of the list are held by Americans; John Isner is where you’d expect him, while Donald Young barely kept his first-serve average in the triple digits.

I didn’t expect to see nearly so much variation in the difference between first and second serve averages. Sure, Isner and Young are the endpoints in both lists, but David Nalbandian–below average on firsts–is third of 22 on seconds. To take another angle, both Marin Cilic and Jo-Wilfried Tsonga each have more than double the difference in averages than does either Alex Bogomolov or Fernando Verdasco.

(“M” is the number of matches tracked by Pointstream for each player.)

Player                 M  1sts  1stAvg  2nds  2ndAvg  
John Isner             4   313   124.5   125   106.2  
Andy Roddick           5   249   122.1   118   100.5  
Tomas Berdych          3    85   120.3    71    95.0  
Jo-Wilfried Tsonga     5   289   119.7   206    90.6  
Marin Cilic            3   125   118.7   121    86.3  
Janko Tipsarevic       3   148   116.5    84    90.5  
Roger Federer          6   355   115.6   186    94.6  
Juan Martin Del Potro  3   180   114.5    96    88.2  
Julien Benneteau       3   177   114.0    86    89.9  
Tommy Haas             3   211   113.9   124    94.1  
Novak Djokovic         7   421   113.7   226    91.4  

Player                 M  1sts  1stAvg  2nds  2ndAvg
Andy Murray            6   338   112.6   204    85.2  
Mardy Fish             4   231   112.4   165    88.0  
David Nalbandian       3   165   112.3   125    96.1  
David Ferrer           3   128   112.2    74    88.9  
Rafael Nadal           7   435   110.5   176    84.5  
Juan Monaco            3   167   109.4    70    90.4  
Gilles Simon           3   235   108.3   179    81.6  
Fernando Verdasco      3   175   107.3    72    92.6  
Alex Bogomolov Jr.     3   264   103.1    96    89.1  
Donald Young           4   213   101.9   111    80.6

The Effect of Serve Speed

Italian translation at settesei.it

All else equal, you want to serve harder. But how much does it really matter?

That’s a more difficult question than it sounds, and I don’t yet claim to have an answer. In the meantime, I can share the results of some data crunching.

In 2011 U.S. Open matches covered by Pointstream, there were more than 9,000 first serve points. The server won almost exactly 70% of those points. About 11% of points were aces, and another 24% were service winners.

To see the effect of serve speed, I looked at four outcomes: aces, service winners, short points (three or fewer shots), and points won. It’s no surprise that each type of results happens more on faster serves.

Below, find the full numbers for serves of various speeds. The finding that sticks out to me is the small change in service points won from the 95-99 MPH group to the 115-119 MPH group. It may be that the modest increase–put another way, the surprising success rate at 95-104 MPH–is a result of strategic wide serves, or the better ground games of the players who hit slower serves.

So as I said, there’s much more work to be done, identifying the effects of faster serves for individual players, looking at deuce/ad court differences (for righties and lefties), and the results on different serve directions.

MPH      SrvPts   Ace%  SvcW%  Short%  PtsWon%  
85-89       140   2.1%  17.9%   47.1%    55.0%  
90-94       275   0.7%  21.5%   47.6%    63.6%  
95-99       546   2.2%  18.5%   48.4%    66.1%  
100-104     885   4.2%  24.6%   51.0%    66.0%  
105-109    1400   6.4%  29.3%   56.6%    68.7%  
110-114    1524   8.7%  34.0%   57.3%    69.1%  
115-119    1487  12.2%  35.9%   60.8%    69.4%  
120-124    1553  16.1%  40.1%   65.2%    73.2%  
125-129     941  21.5%  48.1%   72.4%    76.3%  
130-134     353  29.7%  58.4%   77.3%    84.4%  
135-139      66  27.3%  65.2%   80.3%    89.4%

Quantifying Comebacks and Excitement With Win Probability

Italian translation at settesei.it

As promised the other day, there’s a lot we can do with point-by-point and win probability stats for over 600 grand slam matches.

I’ve beefed up those pages a bit by borrowing some ideas from Brian Burke at Advanced NFL Stats.  He invented a couple of simple metrics using win probability stats to compare degrees of comebacks and the excitement level of (American) football games.

The concepts transfer to tennis quite nicely.  Comeback Factor identifies the odds against the winner at his lowest point.  I’ve defined it the same way Burke does for football: CF is the inverse of the winning player’s lowest win probability.  In the US Open Federer/Djokovic semifinal, Djokovic’s win probability was as low as 1.3%, or 0.013.  Thus, his comeback factor is 1/.013, or about 79.  That’s about as high a comeback factor as you’ll ever see.

On the other end, comeback factor cannot go below 2.0 — that’s the factor if the winning player’s WP never fell below 50%.  Matches in which the winner dominated are often very close to 2.0, as in the Murray/Nadal semifinal.  In that match, Nadal’s low point was facing a single break point at 2-3 in the first set; the comeback factor is 2.3.

A good way to think about comeback factor is this: “At his lowest point, the winning player faced odds of 1 in [CF].”

Excitement Index is a measure of volatility, or the average importance of each point in a match.  “Volatility” measures the importance of each individual point; EI is the average volatility over the course of a match.

(Burke sums the volatilities, reasoning that in football, a fast-paced game with many plays is itself exciting.  Since there is no clock in tennis [not exactly, anyway], it seems appropriate to average the volatilities.  Win probability already considers the excitement and importance of a deciding final set.)

At the moment, I’m calculating EI by multiplying the average volatility by 1000.  The Murray/Nadal match is 35 (not very exciting, though Murray fought back), the Djokovic/Federer match is 47 (more on that in a minute), while the 2nd rounder between Donald Young and Stanislas Wawrinka is 64.  I haven’t looked at all the matches yet, but EI should generally fall between 10 and 100, possibly exceeding 100 in rare instances like the Isner/Mahut marathon.

It seems like Djok/Fed should be higher, perhaps because we remember the excitement of the final set.  (And it may be that the final set should be weighted accordingly.)  But looking at the match log, there were an awful lot of quick games, which translate to relatively low volatility.  By contrast, Donald/Stan was more topsy-turvy throughout, as the players traded sets, then send volatility through the roof with a pair of breaks midway through the final set.

Both EI’s scaling and its exact definition are works in progress.  When I get a chance, I’ll do a survey of matches for which I have point-by-point data to further investigate both of these new (to tennis) metrics.

Win Probability Graphs and Stats

Win probability graphs and stats are now available for over 600 grand slam matches from 2011.  Thanks to IBM Pointstream from this year’s slams, there is a wealth of data available like never before.

Here’s the main menu.

Here’s a sample match: The US Open semifinal between Federer and Djokovic.

When I first started publishing tennis research, win probability was one of my focuses.  You can find earlier work here, which links to specific tables for games, sets, and tiebreaks.  I’ve also published much of the relevant code, which is written in Python.

Win probability represents the odds of each player winning after every point of the match, based on the score up to that point and which player is serving. It makes no assumptions about the specific skill levels of each players, but does assume that the server has an advantage, which varies based on surface and gender.  With every point, each player’s win probability goes up or down, and the degree to which it rises or falls is dependent on the importance of the point–at 4-1, 40-0, winning the point is nice, but losing the point just delays the inevitable; at 5-6 in a tiebreak, the potential change in win probability is huge.

To quantify that in the graphs, I show another metric: Volatility, which measures the importance of each point. It is equal to the difference in win probabilities between the server winning and losing the following point. 10 percent is exciting, 20 percent is crucial, and 30 percent is edge-of-your-seat stuff.

Assumptions

To produce these numbers, I needed to make several simplifying assumptions.  Some are more important than others; here are the big two:

  • The players are equal.
  • Each player’s ability does not vary from point to point.

The first of these is almost always false, and the second is probably false as well.  The first, however, makes things more interesting.  In most matches Novak Djokovic plays these days, he goes in with an 80-percent-or-better chance of winning.  If we graphed one of his matches starting at 85 percent, we’d usually get a very slowly ascending line.  Instead, by starting at 50 percent, we can see where he and his opponent had their biggest openings, and who took advantage.

(In this long-ago post, I showed a sample graph with an assumption similar to the 85 percent for Djokovic, and you can see some of what I mean.)

Assuming that the players are equal also sidesteps of messy question of how to quantify each player’s skill level on that day, on that surface, against that opponent.

The second big assumption ignores possibility real-world attributes like clutch performance and streakiness, along with more pedestrian considerations like some players’ stronger serving in the deuce or ad court.

Another long-ago article of mine suggests that servers are not absolutely consistent, possibly because of natural rises and falls in performance, also possibly because of risk-taking (or lack of concentration) in low-pressure situations.  One of the most interesting directions for research with these stats is into this inconsistency: We need to figure out whether some players are more consistent than others, whether “clutch” exists in tennis, and much more.

One more set of assumptions regards the server’s advantage.  Since these graphs only encompass the four grand slams, I set the server’s win percentage for each tournament.  The numbers I used for men are: 63% in Australia, 61% at the French, 66% at Wimbledon, and 64% at the U.S. Open.  I used percentages two points lower for women at each event.

More on Win Probability

There’s very little out there on win probability and volatility in tennis.  I wasn’t the first person to work out the probability of winning a game, a set, or a match from a given score, but as far as I know, I’m the only person publishing graphs like this.  Much of the problem is the limited availability of play-by-play descriptions for professional tennis.

That problem doesn’t apply to baseball, where win probability has thrived for years.  Here’s a good intro to win probability stats in baseball, and fangraphs.com is known for its single-game graphs–for instance, here’s tonight’s’s Brewers game.  In many ways, win probability is more interesting in baseball than in tennis.  In tennis, there are only two possible outcomes of each point, while in baseball, there are several possible outcomes of each at-bat.

Enjoy the graphs and stats!

The Speed of Every Surface

Italian translation at settesei.it

Last week, I wrote an article for the Wall Street Journal noting the relatively slow speed of this year’s U.S. Open.  It’s not clear whether the surface itself is the cause, or whether the main factor is the humidity from Hurricane Irene and Tropical Storm Lee.  For whatever reason, aces were lower than usual, creating an environment more favorable to, say, Novak Djokovic than someone like Andy Roddick.

The limited space in the Journal prevented me from going into much detail about the methodology or showing results from tournaments other than the slams.  There’s no word limit here at Heavy Topspin, so here goes…

Aces and Server’s Winning Percentage

Surface speed is tricky to measure–as I’ve already mentioned, “surface speed” is really a jumble of many factors, including the court surface, but also heavily influenced by the atmosphere and altitude.  (And, possibly, different types of balls.)  If you were able to physically move the clay courts in Madrid to the venue of the Rome Masters, you would get different results.  But teasing out the different environmental influences is little more than semantics–we’re interested in how the ball bounces off the court, and how that affects the style of play.

So then, what stats best reflect surface speed?  Rally length would be useful, as would winner counts–shorter rallies and more winners would imply a faster court.  But we don’t have those for more than a few tournaments.  Instead, I stuck with the basics: aces, and the percentage of points won by the server.

Important in any analysis of this sort is to control for the players at each tournament.  The players who show up for a lower-rung clay tournament are more likely to be clay specialists, and the men who get through qualifying are more likely to be comfortable on clay.  Also, the players who reach the later rounds are more likely to be better on the tournament’s surface.  Thus, the number of aces at, say, the French Open is partially influenced by surface, and partially influenced by who plays, and how much each player plays.

Thus, instead of looking at raw numbers (e.g. 5% of points at Monte Carlo were aces), I took each server in each match, and compared his ace rate to his season-long ace rate.  Then I aggregated those comparisons for all matches in the tournament.  This allows us to measure each tournament’s ace rate against a neutral, average-speed surface.

The Path to Blandness

The ace rate numbers varied widely.  While the Australian Open and this year’s US Open were close to a hypothetical neutral surface speed, other tourneys feature barely half the average number of aces, and still others have nearly half-again the number of aces of a neutral surface.   I’ve included a long list of tournaments and their ace rates below; you won’t be surprised to see the indoor and grass tournaments on the high end and clay events at the other extreme.

But there’s a surprise waiting.  I also calculated the percentage of points won by the server, and like ace rate, I controlled for the mix of players in every event.  While ace rate varies from 53% of average to 145% of average, the percentage of points won by the server never falls below 90% of average, rarely drops below 95%, and never exceeds 105%.  53 of the 67 tournaments listed below fall between 97% and 103%–suggesting that surface influences the outcome of only handful of points per match.

That may defy intuition, but think back to the mix of players at each tournament.  Big-serving Americans don’t show up at Monte Carlo, while South Americans generally skip every non-mandatory event in North America.  The nominal rate at which servers win points varies quite a bit, but that’s because of the players in the mix.

Also, this finding suggests that, as a stat, aces are overrated.  They may be a useful proxy for server dominance–if a players hits 15 aces in a match, he’s probably a pretty good server–but they come nowhere near telling the whole story.  Aces on grass turn into service winners on hard courts, and then become weak returns and third-shot winners on clay.  The end result is usually the same, but Milos Raonic is a lot scarier when the serves bounce over your head.

Finally, it would be a mistake to say that a variance of 3-5% in serve points won is meaningless.  It may be less than expected, but especially between good servers, 3-5% can be the difference.  Move Saturday’s Federer/Djokovic semifinal to a surface like Wimbledon’s, and we’d be looking at a different champion.

All the Numbers

Here is the breakdown of ace rate and serve points won, compared to season average, for nearly every current ATP event.

Since I am using each season’s average, you may wonder whether the averages themselves have changed from year to year.  I’ve read that courts are getting slower, but in the five-year span I’ve studied here, the ace rate has actually crept up a tiny bit.  Each tournament varies quite a bit–probably due to weather–but generally ends up at the same numbers.

Below, find the 2011 ace rate and percentage of serve points won, as well as the average back to 2007.   Again, these are controlled for the mix of players (including how much each guy played), and the numbers are all relative to season average.

The little letter next to the tournament name is surface: c = clay, h = hard, g = grass, and i = indoor.

Tournament          2011Ace  2011Sv%    AvgAce  AvgSv%  
Estoril          c    57.5%    96.6%     53.3%   94.3%  
Monte Carlo      c    52.0%    92.1%     53.9%   91.2%  
Umag             c    58.6%    95.2%     58.7%   94.3%  
Serbia           c    54.2%    93.5%     61.0%   94.8%  
Rome             c    62.5%    95.9%     62.9%   94.4%  
Buenos Aires     c    61.9%    99.0%     62.9%   98.6%  
Houston          c    64.9%    97.2%     66.6%   96.8%  
Valencia         i                       68.0%   96.4%  
Barcelona        c    55.7%    94.3%     68.0%   96.2%  
Dusseldorf       c    45.7%    96.5%     72.8%   97.2%  

Hamburg          c    78.0%    96.6%     74.3%   96.4%  
Bastad           c    63.8%    94.5%     76.8%   97.7%  
Roland Garros    c    78.0%    98.4%     77.1%   97.5%  
Santiago         c    84.5%    98.5%     81.5%   99.4%  
Costa do Sauipe  c    83.4%   101.7%     84.2%   98.9%  
Nice             c    88.5%    97.4%     84.3%   98.1%  
Casablanca       c    79.1%    99.0%     84.9%   98.2%  
Acupulco         c    70.9%    95.6%     86.0%   98.7%  
Madrid           c    77.0%    98.5%     86.1%   98.0%  
Munich           c    87.9%   100.1%     86.5%  100.0%  

Beijing          h                       86.7%   97.3%  
Los Angeles      h    84.7%    97.2%     87.7%   97.3%  
Kitzbuhel        c    95.8%    97.9%     89.0%   98.6%  
Toronto          h                       89.6%   98.3%  
Chennai          h    82.3%    98.0%     89.6%   98.7%  
Stuttgart        c    77.0%    95.8%     89.7%   98.1%  
Indian Wells     h    88.9%    99.0%     90.9%   98.0%  
Doha             h   125.5%   101.9%     91.2%   97.6%  
Auckland         h   103.1%   102.0%     93.9%   98.7%  
Miami            h    94.5%    97.9%     94.4%   98.0%  

Shanghai         h                       94.6%   98.1%  
Australian Open  h    97.6%    97.3%     96.5%   96.9%  
Kuala Lumpur     h                       97.1%   97.3%  
Sydney           h   105.8%   100.0%     97.4%   99.1%  
St. Petersburg   i                       97.8%  101.7%  
Montreal         h    91.3%    98.4%     98.1%   98.2%  
Delray Beach     h   106.2%    99.9%     99.1%   98.6%  
Gstaad           c   104.5%   100.1%    101.2%  101.4%  
Dubai            h   102.7%    96.5%    103.2%   98.2%  
US Open          h   101.3%    97.4%    104.0%   98.7%  

Vienna           i                      105.8%  101.4%  
Johannesburg     h   110.0%   102.7%    106.0%  101.0%  
Washington DC    h    97.5%   100.1%    106.8%   99.8%  
Newport          g    93.3%    99.0%    107.5%  101.7%  
Winston-Salem    h   108.1%    99.6%    108.1%   99.6%  
Atlanta          h   110.0%   100.9%    108.4%   99.0%  
Bangkok          h                      110.5%  101.6%  
Cincinnati       h    96.2%    98.9%    111.7%  100.5%  
Zagreb           i   107.0%    99.2%    112.3%  102.3%  
Moscow           i                      113.0%  101.3%  

Brisbane         h   130.6%   100.3%    113.4%  100.0%  
Eastbourne       g   111.2%   101.8%    114.1%  102.9%  
Paris Indoors    i                      115.4%   99.6%  
Rotterdam        i   123.8%   103.7%    115.9%  101.0%  
Basel            i                      117.7%  101.3%  
San Jose         i   108.6%   103.0%    120.0%  102.7%  
Wimbledon        g   119.4%   102.8%    120.7%  103.0%  
Queen's Club     g   113.3%   101.8%    121.5%  103.2%  
Halle            g   122.9%   104.7%    123.2%  102.5%  
Marseille        i   127.4%   102.8%    124.2%  102.2%  

Stockholm        i                      124.4%   99.8%  
Metz             i                      124.6%  101.7%  
Tokyo            h                      124.7%  100.5%  
s-Hertogenbosch  g   110.9%   102.1%    126.3%  104.0%  
Memphis          i   117.1%   101.2%    129.1%  102.0%  
Montpellier      i                      145.4%  104.5%