Andy Murray and The Worst Upsets of the Year

On Tuesday in Montreal, Andy Murray played an ugly, listless match against world #35 Kevin Anderson, losing 6-3 6-1.  While Murray has played some solid matches this year and is in no immediate danger of losing his top-four ranking, the Anderson loss is hardly the first disaster of his season.  Back in Indian Wells and Miami, he managed to lose to Donald Young and Alex Bogomolov in successive matches.  Ouch.

Using my rankings and match projection system, I’ve generated win probabilities for every ATP match of the season.  Combined with match outcomes, that allows us to find the upsets that were least expected on that surface, at that time.

Pre-match, my system gave Anderson a 16.3% chance of beating Murray–only a smidge better than Dodig’s 14.4% against Nadal.  (My system has never given the South African much credit; his hard-court ranking right now is #58.)  In fact, Anderson was the 4th-biggest underdog going into the 2nd round, ahead only of Dodig, Michael Russell, and Vasek Pospisil.

As it turns out, Anderson’s victory was the 14th-biggest upset win of the ATP season.  (I took out retirements and “comeback players,” like Fernando Gonzalez and Tommy Haas, whose rankings aren’t very predictive.)  That’s 14 out of nearly 1,700.

But, as you might guess, 14th-best of the season isn’t enough to be 1st with Murray on the losing end.  The Murray loss to Young in March is the biggest upset of the year–Donald entered the match with an 8.6% chance of winning.  The Bogomolov match comes in 4th overall; the American had a 10.1% chance before play began.

Edit: This is what I get for writing a draft the night before!  Dodig’s upset victory comes in tied for #10 on the season, pushing Murray/Anderson down one more spot on the list.  Nadal will go home having suffered the biggest upset of the Rogers Cup, though he played far and away better than Murray did to achieve the same outcome.

The biggest upsets of the year

I couldn’t possibly give you those numbers without following through with a complete table.  Here are the 36 matches where the winner entered the match with less than a 20% chance of winning.  This list is through last week’s matches, so it doesn’t yet show Murray’s latest meltdown and Dodig’s shocker.

(This site doesn’t show wide tables very well; click here for a clearer version.)

P(UPSET)  WINNER                 LOSER               TOURNEY          SCORE
 8.6%     Donald Young           Andy Murray         Indian Wells     7-6(4) 6-3
 9.4%     Bernard Tomic          Robin Soderling     Wimbledon        6-1 6-4 7-5
 9.7%     Jimmy Wang             Igor Kunitsyn       Newport          4-6 7-5 6-2
10.1%     Alex Bogomolov         Andy Murray         Miami            6-1 7-5
11.6%     Stephane Robert        Tomas Berdych       French Open      3-6 3-6 6-2 6-2 9-7
13.1%     Milos Raonic           Mikhail Youzhny     Australian Open  6-4 7-5 4-6 6-4
13.8%     Denis Kudla            Grigor Dimitrov     Newport          6-1 6-4
14.0%     James Ward             Stanislas Wawrinka  Queen's Club     7-6(3) 6-3
14.1%     Federico del Bonis     Florian Mayer       Stuttgart        6-2 6-3
14.4%     Jo-Wilfried Tsonga     Rafael Nadal        Queen's Club     6-7(3) 6-4 6-1               

P(UPSET)  WINNER                 LOSER               TOURNEY          SCORE
15.0%     Jan Hernych            Thomaz Bellucci     Australian Open  6-2 6-7(11) 6-4 6-7(3) 8-6
15.2%     Nikolay Davydenko      Rafael Nadal        Doha             6-3 6-2
15.6%     Andrey Kuznetsov       Marcos Baghdatis    Casablanca       6-4 4-6 6-4
16.4%     Flavio Cipolla         Andy Roddick        Madrid Masters   6-4 6-7(7) 6-3
16.6%     Lukas Rosol            Jurgen Melzer       French Open      6-7(4) 6-4 4-6 7-6(3) 6-4
16.8%     Antonio Veic           Nikolay Davydenko   French Open      3-6 6-2 7-5 3-6 6-1
17.0%     Sergei Bubka Jr.       Daniel Gimeno       Doha             6-0 6-3
17.1%     Leonardo Mayer         Marcos Baghdatis    French Open      7-5 6-4 7-6(6)
17.6%     Ivan Dodig             Robin Soderling     Barcelona        6-2 6-4
17.6%     Michael Yani           Dudi Sela           Newport          7-6(5) 6-3                   

P(UPSET)  WINNER                 LOSER               TOURNEY          SCORE
17.7%     Frank Dancevic         Feliciano Lopez     Johannesburg     6-7(5) 6-2 7-6(8)
17.8%     Alexander Dolgopolov   Robin Soderling     Australian Open  1-6 6-3 6-1 4-6 6-2
17.9%     James Ward             Samuel Querrey      Queen's Club     3-6 6-3 6-4
17.9%     Jan Hernych            Sergey Stakhovsky   Halle            6-3 6-7(5) 7-6(8)
18.2%     Jan Hernych            Denis Istomin       Australian Open  6-3 6-4 3-6 6-2
18.3%     Jo-Wilfried Tsonga     Roger Federer       Wimbledon        3-6 6-7(3) 6-4 6-4 6-4
19.0%     Milos Raonic           Fernando Verdasco   San Jose         7-6(6) 7-6(5)
19.0%     Lukasz Kubot           Gael Monfils        Wimbledon        6-3 3-6 6-3 6-3
19.2%     Federico del Bonis     Sergey Stakhovsky   Stuttgart        6-4 6-3
19.3%     Lukasz Kubot           Nicolas Almagro     French Open      3-6 2-6 7-6(3) 7-6(5) 6-4    

P(UPSET)  WINNER                 LOSER               TOURNEY          SCORE
19.3%     Bernard Tomic          Feliciano Lopez     Australian Open  7-6(4) 7-6(3) 6-3
19.7%     Thomaz Bellucci        Andy Murray         Madrid Masters   6-4 6-2
19.7%     Pavol Cervenak         Victor Hanescu      Stuttgart        6-3 7-6(6)
19.7%     Richard Gasquet        Roger Federer       Rome Masters     4-6 7-6(2) 7-6(4)
19.9%     Rajeev Ram             Grigor Dimitrov     Atlanta          6-4 6-4
20.0%     Philipp Kohlschreiber  Robin Soderling     Indian Wells     7-6(8) 6-4

Prospect Rankings, 8 August 2011

I can’t believe it’s been three months and two grand slams since I’ve done one of these! Plenty has happened in the meantime, especially for Bernard Tomic, Wimbledon quarterfinalist. Tomic’s achievements have moved him into the top 100, into the top 3 20-and-unders, and the top 10 22-and-unders–quite a mark for an 18-year-old.

Note also that at the bottom of the 18-and-under list, there are a couple of 17-year-olds, plus Jiri Vesely, who only recently turned 18. I wouldn’t be surprised to see Vesely at the top of the 18-and-under list before his next birthday.

18 AND UNDER
68   Bernard Tomic                AUS   10/21/92  
326  Denis Kudla                  USA    8/17/92  
347  Diego Schwartzman            ARG    8/16/92  
356  Benjamin Mitchell            AUS   11/30/92  
390  Guilherme Clezar             BRA   12/31/92  
406  Tiago Fernandes              BRA    1/29/93  
418  Alexander Rumyantsev         RUS    8/16/92  
440  Roberto Carballes-Baena      ESP    3/23/93  
510  Jozef Kovalik                SVK    11/4/92  
531  Victor Baluda                RUS    9/30/92  
549  Jack Sock                    USA    9/24/92  
576  Taro Daniel                  JPN    1/27/93  
628  Micke Kontinen               FIN   12/18/92  
629  Jiri Vesely                  CZE    7/10/93  
648  Suk-Young Jeong              KOR    4/12/93  
669  Liam Broady                  GBR     1/4/94  
673  Edoardo Eremin               ITA  10/5/1993  
684  Jason Kubler                 AUS    5/19/93  
685  Mitchell Frank               USA   10/16/92  
698  Andres Artunedo-Martinavarr  ESP    9/14/93  

20 AND UNDER
26   Milos Raonic         CAN  12/27/90  
56   Grigor Dimitrov      BUL   5/16/91  
68   Bernard Tomic        AUS  10/21/92  
76   Ryan Harrison        USA    5/7/92  
140  Jerzy Janowicz       POL  11/13/90  
150  Cedrik-Marcel Stebe  GER   10/9/90  
185  Pablo Carreno        ESP   7/12/91  
197  Federico del Bonis   ARG   10/5/90  
209  Tsung-Hua Yang       TPE   3/20/91  
211  Facundo Arguello     ARG    8/4/92  
230  Javier Marti         ESP   1/11/92  
233  Marius Copil         ROU  10/17/90  
239  Laurynas Grigelis    LTU   8/14/91  
271  Axel Michon          FRA  12/16/90  
283  Gastao Elias         POR  11/24/90  
284  Alexander Lobkov     RUS   10/7/90  
305  Christian Lindell    SWE  11/20/91  
318  Daniel Cox           GBR   9/28/90  
319  Stefano Travaglia    ITA  12/28/91  
325  Andrey Kuznetsov     RUS   2/22/91  

22 AND UNDER
19   Juan Martin del Potro  ARG   9/23/88  
21   Alexander Dolgopolov   UKR   11/7/88  
26   Milos Raonic           CAN  12/27/90  
29   Marin Cilic            CRO   9/28/88  
48   Kei Nishikori          JPN  12/29/89  
55   Ernests Gulbis         LAT   8/30/88  
56   Grigor Dimitrov        BUL   5/16/91  
68   Bernard Tomic          AUS  10/21/92  
76   Ryan Harrison          USA    5/7/92  
89   Donald Young           USA   7/23/89  
107  Thiemo de Bakker       NED   9/19/88  
115  Thomas Schoorel        NED    4/8/89  
119  Benoit Paire           FRA    5/8/89  
134  Richard Berankis       LTU   6/21/90  
138  Martin Klizan          SVK   7/11/89  
140  Jerzy Janowicz         POL  11/13/90  
150  Cedrik-Marcel Stebe    GER   10/9/90  
155  Vasek Pospisil         CAN   6/23/90  
156  Evgeny Donskoy         RUS    5/9/90  
161  Vladimir Ignatik       BLR   7/14/90

Video: The Present and Future of Statistics in Tennis

Last Tuesday, I gave a talk at the Longwood Cricket Club in Boston about tennis statistics.  Many thanks to Rick Devereaux for extending the invitation, and to everyone at Longwood for their hospitality.  (And for their beautiful grass courts!)

In the talk, I discuss the value of different types of statistics in sports, what tennis stats are out there now, and what we can expect in the not-too-distant future. I also detour into baseball analysis to show some of the potential for research in tennis.  It’s about 36 minutes long.

Apologies for the video quality–the room was dark to accommodate the projector, and my handy little Flip camera could only do so much.  Still, the audio is generally clear.

Enjoy!

ATP Montreal Predictions

The big boys are back in action with this week’s Masters 1000 tournament in Montreal.  I’ve updated my rankings and generated some predictions for this week’s matches.  My system doesn’t give any credit to defending champions, so Andy Murray is in a distant third, while Djokovic’s chances of winning the tournament are lessened a bit by finding himself in the same half of the bracket as Federer.

If you’re visiting for the first time, or the first time since last week, you may be interested in the variety of content I posted over the weekend:

Once you’ve caught up, enjoy my forecast for this week’s Rogers Cup, below.

Player                        R32    R16     QF         W  
(1)Novak Djokovic          100.0%  81.1%  59.3%    23.27%  
Nikolay Davydenko           78.8%  17.5%   7.9%     0.70%  
(q)Flavio Cipolla           21.2%   1.4%   0.2%     0.00%  
Andreas Seppi               30.4%   6.9%   1.0%     0.02%  
Marin Cilic                 69.6%  27.5%   7.2%     0.61%  
Jarkko Nieminen             14.9%   4.9%   0.7%     0.01%  
(16)Juan Martin Del Potro   85.1%  60.6%  23.8%     5.21%  
                                                           
Player                        R32    R16     QF         W  
(12)Viktor Troicki          81.3%  40.4%  19.1%     0.49%  
(q)Michael Yani             18.7%   3.3%   0.6%     0.00%  
Marcos Baghdatis            57.5%  34.7%  17.9%     0.78%  
John Isner                  42.5%  21.7%   9.7%     0.19%  
(q)Alex Bogomolov Jr        41.7%  10.7%   3.4%     0.02%  
Adrian Mannarino            58.3%  19.1%   7.8%     0.10%  
(5)Gael Monfils            100.0%  70.3%  41.5%     2.19%  
                                                           
Player                        R32    R16     QF         W  
(3)Roger Federer           100.0%  91.7%  63.9%    15.48%  
Juan Ignacio Chela          52.4%   4.4%   0.9%     0.00%  
(WC)Vasek Pospisil          47.6%   3.9%   0.7%     0.00%  
(WC)Bernard Tomic           67.8%  29.9%   9.1%     0.51%  
(LL)Yen Hsun Lu             32.2%   9.3%   1.6%     0.02%  
Fabio Fognini               17.3%   5.3%   0.8%     0.01%  
(13)Jo Wilfried Tsonga      82.7%  55.4%  22.9%     2.84%  
                                                           
Player                        R32    R16     QF         W  
(10)Richard Gasquet         52.9%  34.9%  19.5%     0.60%  
Florian Mayer               47.1%  30.1%  16.9%     0.51%  
Andrey Golubev              42.9%  13.9%   5.3%     0.04%  
Thomaz Bellucci             57.1%  21.1%   9.1%     0.10%  
Sergiy Stakhovsky           38.9%  16.3%   6.6%     0.06%  
Philipp Kohlschreiber       61.1%  31.5%  16.4%     0.37%  
(8)Nicolas Almagro         100.0%  52.1%  26.1%     0.46%  
                                                           
Player                        R32    R16     QF         W  
(6)Mardy Fish              100.0%  67.4%  43.5%     3.65%  
Feliciano Lopez             53.7%  16.4%   8.0%     0.16%  
(SE)Radek Stepanek          46.3%  16.1%   7.0%     0.11%  
(WC)Ernests Gulbis          78.1%  41.7%  18.7%     0.56%  
Juan Carlos Ferrero         21.9%   5.3%   0.9%     0.00%  
Michael Llodra              46.3%  23.9%   8.8%     0.20%  
(11)Mikhail Youzhny         53.7%  29.1%  13.1%     0.44%  
                                                           
Player                        R32    R16     QF         W  
(14)Stanislas Wawrinka      59.5%  48.6%  20.7%     1.84%  
David Nalbandian            40.5%  30.2%   9.1%     0.39%  
(q)Michael Russell          37.4%   6.0%   0.7%     0.00%  
Albert Montanes             62.6%  15.2%   2.5%     0.02%  
Pablo Andujar               23.1%   1.4%   0.2%     0.00%  
Kevin Anderson              76.9%  12.7%   4.6%     0.08%  
(4)Andy Murray             100.0%  86.0%  62.1%    12.44%  
                                                           
Player                        R32    R16     QF         W  
(7)Tomas Berdych           100.0%  60.8%  37.2%     2.20%  
Alexandr Dolgopolov         93.0%  38.8%  21.6%     0.78%  
(WC)Erik Chvojka             7.0%   0.4%   0.0%     0.00%  
Ivo Karlovic                38.8%  15.2%   4.7%     0.04%  
Juan Monaco                 61.2%  27.4%  10.4%     0.21%  
(q)Philipp Petzschner       34.4%  16.3%   5.5%     0.07%  
(9)Gilles Simon             65.6%  41.1%  20.7%     0.88%  
                                                           
Player                        R32    R16     QF         W  
(15)Fernando Verdasco       72.5%  44.2%  12.1%     0.65%  
(q)Tobias Kamke             27.5%   9.8%   1.4%     0.01%  
Janko Tipsarevic            70.6%  36.4%   9.8%     0.39%  
(q)Alejandro Falla          29.4%   9.7%   1.4%     0.01%  
Ivan Dodig                  44.4%   6.5%   2.7%     0.04%  
Jeremy Chardy               55.6%   9.7%   4.6%     0.13%  
(2)Rafael Nadal            100.0%  83.8%  68.0%    20.10%

Do Points Get Shorter as the Match Progresses?

On Friday, some interesting ideas were batted around in the comments to my post on the 61-shot rally.  One of the simpler ones boils down to the question that titles today’s post: Do points get shorter as the match progresses?

Two forces seem to work in opposite directions:

  • As players get used to each other’s games (and specifically their serves), more balls get returned.  Before looking at the numbers, I would’ve bet that this was the case, meaning that aces and service winners decline as you go deeper into a match.
  • The longer the match, the more tired the players.  Tired (or even slightly injured) players take more risks and probably have shorter rallies.

To answer the question, I looked at rally lengths shown in Pointstream at the last three grand slams.  That gives us close to 250 men’s matches, all best-of-five sets.

The short, unsatisfying conclusion is: The results are mixed.  At Wimbledon and Roland Garros, rally length increased later in matches–as much as 10% in London and 20% in Paris.  At the Australian Open, the result was the exact opposite, with rally length decreasing substantially.  Perhaps rally length increases in most cases, except when it is extremely hot or the players are not yet in top shape.  The blistering heat in Melbourne is certainly a plausible reason for a decrease in rally length.

As we’ll see when we move into more specific findings, the results get even more jumbled.  It seems that points generally get longer as a match progresses, but not necessarily because players read and return serves better.  While rally lengths increase, the number of one-stroke points (aces, service winners, service return errors) often increases, as well.

Follow the jump for my methodology and full results.

Continue reading Do Points Get Shorter as the Match Progresses?

The Problem With “Unforced Errors”

Italian translation at settesei.it

In any sport, there are a handful of stats that are frequently cited, but are ultimately of limited use.  Often, these statistics tell you something, but are misunderstood to imply something more.  Simple examples are many “counting” stats — points scored in basketball, touchdowns thrown in football, RBI in baseball.  In all of those cases, they indicate something good, but don’t give you context — lots of field goal attempts, a great offensive line, or good hitters on base in front of you, to take those three cases.

The stat in tennis that aggravates me most is the unforced error.  Not only does it ignore some important context (as in the other-sport stats I just mentioned), but it relies on the judgment of a scorer.

Misjudgment

The second problem is the more problematic one.  How much does a number mean if two people watching the same match wouldn’t come up with the same result?  This was a hot-button issue during Wimbledon, when the scorers were assigning an unusually small number of UEs, especially on serve returns.

If you’re watching the match, you might not notice.  If the end-of-set stats show that Nadal had 8 UEs and Federer had 17, that does tell you something … Federer was making more obvious mistakes.  But if you want to compare that to a Nadal/Federer match three weeks ago, or last year, those numbers are all but useless.

I suspect that, at events like Wimbledon, someone from the ITF, or maybe IBM, is giving standardized instructions to scorers with general rules for categorizing errors.  That would be a good start, especially if it were implemented across all tournaments at all professional levels.

…but it doesn’t matter

I suspect that no matter how consistent scorers are, the distinction between “unforced” and “forced” errors will always be arbitrary.  Consider the case of service returns.  There are occasional points, especially on second serve returns, where the returning player misses an easy shot.  But more frequently, the returning player is immediately on defense.  When is an error “unforced” on the return of a 130 mile-per-hour shot?

Ultimately, we will probably have computerized systems that classify errors for us.  If you have all the necessary data and crunch the numbers, a 125-mph serve down the T in the ad court might be returned 60% of the time, meaning there is a 40% chance of an error or non-return.  With those numbers on every serve (and every other shot, eventually), we could set the line for an “unforced” error on a shot that the average top-100 player would make, say, 75% of the time.  Or we could have different classifications: “unforced errors,” “disastrous errors,” “mildly forced errors,” and so on, indicating different percentage ranges.

The problem we have now is that professionals are so good (and their equipment is so advanced), that almost every shot can be offensive, meaning that players are almost always–to some extent–on defense.  If you’re rallying with Nadal, you might hit some winners, but you’re always fighting the spin.  If you’re rallying with Federer, the spin isn’t so bad, but you’re always trying to keep the ball away from his forehand.  (If you’re rallying with Djokovic, you’re wishing you had hit a better serve.)  That perpetual semi-defensive posture means that nearly every error is, to some extent, forced.  And because players are so good, we expect them to return every reachable ball, suggesting that nearly every error is, to some extent, unforced.

Yikes!

The wisdom of baseball analysts

A very similar problem arises in baseball.  If a fielder makes a misplay (according to the official scorer), he is charged with an “error.”  Paradoxically, some of the best fielders end up with the highest error totals.  If, say, a shortstop has great range, he’ll reach a lot of groundballs, and have more chances to make bad throws, thus racking up the errors.

For decades, fans considered errors to be the standard measure of defensive prowess–a stat called “fielding percentage” measures the ratio of plays-successfully-made to chances.   (In other words, 1 minus “error rate.”)  But because of the paradox mentioned above, the highest fielding percentages do not necessarily belong to the best fielders.

The solution: Ignore errors, look only at plays made.  (This is an oversimplification, but not by much.)  If Shortstop A makes more plays than Shortstop B, it doesn’t matter whether A makes more errors.  The guy you want on your team is the one who makes more plays.

Essentially, baseball errors correspond to tennis unforced errors, and baseball plays-not-made (shortstop dives for the ball and can’t reach it) correspond to tennis forced errors.  The stat that ends up mattering to baseball analysts–“plays made”–corresponds to “shots successfully returned.”  The analogy is imperfect, but it illustrates the problem with separating one type of non-play from another.

Solutions

If we don’t distinguish between different types of errors, we’re left with “shots made” and “shots not made,” or–even less satisfactorily–“points won” and “points lost.”  Not exactly a step in the right direction, since we’re already counting points!

Still, I suspect it’s better to have no stat than to have a misleading stat.  Rally counts are a positive step, since we can look at outcomes for different types of points.  If you win a lot of 10-or-more-stroke rallies, that identifies you as a certain type of player (or playing a certain kind of match).  It doesn’t matter whether you lose that sort of point on an unforced error or your opponent’s winner–both outcomes might stem from the same tactical mistake three or four strokes sooner.

Either that, or we can wait until we can calculate real-time win probability and start categorizing errors with extreme precision.  “Unforced errors” aren’t going away any time soon, but as fans, we can be smarter about how much attention we grant to individual numbers.

The Simon/Monfils 61-Shot Rally: In Perspective

A couple of weeks ago, Gael Monfils and Gilles Simon made the unorthodox decision of extending their warm-up into the first game of the match.  Or somthing.  At 40-40 in the opening game, they counterpunched each other into oblivion, needing sixty-one shots before Monfils finally sent a slice long to end the point.

If you haven’t seen it, or you suffer from insomnia, click the link here.

What might be most remarkable about the rally is that, when Monfils made his error, there was no sign of the point drawing to a close — it isn’t hard to imagine those two hitting another 61 shots like that.  But even at 61, it’s an awfully long point.

So (asks the statistician) … how long was it?  Rally length is not widely available for ATP matches.  But thanks to IBM Pointstream, I do have rally length for each point on a Hawkeye court from the French Open.  (I’ve played around a bit with those numbers.)

From the French Open, we have roughly 20,000 men’s points to look at, which doesn’t count double faults.  About 35% of those points lasted only one stroke: an ace, a service winner, or an error of some sort on the return.  Only 15% of the points went 8 strokes or longer, and fewer than 10% reached 10 strokes.

In the entire tournament, only 12 rallies hit the 30-shot mark–only halfway to the Simon/Monfils level.  You won’t be surprised at most of the names involved in those dozen extreme points:

Mardy Fish    Gilles Simon       38  
Andy Murray   Viktor Troicki     37  
Gilles Simon  Robin Soderling    36  
David Ferrer  Sergiy Stakhovsky  33  
Andy Murray   Viktor Troicki     33  
David Ferrer  Gael Monfils       33  
Rafael Nadal  Pablo Andujar      32  
Tobias Kamke  Viktor Troicki     31  
David Ferrer  Sergiy Stakhovsky  31  
Rafael Nadal  Andy Murray        31  
Rafael Nadal  Pablo Andujar      30  
Andy Murray   Viktor Troicki     30

Both Simon and Monfils make an appearance, with Ferrer, Murray, and Nadal showing up multiple times.  What surprises me a bit are some of the guys who hung in there with the counterpunchers, especially Fish and Troicki.

In any event, 61 shots still stands out as a once-in-a-blue-moon accomplishment.

WTA rally length

Incidentally, you might suspect (as I did) that some WTA players would slug it out even longer.  Again using Pointstream data from the Hawkeye courts at the French, it turns out that ladies only reached the 30-shot threshold twice.  First, Marion Bartoli went to 33 against Olga Govortsova, and Na Li got to 32 shots against Silvia Soler-Espinosa.  The tongue-tying Wozniacki-Wozniak matchup comes in third, with a 28-stroke rally.

Wimbledon rallies

While we’re at it, let’s check the Wimbledon data.  Surprise, surprise–tied for the longest rally of the tournament is a 31-stroke exchange between Juan Martin del Potro and … Gilles Simon.  In fact, that match featured four of the 20 longest rallies of the tournament.

Also notable is Novak Djokovic, who reached 31, 30, and 29 against Bernard Tomic, and 25 (twice) and 24 against Marcos Baghdatis.

The true oddity in the top ten is John Isner and Nicolas Mahut, who somehow took a break from aces and errant groundstrokes to go 25-deep.  It was the  only point of the match that went longer than 12 shots.

 

Stuttgart, De-Seeded

At the Mercedes Cup in Stuttgart this week, only two rounds have been completed, and all eight seeds are gone.  It isn’t even a particularly weak top of the field–five of the eight seeds are ranked in the top 20, and all eight are 37th or better.

Six of the eight lost their first-rounders, including #1 Gael Monfils (to Hanescu) and #2 Jurgen Melzer (to Giraldo).  The remaining two seeds–#3 Mikhail Youzhny and #8 Guillermo Garcia-Lopez–lost today.  Youzhny may be the only man in the draw without something to be ashamed of–he won a match, then lost to Juan Carlos Ferrero on clay.

The remaining draw almost makes Newport look good.  Of the eight unseeded players, we have two wild cards (Cedrik-Marcel Stebe and Lukasz Kubot) and two qualifiers (Pavol Cervenak and Federico Del Bonis).  The two qualifiers will play each other tomorrow, so at least one man from the qualifying draw will reach the final four.

It’s a project for another day, but it would be interesting to see which tournaments are most upset-prone.  The post-Wimbledon clay circuit seems like a prime contender, if only because of its awkwardness on the schedule.  And as friend-of-HT Tom Welsh pointed out, there seems to be a post-Davis Cup swoon, evident at Stuttgart with the losses to Mayer and Monfils.

Rik De Voest, Man on the Cusp

You don’t have to read much of this site to know that I am particularly interested in the second tier of pros.  Some of that is due to spending countless hours at the U.S. Open qualifying tournament; the rest may be attributable to a general tendency to root for the underdog.  So, I tend to be as familiar with guys in the 140s of the rankings as I am with the men in the 40s.

One of those men is South African Rik De Voest.  If you’ve followed the ATP for long, you’ve doubtless seen his name.  He’s a lock for a wild card at the Johannesburg event, he plays many events on the U.S. challenger circuits, and he occasionally qualifies for other top-level tourneys.  He’s a strong all-around player, though perhaps mentally weak–I’ve seen him play a handful of times, and while he’s rarely blown out, he’s prone to giving up the lead.

The impetus for this mini-post is my discovery that Rik De Voest has never cracked the singles top 100.  He broke into the top 200 almost nine years ago, has not fallen out of the top 300 in that time, and reached a peak of 110 in 2006.  He turned 31 last month, so while he currently sits at 130, moving into double-digits gets more difficult every day.

I suspect that De Voest’s record as a sub-top-100 player is very uncommon.  Each year, many players reach the top 100 with nothing more than a handful of solid showings at challenger events–two of the many current players to fit that mold are Steve Darcis (#95) and Matthias Bachinger (#93).  While the top 100 may be a mental hurdle, the difference between 110 (De Voest’s peak) and 99 is almost meaningless.  In the rankings right now, it’s 17 points–less than the difference between winning and losing in the quarterfinals of many challengers.

Right now, about 80 points stand between the South African and the top 100.  That’s a taller order, but still an achievable one for a player of De Voest’s caliber over the course of a few months.  Depending on which statistical oddity you prefer, you may or may not want to root for him.  If he reaches the top 100, he’ll be one of the oldest players ever to do so.  If he doesn’t, he may well end up with the record for most weeks inside the top 200 (or 150, or 250, or 300) without ascending to the slightly-more-rarefied first page of the ATP singles rankings.

Doubly Lopsided Matches

Italian translation at settesei.it

On Sunday, Novak Djokovic beat Rafael Nadal by a somewhat unusual score: 6-4 6-1 1-6 6-3.  A four-setter in the final doesn’t raise any eyebrows, but a 1-6 set … that’s a bit of a head-scratcher, especially on a fast surface.  Wimbledon is better known for server domination, which means 6-4’s, 7-5’s, tiebreaks, and the occasional 70-68.

The Djokovic-Nadal score got me curious about two questions:

  1. How often does a player lose a set 1-6 (or even 0-6) yet still win the match?
  2. How often does a player both win and lose a lopsided (6-1 or 6-0 ) set?

(Note: Yes, sometimes a 6-1 set includes only two breaks, in which case it is similar to a 6-2 set.  Yet 6-1/1-6’s are far less frequent that 6-2/2-6’s.  It would be nice to distinguish “two-break” 6-1’s from “three-break” 6-1’s, but for now, all we can do is enjoy the trivia and accept the limitations.)

Bi-directional bagels

First things first.  As we might guess, scores such as these are extremely rare at Wimbledon.  This year, the final was one of only two such matches.  The other was Xavier Malisse’s second-round win over Florian Mayer, which went in the books as 1-6 6-3 6-2 6-2.  Last year, only one Wimbledon match qualified: a first-rounder between Victor Hanescu and Andrey Kuznetsov.  Oddly enough, Hanescu dropped the third set 1-6 after splitting two tiebreaks.  In neither of these matches did the winner take his own lopsided set, as Djokovic did.

In this department, Wimbledon remains unique among the majors–it isn’t just a matter of “clay” and “everything else.”  At this year’s Australian Open, there were eight matches with 1-6 or 0-6 scores; last year there were 11.  At the 2010 US Open, there were six.  These scores are more common at the slams, because the five-set format makes it more likely that the loser of an early set (by any score) can come back to win the match.

The numbers

Last year, there were roughly 2600 tour-level matches that were played to their conclusion.  (That is, neither player retired.)   Of those, about two-thirds were straight-set victories, leaving us with 871 matches that went three sets (or five, at the slams).

Of those 871, only 94 matches contained a 1-6 or 0-6 set, and only 30 included a “lopsided” set in favor of both players, as in the Nadal-Djokovic final.  Both have been somewhat less frequent so far this year; in 1546 matches, 48 saw the winner lose a lopsided set, and 11 saw both players lose a lopsided set.  Combining the two years of data, the likelihood that any given match will include a 6-1 (or 6-0) and a 1-6 (or 0-6) is almost exactly 1 in 100.  Again, the five-set format of the slams increases the probability a bit, while the fast courts at Wimbledon have the reverse effect.

The offenders

Which players find themselves in these roller-coaster matches?  To answer that question, we have to stick with the less-specific filter of matches that include a 1-6 or 0-6 set.  If we also require a 6-1/6-0 from the winner, there isn’t enough data to make things interesting.

One might guess that the strongest servers would be far down the list, while counterpunchers populate the top.  That isn’t the case.  The players who are known for mental lapses–regardless of their serving and returning skills–seem to dominate the upper tier.

Looking at all tour-level matches from 2007 through last week, we find that Andy Murray takes the cake.  He has played in 18 of these matches, dropping a lopsided set in 10 of his victories, while winning a lopsided set in 8 of his losses.  Murray is in a class by himself–number two on the list is Guillermo Garcia-Lopez, at 13.  In third place is Djokovic, with 12 (he is 8-4 in such matches), though the Wimbledon final was the only occurence so far in 2011.

Twelve men are clustered at 10 and 11 of these matches, and the list features a lot of Frenchmen, and several other players known for questionable mental strength:

  • 11: Julian Benneteau, David Ferrer, Fabio Fognini, Fernando Verdasco
  • 10: Thomaz Bellucci, Mardy Fish, Richard Gasquet, Paul-Henri Mathieu, Phillipp Petzschner, Tommy Robredo, Radek Stepanek, Jo-Wilfried Tsonga

Of these, Fognini (9-2) and Tsonga (8-2) have the dubious honor of winning the most matches–that is, they are on the list because they drop lopsided sets in matches that they win.  Mathieu (2-8) is at the other extreme, dominating sets in the middle of losses.

The Wimbledon final was a rarity for Nadal–it was only the fourth time he’d been involved in a match with this sort of score, and it was only the second time he won a lopsided set in the middle of a loss.  Roger Federer has only played in three such matches.

We probably can’t read too much into these numbers, but it is interesting to see so many of the same types of players show up at the top of a list.  At the very least, we’ve learned that the 1-6 set in Sunday’s final was quite rare, and the 6-1 1-6 sequence was even rarer.