Beneath The Surface Of Tennis

When trying to forecast the outcomes of our decisions, one of the more widely used mental models differentiates between first-order and second-order consequences. First-order consequences are the more obvious, surface-level ones. They also tend occur relatively early after the decision is made. Understanding second-order consequences requires a deeper analysis, looking beyond the obvious. Second-order consequences also tend to manifest themselves after a certain period of time has elapsed following the decision has been made.

For example, let’s say a tennis player decides to skip a warm-up at the beginning of a training session. Through a first-order lens, this might be a positive; the player has an extra 10 or 15 minutes to work on her strokes. But through a second-order lens, the skipped warm-up looks more like a negative; the player increases her chances of injury, is not physically prepared for the first few drills of the session, and might learn to place a lesser importance on her physical preparation overall.

A few examples of first-order thinking in the tennis world that have recently caught my eye are:

Don’t work on overheads, you never hit them in matches. This statement is true on the surface. In matches, we hit way more groundstrokes, serves, and returns than we do overheads. Practice time is limited, so why waste it on a stroke that we might hit once a set. If we look a bit deeper though, we realize that a player who doesn’t spend any time on her overheads will not be confident in that stroke. As a result, that player won’t be comfortable coming to the net during the matches. Spending time on the overhead, even though it is a relatively infrequently hit shot, is an important aspect of developing attacking tennis players with a well-rounded game.

Focus only on singles. When playing tournaments, doubles can often be an afterthought. After all, the majority of practice time is traditionally spent on singles, and very few juniors initially aspire to be outstanding doubles players. When looking a bit deeper though, if doubles is not taken seriously, we miss out on an opportunity to work on serves and returns – the two most important strokes in tennis – in a match situation. We don’t take advantage of thinking through movement, angles, and positioning in a different way than in singles. Finally, we don’t learn how to win and lose as a team, and how to communicate with our partner – all skills that extend beyond the tennis court.

Only practice with players better than you. Initially, that seems to make sense. When training with somebody who is better than us – hits the ball harder, is in better shape, can exploit our weaknesses effectively – we can clearly see which areas of our game need improvement. In a group setting, if we are one of the weaker players, it forces us to focus harder and push ourselves more than if we were one of the stronger players. Upon a deeper examination though, playing with weaker players has a lot of positives too. We can adjust our game style to work on things that are outside of our comfort zone. For example, serve and volley once a game. Play only through the middle. Tell yourself that you’re not allowed to hit winners and instead outlast your opponent. Play with only one serve. Start every game at 0-30. There are countless ways how to make practice productive when we are the stronger player.

I’m sure there are a lot of other examples of first and second order thinking in tennis. If you think of some good ones, leave them in the comments!

Nick Kyrgios The Tactician

It would be an understatement to say that Nick Kyrgios is a polarizing figure in the world of professional tennis. He is a Grand Slam champion, and one of the most talented athletes in our sport. At the same time, one can never be quite sure what is going to happen when watching one of his matches. Will we see a brilliant performance, worthy of a Top 10 player? Will we see indifference, broken rackets, and arguments with the chair umpire? One is as likely as the other.

In the second round of the Miami Open, we got to see the former. Kyrgios demolished Andrey Rublev, currently ranked #7 in the world, in 52 minutes by the score of 6-3 6-0. During the course of the match, we caught a glimpse of a side of Nick Kyrgios that doesn’t get spoken about very often: Kyrgios the tactician, executing a game plan to perfection.

Going into the match, Kyrgios knew that “…he’s (Rublev) a player who relies on a bit of rhythm, so I just tried to keep the points short and sharp, just play aggressive.” I want to highlight one way that Kyrgios set himself up to do just that.

If there is a knock on Rublev’s game, it is that his second serve is relatively slow. Below is a chart comparing the average serve speeds between Kyrgios and Rublev from their match:

Average Serve SpeedNick KyrgiosAndrey RublevDifference
1st serve207 km/h193.6 km/h13.4 km/h
2nd serve171.9 km/h143.8 km/h28.1 km/h
Courtesy ATPTour.com

Attacking Rublev’s second serve would be a great starting point to keep the rallies short and putting immediate pressure on the Russian. And Kyrgios did just that, with his positioning and impact point. This is the average return position of Kyrgios in the match when returning Rublev’s second serve:

Courtesy ATPTour.com

Two meters inside the baseline is about as aggressive as you can be with hitting the return on the rise and taking time away from the opponent. The beauty of this return strategy – when looking at it through the lens of Kyrgios’ overall tactic of keeping the points short – is that there is very little downside. If you hit a quality return, great! You’re most likely ahead in the rally, as there is a high chance that Rublev is rushed and off-balance on the first groundstroke after the serve. If you miss the return – the rally length was 1, and Rublev doesn’t get a chance to get into any sort of rhythm.

The strategy worked beautifully. Rublev won only 4 out of the 19 points played on his second serve (that’s 21%; generally speaking, we would like the server to be around the 50% range). Furthermore, almost three quarters of all points contested in the match were played in the 0-4 rally range:

Rally LengthKyrgios WonRublev WonTotal% of Total
0-442236573%
5-8971618%
9+2689%
Courtesy ATPTour.com

Just for comparison, here is Kyrgios’ second serve return position in his third round match against Fabio Fognini. He still returned from inside the baseline, but he toned down the aggressiveness:

Courtesy ATPTour.com

In the round of 16 match, Kyrgios is set to face Jannik Sinner. I would assume that Kyrgios won’t be looking to engage in prolonged baseline exchanges in that match either. His second serve return positioning will give you a hint as to the quality of Sinner’s second serve, as well as the style of match that Kyrgios will want this to be.

Carlos Alcaraz and Second Serve Placement

After the conclusion of the 2022 Australian Open, all the talk on the men’s side – and rightfully so – has been about Rafael Nadal and his 21st Grand Slam title. As much as Nadal has meant to the sport, it is no secret that the proverbial clock is ticking, and he is at the very tail end of his magnificent career. As one of the legends of tennis nears retirement, Spanish tennis has another young superstar waiting in the wings. I would never put the label of “heir apparent” to Nadal on anyone; Nadal’s achievements are unique and unmatched. But Carlos Alcaraz might be the next Spanish player to sit on the throne of men’s professional tennis.

Alcaraz is 18 years old, and ranked #29 in the world as of January 31st, 2022. If you haven’t seen him play, he is far from your typical Spanish clay court specialist. His is a complete attacking game, with the power to dial up a first serve in the 130mph range, control the rallies with his massive forehand, but also transition to the net and hit the occasional drop shot . The scary thing for the rest of the professional tennis world is that at 18 years of age, Alcaraz can only get better. And one area, where Alcaraz can elevate his game, is his 2nd serve placement.

Let’s use Alcaraz’ 3rd round match at the Australian Open against Matteo Berrettini as an example. A cursory look at the surface statistics doesn’t raise any red flags: Alcaraz won an exceptional 61% of his second serve points (anything over 50% is considered a good day in the office). Could this number have been even higher? This is the placement of all of Alcaraz’ 2nd serves from the match:

Courtesy Ausopen.com

On both sides of the court, Berrettini could have eliminated a third of the service box when he was receiving Alcaraz’ second serve. Alcaraz hit 51 second serves in the match, and only two of them were directed at Berrettini’s forehand. On the one hand, the tactic was working – as demonstrated by the 61% second serve winning percentage. On the other hand, it will free Berrettini to do something like this (at the 1 minute 46 second mark):

Berrettini breaks Alcaraz in the first game of the second set by forcing an error behind an aggressive inside-in forehand. What allowed Berrettini to set his feet and really unload on that forehand was the depth and quality of his backhand return. If Berrettini knows that Alcaraz will not serve his forehand in that situation, he can either run around the second serve to start the rally with a forehand return, or – like he did here – step inside the court, wait for the serve with a backhand grip, pick a return target before Alcaraz even hits the serve, and get on offense right away.

Was this maybe just a tactic against Berrettini? Is Alcaraz more unpredictable with his second serve location in other matches? Here is the placement of his second serves from this 2nd round match against Dusan Lajovic:

Courtesy Ausopen.com

Even in this match, Alcaraz served exclusively into the backhand and body of Lajovic. Alcaraz ended up winning this match rather easily in three sets, but even then the location of his second serves, from both sides of the court, was predictable.

How can a player get more comfortable with hitting a particular area of the service box with his second serve? The first step, as it usually does, happens in practice. You could play a practice set where you’re only allowed to hit second serves out wide in the deuce side, and down the T in the ad side. The second step would include hitting the second serve to that area in matches, in low leverage situations. For example, if you’re up 40-0 in a game, go for the second serve down the T. Even if you lose the point, you’re still heavily favored to win the game, serving at 40-15. Alcaraz won the first two sets against Lajovic 6-1 6-2. Firmly in control of the match, he still didn’t hit any second serves to the Lajovic forehand in the third set. Finally, the third step is hitting that serve in matches, in high leverage situations: tie-breaks, when facing break points etc.

For Carlos Alcaraz, the sky is the limit. He is one of my favorite players to watch and will be the face of men’s professional tennis for years to come. He already plays with an aggressive mentality from the baseline; harnessing that same mindset with his second serve locations should not be too difficult of a task.

Predicting wOBA Using Process-based Statistics

When trying to determine a batter’s overall offensive value using a single statistic, one of the most popular metrics to use is the weighted on-base average (wOBA). wOBA is calculated as a ratio of a linear combination of “outcome” statistics (unintentional walks, hit-by-pitches, singles, doubles, triples, and home runs) divided by, essentially, the number of plate appearances.

With that being said, could one predict whether a given player’s wOBA will be above a certain threshold using “process” statistics such as plate discipline and batted ball parameters? In particular, if we know a player’s, say, zone contact rate, chase rate, and average exit velocity, could we predict with any confidence whether that particular player’s wOBA will be above, say, .320?

Using Statcast data and a bit of machine learning, I have decided to train a shallow neural network to try to do just that. I will post snapshots of the Jupyter Notebook throughout the analysis to make it a little easier to follow.

Dataset

My dataset was downloaded from Statcast (creating a custom leaderboard) and included all qualified batter-seasons from 2015 until June 29th, 2021 – 2015 being the first year for which Statcast data are available. This resulted in a set of 989 player-seasons.

For every player-season, I collected the following nine statistics: wOBA, exit velocity average, barrel batted rate, zone swing percentage, zone swing and miss percentage, zone contact percentage, out-of-zone swing percentage, out-of-zone swing and miss percentage, and out-of-zone contact percentage.

I also created a column that was either “1” if wOBA was >= .320 for that particular player-season, and “0” otherwise. This would be the “true label” the neural network would try to predict. I picked the .320 threshold for wOBA, as that is roughly the league-average. In effect, the network would learn to differentiate between a below-average offensive performer, and an above-average one. Finally, I normalized all the column inputs, as, for example, the exit velocities are on a different scale than the statistics expressed in percentages.

Network Architecture

After a bit of trial and error, I settled on the following network architecture. The input layer had either six, seven, or eight units, depending on how many of the features I used in that particular scenario (this will make more sense further along in the analysis). Following the input layer, there were three fully connected layers with 8 units each, and a single-unit output layer, making the prediction. This is a binary classification problem – i.e. the network will make a prediction of either “1” if it thinks the wOBA of the batter will be greater than or equal to .320 given the input data, or “0” if it thinks the wOBA will be less than .320 – and so a single neuron in the output layer is sufficient. Below is a visual representation with six units in the input layer.

How did I arrive at 8 units in a hidden layer? Since at most I would use 8 input features, I picked that as the number of units in the first hidden layer. I wanted to keep the number of units consistent across layers for simplicity. And how did I decide on three hidden layers? I simply did a run with two hidden layers, and then one with three, and I got better results with three. Going to four started overfitting the training data, and so I settled on three hidden layers.

(For the sake of brevity I won’t go into detail of activation functions, regularization, loss functions etc. here in the body of the article. I will link to the code at the bottom, and feel free to hit me up for additional details.)

With the network architecture in place, I ran through four different scenarios, or four different combinations of input features, while keeping the network architecture constant. I’ll outline the results first, followed by a brief discussion.

Scenario #1: Plate Discipline Only

With what probability could we predict whether someone’s wOBA is over .320 using only plate discipline statistics, while knowing nothing at all about what happens when bat meets ball? This was my first scenario. In particular, the input features used in the training set were – all normalized – zone contact rate, zone swing rate, zone swing and miss rate, outside zone contact rate, outside zone swing rate, and outside zone swing and miss rate.

I had 80% of my overall dataset in the training set, and 20% in the test set. The network is trained on the training set, and the test set is used to gauge the accuracy of the network on data it hasn’t seen before. This resulted in 791 items in the training set, and 198 items in the test set. Here are the results after the network has learned its parameters following 15 passes through the training set:

Test set performance for scenario #1:

That’s about a 67% prediction accuracy on the training set, and about 69% on the test set. In other words, the probability that the network will be able to correctly predict whether a hitter’s wOBA will be above .320, using nothing more than their plate discipline statistics, is about 0.7. The fact that the training and test set accuracies are reasonably close – the test set accuracy actually being a bit higher – means that the network is not overfitting the training set either.

Scenario #2: Plate Discipline + Exit Velocity

While 70% is not a bad starting point, how much more accurate could the predictions of the network get if I added a feature with some actual batted ball information? For the second scenario, I added a seventh feature – the average normalized exit velocity. Here is the performance on the training set.

(As a side note, the training and test set splits were fixed for all the different scenarios. What this means is that the same 791 player-seasons were used in the training set every time.)

Test set performance for scenario #2:

The accuracy increased on both the training set and the test set; we’re now in the ballpark of 0.7 – 0.75 probability of the network making the correct prediction as to whether someone’s wOBA will be above .320 or not. Intuitively this makes sense: wOBA is calculated based on batted ball outcomes (and walks), and so adding a relevant batted ball parameter as a feature – such as exit velocity – should increase the accuracy of any wOBA prediction.

Scenario #3: Plate Discipline + Barrel Rate

Would using a barrel rate instead of the exit velocity lead to more accurate predictions? After all, the barrel rate combines two batted ball features – exit velocity and launch angle. Maybe the addition of the launch angle component would help improve accuracy. For scenario #3, I used seven features in the input layer again: the six plate discipline statistics, and the average normalized barrel rate. Here is the performance on the training set:

Test set performance for scenario #3:

The predictions of the network using the barrel rate as the seventh feature increased the accuracy of predictions compared to just using the plate discipline statistics alone, but they were less accurate than the predictions generated using the average exit velocity as the seventh feature. As to why average exit velocity led to better predictions than barrel rate – I’m guessing it’s because it is a more granular feature.

Let’s say Batter A hits three balls – a “barrel” at 97mph, and two “non-barrels” at 92mph. And let’s say Batter B hits three balls – a “barrel” at 97mph, and two “non-barrels” at 82mph. Their barrel rate will be the same, yet the average exit velocity will be different. Either way, the exit velocity provided the network with “more useful” information than the barrel rate did.

Scenario #4: Plate Discipline + Exit Velocity + Barrel Rate

For the final scenario, I used eight input features: the six plate discipline measures, the average normalized exit velocity, and the normalized barrel rate. Theoretically, this should lead to the most accurate prediction, as we’re adding the most detailed batted ball information to the plate discipline measures. This is the performance of the network on the training set:

Test set performance for scenario #4:

Utilizing all eight of the available features puts us in the ballpark of 80% accuracy of predictions. The fact that adding the barrel rate increased the accuracy as compared to the exit velocity alone, passes the smell test: while barrel rate contains some of the exit velocity information in it, it is sufficiently distinct from exit velocity so that it proved useful having it as a separate feature.

Summary & Discussion

ScenarioTraining Set AccuracyTest Set Accuracy
Plate Discipline66.92%69.19%
Plate Discipline + EV73.52%75.25%
Plate Discipline + Barrel70.16%71.72%
Plate Discipline + EV + Barrel77.57%81.31%

As it turns one can get about 80% of the way towards predicting whether someone will be an above average offensive contributor using their plate discipline statistics, their average exit velocity, and their barrel rate in this particular setup. One of the advantages of using a neural network is that the network is able to learn the various non-linear interplays between the input features. For example: let’s say a player has a relatively high out-of-zone chase rate. How high of an outside-of-zone contact rate would he need to have, keeping everything else constant, to get his wOBA over .320? Is it realistic? Or let’s say a player is currently sitting at a wOBA of .310. If we keep his plate discipline statistics constant, how much harder would he have to hit the ball to get his wOBA over .320? There are usually multiple avenues to improve a batter’s performance. Once the network is trained, its predictions can serve as a starting point in evaluating which of the avenues to explore, and which would require an improvement that might be beyond the batter’s reach.

To further improve the performance of the network past the 80% accuracy, there are two ways that one could take. Either change the network architecture, such as the number of hidden layers, the number of units in a layer, the activation functions etc. Or, use additional features that the network could find useful. For example, one could incorporate the percentage breakdown of pull-straight-opposite field hits for a batter. A batted ball with a certain exit velocity and launch angle hit directly over second base could be a single, while a batted ball with the same characteristics hit down the line could go for extra bases. Furthermore, since the test set accuracy actually exceeds the training set accuracy in all four scenarios, simply obtaining additional data is not likely to improve the network’s performance.

Finally, I’m sure that this is a baby version of what major league teams use. If the network’s output and the actual wOBA of a player disagree, the player could be candidate for regression, warrant a deeper dive into their data, or an additional look by the scouts. It would also be interesting to see, how effective minor league plate discipline and exit velocity data would be in predicting major league wOBA using a setup similar to this one.

For those interested, code for the neural network here.

Forehand: Offense, Backhand: Defense

By now it has been well established, that the majority of points in men’s professional tennis are shorter than 4 total shots, or 2 per player. Yet the most exciting spectacle in a tennis match is the extended rally, with players exchanging groundstrokes, battling for court position, and looking to exploit any small opening to gain the upper hand. Unreturned serves might be more common, but the long points earn the standing ovations and make the highlight reels.

With that being said, I wanted to see which groundstroke statistics correlated the most with actually winning the match in the 2021 French Open men’s singles main draw. Let’s use Novak Djokovic’s finals victory over Stefanos Tsitsipas as an example.

Courtesy RolandGarros.com

For the purpose of this analysis, I will group winners and forcing shots into one category : “offense.”

Looking at the table above, we see that Djokovic hit more forehand and less backhand groundstrokes overall than Tsitsipas did. Digging a bit deeper, Djokovic led the forehand offense category 47-32, but he also made more unforced errors on the forehand than Tsitsipas, 20-19. Switching over to the backhand wing, Djokovic is once again better than Tsitsipas in the offense category, 10-6, and he also made less backhand unforced errors than Tsitsipas, 10-13.

Which of these statistics have the most predictive power? In particular, if you didn’t know the outcome of the match, and could only pick a few of these comparisons to help you make an educated guess, which ones should you look at?

The groundstroke statistics are only available for 55 of the men’s singles matches contested in the French Open main draw, less than a half of the matches played. Despite the limited sample size, there are some interesting patterns worth exploring.

Forehand Raw Count

Let’s start with just looking at the number of forehands hit in a match. The forehand is understood to be the “sword” – an attacking weapon, while the backhand is more of a “shield” – a predominantly defensive tool in a player’s arsenal. Would it be enough just to see which player used the sword more?

Winner More Forehands HitLoser More Forehands HitEqual
Match Count30241

It turns out that just knowing who hit more forehands in a match provides little information about the outcome. In only about 55% of the matches in the dataset did the eventual winner hit more forehands than their opponent; slightly better than a coin flip. Intuitively, this makes sense: one, we don’t know anything about the outcome of those forehands. And two, some players are more comfortable with their backhand than others, utilizing, for example, the run-around forehand less, thus decreasing their forehand count. Daniil Medvedev would be an example of such a player.

To get more predictive utility, we’ll need to incorporate the outcome of the rally into the analysis.

Forehand Offense and Unforced Errors

Looking at the forehand outcomes first, this is how often the winner of the match amassed more combined winners and forcing shots than the opponent.

Winner More FH OffenseLoser More FH OffenseEqual
Match Count41113

Conversely, this is how often the winner of the match made less forehand unforced errors than the opponent.

Winner Less FH UELoser Less FH UEEqual
Match Count35173

It is not surprising that taking into account the outcome of the rally improves the predictive power of the statistic. Both the offense and the unforced errors are a significant improvement over the raw count of forehands. However, it is the offensive component that correlates more strongly with winning the match. About 75% of the time, the winner of the match amassed more winners and forcing shots on the forehand side, compared to about 64% of the time the winner made less unforced errors.

Why is it slightly easier to overcome a higher rate of unforced errors on the forehand and still win the match, as opposed to overcoming a lower offensive rate? I think that it has to do with controlling the rally. If I make unforced errors on the forehand, on at least some of those my feet are set, I am on offense, and I just go for too much and miss by a close margin. I can “get away” with those mistakes, as long as I keep generating the offense, getting ahead in the rally, and accumulating winners and forcing errors. Looking at the finals match, Djokovic made more unforced errors on the forehand than Tsitsipas; yet he led the offensive category by a significant margin, signaling that he was controlling more of the rallies with his forehand. It is this control that ultimately helped tilt the match in his favor.

Backhand Offense and Unforced Errors

Shifting gears to the backhand side, here is how often the winner of the match led in the offensive category.

Winner More BH OffenseLoser More BH OffenseEqual
Match Count28225

This was probably the biggest surprise for me in the dataset. Knowing, which player generated more winners and forcing shots on the backhand side told you almost nothing about the outcome in this particular set of matches. The winner of the match had more backhand winners and forcing shots only in about 51% of the matches. You would actually have a better chance predicting the winner of the match using the raw forehand count – a process statistic, as opposed to a backhand outcome statistic.

Why is that the case? My guess is that backhand winners and forcing shots are relatively infrequent events, similar to aces. They simply don’t account for a large enough percentage of points to bear a significant weight on the outcome of the match. In the present dataset, the match winners averaged about 20 forehand winners and forcing shots per match, but only about 10 backhand winners and forcing shots. Looking once again at the Djokovic vs Tsitsipas statistics, Djokovic had 47 winners and forcing shots on the forehand, compared to just 10 on the backhand.

If looking at the backhand offense tells us little, how about looking at backhand unforced errors?

Winner Less BH UELoser Less BH UEEqual
Match Count39142

The backhand unforced error rate was the second best predictor of the ultimate match winner, behind only the forehand offense. This is a great illustration of the backhand’s function as a shield. Since players in general don’t finish a ton of rallies with their backhand (as seen in the backhand offense table), it is important that the stroke be dependable in a neutral rally – i.e. limiting the unforced errors. The backhand is used more as a “linking” shot, a bridge between defense and neutral, and as a transition from neutral to offense before the forehand is used to finish the job. Its effectiveness as a link is better highlighted when looking at unforced errors – when the stroke is not under pressure – as opposed to looking at finishing statistics such as winners and forcing shots.

It would be great to see if these patterns held over all of the 120+ singles matches played at this year’s French Open. Regardless, the results of the analysis pass the smell test: to guess, who might have won a particular tennis match, look at who inflicted more damage with their forehand, and who hurt themselves less with their backhand. More often than not, you’ll be on the right track.

Ruminating on the Underhand Serve

Almost every sport has a set of unwritten rules that the competitors are expected to abide by and follow. In baseball, baserunners are discouraged from stealing bases in the late innings of a blowout. In American football, teams will often take a knee instead of running an offensive play if the game is already decided late in the fourth quarter. In soccer, a team in possession of the ball is expected to kick it out of bounds in order to allow medical treatment of an injured opponent. The application of the unwritten rules is nuanced and not universally agreed upon, even among the competitors themselves. The common thread among all these “agreements” though, is showing a level of respect for the opponent and the sport.

There are numerous examples of unwritten rules in tennis. For example, tennis players are expected to avoid trying to aim at their opponent with the ball during a rally, if there is an option to go around them. They are expected to apologize after they hit the net tape with a stroke, and the ball rolls over onto the opponent’s side. Players shake hands at the end of the match, regardless of how heated the competition might have gotten.

And then there is the underhand serve.

This is Dominic Thiem’s second serve return position in his third round match in the ATP Masters 1000 in Rome against Lorenzo Sonego:

Courtesy ATPTour.com

I’m certainly not picking on Thiem. There are plenty of other players, who prefer to hit their returns from way behind the baseline: Rafael Nadal, Daniil Medvedev, and Stefanos Tsitsipas all come to mind. The reason for this is purely tactical; it gives them more time to react to the serve, potentially hit a forehand on the return, and gain the upper hand in the rally.

If you knew nothing about tennis, you might think to yourself: if my opponent wants to return from that far back, it would seem logical for me to try and entice them to return from closer to the baseline. Just like if my opponent wants to hit more forehands, I will aim at her backhand. I am trying to get my opponent to do the things they are not comfortable doing on the court.

You do have a potential weapon in your arsenal to help you accomplish just that: the underhand serve. But you’re not supposed to use it. Take a look at the reaction Nick Kyrgios got from Nadal when he used the underhand serve during their Wimbledon encounter in 2019:

Regardless of the history between Nadal and Kyrgios, the underhand serve is regarded as a sign of disrespect towards the opponent; they are “not worthy” of you hitting a “proper” serve against them. I would argue that the underhand serve could become a legitimate tactic in an era where, especially on clay, many players choose to set up for the return way back behind the baseline. What would have to happen for the underhand second serve to become less taboo? I think either of the following three developments would speed up the process.

A Respected Ambassador

If someone like Nick Kyrgios or Alexander Bublik decides to serve underhand, the stigma associated with that shot is reinforced. They are “young and brash,” “disrespecting the opponent and the game,” and “immature.” Both Bublik and Kyrgios are known for having tanked matches, and making eyebrow-raising statements in some of their post-match press conferences; their underhand second serves are then viewed through that same lens.

If Roger Federer or Serena Williams decide to serve underhand, the lens would change. Both Federer and Williams have amassed so much credit over their respective years of dominating the sport, that their reputations are bulletproof. Do you still remember the SABR?

Contrast the crowd’s reaction to Federer’s unconventional tactic to the reaction Kyrgios got when hitting the underhand serve. Admittedly, it is not an apples to apples comparison. In the SABR, Federer is trying to get Djokovic out of his rhythm, sure. Yet he still puts himself at a bit of a disadvantage by decreasing the time he has to react to Djokovic’s serve by returning so close to the service line. In other words, during the SABR, you react. During the underhand serve, you are completely in control of the shot. Regardless of that particular difference between the SABR and the underhand serve, just listen to the different reactions of the commentators and the crowd. What would the reaction be if Federer served underhand to Nadal at this year’s Wimbledon? You tell me.

Genuine Tactic

There is one more subtle difference between the two tactics, and that is the timing of their deployment in the videos above. Notice the score when Kyrgios serves underhand to Nadal: 2-5, 40:0. Serving at 40:0 on grass, Kyrgios is an overwhelming favorite to win the game. Even if he loses the point, at 40:15 he is still well ahead in the game. By the same token, once he wins the point, Nadal is clearly favored to win the set when serving at 5-3. In a way, that 2-5 40:0 point is largely irrelevant to the outcome of the first set.

Contrast that with the timing of Federer’s SABRs in the video above. First one at 1-1 15:30, and the other up 3:1 in the tiebreak. Both are tremendously important points. If Federer goes up to 15:40 in the third game, he has a good chance of going up a break in the match. Similarly, to go up 4:1 in the tiebreak, and serving, puts Federer well on his way to securing the first set 7:6.

Looking at their respective strategies from this angle, it really does seem like Federer’s SABR was a legitimate tactic deployed to surprise Djokovic and gain an advantage in the match. Kyrgios’ underhand serve was used in a situation that really didn’t matter.

I think that if the underhand second serve is used as a real, genuine tactic, some of the players’ and fans aversion would be muted. What is a “genuine tactic?” That is hard to pinpoint, but some things to look out for would be:

  • Used early in the match to force the opponent to adjust
  • Used throughout the match if the opponent doesn’t adjust; for example, using the underhand serve twice a game instead of twice a set
  • Used on important points
  • Used in multiple matches against a variety of opponents

Returner is Not a Victim

This last point falls more on the fans and members of the media rather than the players themselves. And that is simply to recognize that the returner can adjust their position, if they don’t like being served to underhanded. Just like if Djokovic didn’t like Federer’s SABRs, he could hit his second serve a little harder, and aim it at the body of Federer. Once Federer sees that his strategy is not having the desired effect, he’ll stop doing it. If I don’t like somebody slicing backhands, I can hit through their forehand. Tennis is a game of adjustments, and allowing that one’s return position is a variable that the opponent might want to exploit would go a long way towards freeing up some players to hit more underhand serves without fearing the crowd’s reaction.

There are plenty of examples of tactics evolving in various sports around the world. In basketball, once the sport has recognized and embraced the value of the three point shot, the game has evolved into a wide-open, pace and space sprint, as opposed to the slow slog of yesteryear when games were dominated by battles in the post. In American football, the multiple wide receiver formations are much more prevalent in today’s era of the pass than the run-heavy, multiple tight end sets of the years past. In baseball, you might see the shortstop lineup anywhere on the infield on defense these days, depending on who the batter is. Seeing a little more of the underhand serve would be a welcome sight for yours truly; a wrinkle, and a new tactical element in a game that has recently been a little lacking in variety for my taste.

Alexander Zverev vs. Rafael Nadal: A Tale of Two Matches

En route to his recent ATP Masters 1000 title in Madrid, Alexander Zverev notched his first clay court victory over Rafael Nadal. Zverev beat Nadal twice before their Madrid encounter, but both victories came on indoor hard courts. Playing Nadal on clay is a different animal though. His confidence on clay is sky high, the higher bounces magnify Nadal’s ability to hit the ball with heavy topspin, and the slower conditions tend to make matches more physical, which Nadal relishes.

After Zverev took the match 6-4 6-4 in the quarter finals in Madrid, the two were set for a rematch a week later in the quarter finals of the ATP Masters 1000 in Rome. This time it was Nadal, who came away with the 6-3 6-4 victory. Looking at the statistics from both matches, we can catch a glimpse of the adjustments that Nadal made, and how he was able to tilt the way the match was played in his favor in Rome more so than in Madrid.

Extending the Points

Let’s first take a look at the distribution of the different rally lengths from the Madrid match, which Zverev won.

Total PointsPercent of TotalZverevNadalAdvantage
0-4 shots6559.63%3629Zverev +7
5-8 shots2926.61%1811Zverev +7
9+ shots1513.76%69Nadal +3
Courtesy ATPTour.com

Almost 60% of the points in Madrid were shorter than 4 shots, and over 85% were shorter than 8 shots. The average rally length for the match was 5.9 shots. It was in these shorter rallies where Zverev won the match. Once the rally extended past 9 shots, it was advantage Nadal; but since those rallies counted for less than 15% of all points, it was not enough to swing the match in Nadal’s favor.

Going into the match in Rome, I would assume that one of Nadal’s goals was to work his way into the point, extend the rallies, and make the match more physical than was the case in Madrid. Let’s examine the same statistics from Rome.

Total PointsPercent of TotalZverevNadalAdvantage
0-4 shots4434.92%2519Zverev +6
5-8 shots5039.68%2228Nadal +6
9+ shots3225.40%1220Nadal +8
Courtesy ATPTour.com

Mission accomplished for Nadal. Zverev still won the 0-4 shot rally length, but the percentage of total points in that particular bucket dropped from about 60% in Madrid to only about 35% in Rome. Conversely, the contribution of the 9+ rally length increased from about 14% in Madrid to over 25% in Rome. Combine those two, and the average rally length in Rome jumped to 7.5 shots compared to the 5.9 average in Madrid. Furthermore, Nadal was able to reverse the 5-8 shot rally length to his advantage, and maintain his dominance in the extended 9+ shot rallies.

How exactly was Nadal able to extend the rallies in Rome compared to Madrid? By going into Zverev’s backhand more, and by hitting spinnier, heavier groundstrokes.

Targeting the Zverev Backhand

Once the point got started, Nadal’s strategy in Rome was much more focused on hitting through the Zverev backhand than was the case in Madrid. Let’s look first at the comparison of Nadal’s forehand targets from both matches:

The left image are Nadal’s forehands in Madrid, where he lost, and the image on the right is Nadal’s forehands in Rome, where he won. There is about a 10% increase in Nadal’s forehands being aimed cross-court into Zverev’s backhand. The difference is yet more pronounced on Nadal’s backhands:

The left image is once again Nadal’s backhands from Madrid. In that match, Nadal hit 3 out of every 4 backhand cross-court into Zverev’s forehand. In Rome, Nadal made sure to stay away from that particular pattern more often, and his backhands were split about evenly between cross-court and down the line. Also, notice the higher frequency with which Nadal’s backhands were landing deeper in the court in Rome than was the case in Madrid. Depth is one of the best ways to gain control of the point in a rally, and it was surely a contributing factor in Zverev committing 31 rally unforced errors in Rome as compared to just 10 in Madrid.

Heavier Spin on Groundstrokes

Not only did Nadal get his groundstrokes – especially backhands – deeper into the court in Rome, his strokes were also “heavier,” hit with more topspin. Below is a table comparing the average RPMs on groundstrokes from both the Madrid and the Rome matches.

Nadal FHNadal BHZverev FHZverev BH
Madrid3,069 rpm2,378 rpm2,893 rpm1,529 rpm
Rome3,183 rpm2,578 rpm2,830 rpm1,500 rpm
Difference+114 rpm+200 rpm-63 rpm-29 rpm
Courtesy ATPTour.com

While Zverev was hitting his groundstrokes flatter in Rome than he did in Madrid, Nadal really cranked up the topspin, especially on his backhand wing. Knowing that balls with more topspin tend to bounce higher than flatter strokes, and combining that with the groundstroke distribution patterns, we can try to guess Nadal’s baseline strategy in Rome: try to get the ball up high on Zverev’s backhand, making it harder for him to attack from that part of the court. Now, Alexander Zverev is listed at 6’6″, so this is no easy feat.

It would be great to see the average net clearance of Nadal’s groundstrokes in the match, but I can’t seem to find that statistic on the ATP website. The best I can do is compare the average net clearance on Nadal’s second serve returns, which were hit about 0.81m above the net in Madrid, and about 20cm higher – 1.03m above the net – in Rome. It is certainly not hard evidence, but it points in the same general direction as the higher rpms on the groundstrokes: get the ball higher on Zverev.

Getting the ball deep and under heavy spin into Zverev’s backhand forces Zverev into a decision. Option one is to try to take the ball on the rise, before it gets up above his shoulders. The timing of that shot is tricky even for the best players in our sport, and given how flat Zverev’s backhand is, the margin for error is slim. Alternatively, Zverev can back up and wait until the ball drops back down into his strike zone. That, however, puts him in a defensive position way behind the baseline – minimizing his chance to attack – most likely extending the rally – and playing right into Nadal’s hands. Pick your poison time for Zverev.

Alexander Zverev was able to get the best out of Rafael Nadal in Madrid by keeping the majority of the rallies short. Nadal countered in Rome with a strategy designed to extend the rallies, and trying to get Zverev uncomfortable in the ad side of the court. Should these two face-off at the French Open, one of the factors deciding the outcome of the match will be the length of the rallies. If Zverev can keep the match a first-strike battle, he has a chance. Otherwise, with every extra groundstroke hit, Nadal’s advantage will keep mounting.

3 Suggestions for Jannik Sinner Against Rafael Nadal

After losing to Rafael Nadal in the 2nd round of the ATP Rome Masters 1000, Jannik Sinner is now 0-2 in his young career against the Spaniard. Having a losing record against the greatest clay court player in the history of our sport is nothing to be ashamed of; Sinner’s other loss to Nadal came on clay as well, in the quarterfinals of the 2020 French Open. Looking at the statistics from their Rome encounter, I think that there are three areas that Sinner could improve on to give himself a better chance when he sees Nadal for the third time.

One particular area, where Nadal was better than Sinner in Rome, was on points that started with a second serve. I included both columns for visual effect, even though a player’s 2nd serve win% is equal to (1 – 2nd serve return win%) of their opponent.

2nd serve win %2nd serve return win %
Rafael Nadal55%56%
Jannik Sinner44%45%
Courtesy ATPTour.com

Sinner’s Second Serve Placement

These are the locations where Sinner aimed his second serves against Nadal in Rome:

When Nadal is getting ready to return Sinner’s second serve, he can basically eliminate half of the service box in the deuce, and a third of the service box in the ad. In this particular match, it was a winning strategy for Sinner in the ad court, where he won two thirds of the points that started with his second serve down the “T”, but it was a losing proposition in the deuce side, where he lost two thirds of his second serve points that started out wide.

Where it gets really interesting is that Sinner’s plan here is to serve to Nadal’s backhand in both courts. Let’s take a look whether Nadal actually returned with the backhand once Sinner got his second serve in. First, the deuce court:

How many forehand returns did Nadal hit on Sinner’s second serve in the deuce? Zero. That’s the image on the left. All of Nadal’s second serve returns in the deuce were backhands, just like Sinner planned. Yet Nadal won 11 and lost 5 of those points. Not encouraging for Sinner. How about the ad side?

How many forehand returns did Nadal hit on Sinner’s second serve in the ad? All of them. That’s the image on the left again. All of Nadal’s second serve returns in the ad were forehands, yet he won 6 and lost 10 of those points. Much more encouraging for Sinner.

Let’s sum it up. Sinner was clearly targeting the Nadal backhand with his second serve in both sides of the court. He got as he wished in the deuce, and lost the majority of those points. He couldn’t find the Nadal backhand with the second serve in the ad, and won the majority of those points.

My biggest takeaway from this analysis would be for Sinner to hit more second serves out wide in the ad, but especially down the T in the deuce side next time he sees Nadal. It would prevent Nadal from zeroing in on one particular area of the service box, and Sinner has shown the ability to handle the Nadal forehand return by having a winning percentage against it in the ad side.

Sinner’s Second Serve Return

Sinner’s second serve returns often landed much too short in order to put Nadal on defense at the start of the point. Here is the breakdown of Sinner’s second serve return locations from the deuce side:

The forehand returns are the box on the left, backhand returns are on the right. Sinner did a good job with getting forehands on the majority of the deuce side returns: 9 forehands, and only 4 backhands. But notice that out of the 13 total second serve returns, only 3 were classified by the ATP as “deep.” Especially with the forehand returns, there is a cluster right in the middle of the court. Let’s take a look at the ad side:

The forehand returns are once again on the left, the backhands on the right. It’s a similar pattern to the deuce side: Sinner again does a good job of getting forehands on the second serve returns: 10 forehands and just 2 backhands. Yet out of those 12 returns, only 3 would be classified as “deep.” And once again, there are a bunch of forehand returns in the middle of the court.

Finally, this is where Sinner was hitting his second serve returns from:

Courtesy ATPTour.com

Let’s sum up this part of the analysis. Sinner was hitting his second serve returns mostly from about 4 meters behind the baseline, with one lone exception when he stepped inside the court in the ad side. Those returns were mostly forehands that were landing short, towards the middle of the court.

I think that Sinner has two options to make his second serve return more effective against Nadal. First, he can step inside the court more, and keep everything else more or less the same – just like that one dot in the picture above. Sure, it would be great to get the return deeper, but by taking time away from Nadal, even a shorter return can elicit a defensive response.

The second option – if Sinner feels comfortable hitting forehand returns from further behind the baseline – would be to simply return higher above the net. Here are the return heights from Nadal and Sinner from their Rome match:

Courtesy ATPTour.com

Sinner hit his second serve returns about 0.73 meters above the net. I’d like to see that number be closer to about 1 meter, if Sinner were to keep returning from way back behind the baseline. Just those additional 30 centimeters of net clearance would get the returns a bit deeper, and set up Sinner on offense at the start of the rally.

Rally Backhand Direction

Nadal crafted his biggest advantage in the match in the longest rallies. Here are the points won by both players in rallies of different lengths:

0-4 shots5-8 shots9+ shots
Rafael Nadal272322
Jannik Sinner232316
AdvantageNadal +4Nadal +6
Courtesy ATPTour.com

One way to keep the rallies shorter for Sinner – besides mixing up his second serve locations, and hitting his second serve returns deeper – would be to utilize his backhand down the line more. This is where Sinner aimed his backhand groundstrokes against Nadal:

Courtesy ATPTour.com

That’s about three out of every four backhands going into Nadal’s forehand. It’s actually pretty common to see this kind of a distribution when two right handed players are matched up against each other, and they’re trading backhands back and forth. Against a lefty though, it’s a different story.

Let’s contrast Sinner’s backhand placement to where Alexander Zverev aimed his backhands against Nadal when he beat him in the recent ATP Madrid Masters 1000:

Courtesy ATPTour.com

Zverev was much closer to a 50-50 split on his backhand than Sinner was, and having to respect Zverev’s backhand down the line undoubtedly kept Nadal more off balance in groundstroke exchanges. Sinner has a very simple, clean, two handed backhand, that he can hit close to 130km/h. The next time he plays against Nadal and finds himself in a groundstroke rally, I’d like to see him take more chances with his backhand down the line to the Nadal backhand. The majority of his backhands can still go crosscourt, but just like on the second serve, all he’s trying to do is show Nadal a different pattern of play, as opposed to being a little too predictable.

Sinner is one of the upcoming stars of tennis, challenging the “old guard” of Federer, Djokovic, and Nadal. Making a few small adjustments here and there is all that it might take for Sinner to claim his first win over Nadal the next time they lock horns.

The MVP Batter

In one of the later chapters of The MVP Machine, the authors describe a working relationship between an unnamed position player and a writer at an “analytically inclined” baseball website. The player felt that his club’s advanced scouting data wasn’t granular enough, and asked the writer to supplement the information he was given by the club with additional detail. The writer was eventually performing scouting reports on the player himself, opposing pitchers, as well as the home plate umpires’ strike zones. In terms of evaluating his own performance, the writer summarized that the player was basically looking at three things: “Am I squaring up the ball? Am I swinging and missing? Am I swinging at strikes?”

With the first month of the season in the books, who would be some of the best performing hitters in the league according to this particular player’s criteria? Thanks to Statcast, we have the tools at our disposal to try and figure out just that. The dataset I used for this exercise was all qualified batters as of the morning of April 30th, 2021.

First, we need to decide which parameters to use to represent each of the three questions posed by the player. Two of the three are pretty easy. “Am I swinging and missing?” We can look up a player’s whiff percentage on Statcast. “Am I swinging at strikes?” That information is represented in a player’s chase percentage. “Am I squaring up the ball?” The natural candidates here would be, if we’re using just one number: the average exit velocity, hard hit percentage, and barrel percentage. I decided to go with the average exit velocity, because it takes into account every batted ball put in play by the batter. Let me explain.

The hard hit percentage – defined as the percentage of balls hit with an exit velocity of 95mph or harder – is binary. If a batter puts 6 balls in play at 96mph, and 4 balls at 94 mph, his hard hit percentage will be 60%. Similarly, if a batter puts 6 balls in play at 96 mph, and 4 balls leave his bat at 85mph, his hard hit percentage will be the same 60%, even though the first batter makes a more consistent hard contact. Barrel percentage has the same binary issue, and I couldn’t find anywhere on Statcast what that “perfect combination” of exit velocity and launch angle is.

Now that we have our three parameters – average exit velocity, chase percentage, whiff percentage – I normalized all three to vary between 0 and 1, using the formula:

X_normalized = (x_true_value – dataset_min) / (dataset_max – dataset_min)

X_true_value is the actual value of the parameter, dataset_max is the highest value of that parameter in the dataset, and dataset_min is the lowest value in the dataset. Let’s look at an example. Shohei Ohtani’s average exit velocity – as of the morning of April 30th – was 91.1 mph. The highest average exit velocity in the dataset was 98.7mph, the lowest 81.6mph. Ohtani’s normalized average exit velocity would then be:

(91.1 – 81.6) / (98.7 – 81.6) = 0.5555

Finally, I computed the weighted average with different weights for the normalized parameters. For the normalized average exit velocity, the higher the number the “better,” while for the chase and whiff raters, the higher the normalized value the “worse.” To account for this, I’m actually multiplying the respective weights by (1 – normalized_whiff_rate) and by (1 – normalized_chase_rate). Let’s call the final output “MVP value.”

Base Case

MVP = (1/3) * normalized_exit_velocity + (1/3) * (1 – normalized_chase_rate) + (1/3) * (1-normalized_whiff_rate)

In the first instance, I assigned equal weights to the exit velocity, whiff rates, and chase rates. This is a “baseline” run, if you wish. The top 10 is as follows (excuse some rounding errors):

Last NameFirst NameNorm_EVNorm_OZ_SwingNorm_WhiffMVP
Acuna Jr.Ronald0.78950.11960.33570.7781
JudgeAaron0.92980.35280.52210.6850
LowrieJed0.56730.20250.34500.6733
PhamTommy0.54970.08900.44990.6703
StrawMyles0.27490.07980.18650.6695
SmithWill0.52050.23930.29140.6633
TroutMike0.67840.20860.48020.6632
TurnerJustin0.52630.19330.35430.6596
LeMahieuDJ0.45030.17480.30070.6582
RamirezJose0.59650.40800.22840.6534
Courtesy MLB Statcast

Ronald Acuna Jr. has been spectacular at the plate so far this year, and he is head and shoulders above everybody else in this edition of the made-up metric. Otherwise, we see a lot of players, who have been tearing the cover off of the ball early in the season. Besides Acuna, this would include Mike Trout, Aaron Judge, Justin Turner, Jose Ramirez etc.

Where it gets interesting is that we have two guys with sub-70 wRC+ on the list: Tommy Pham and Myles Straw. Let’s take a look at each one in turn.

I’m not going to lie, I had to look up who Myles Straw was. He is making an appearance courtesy of a 15.3% chase rate (MLB average in 2021 is 28.4%) and a 13.4% whiff rate (MLB average 24.4%). Unfortunately, his 86mph exit velocity with a 6.1 launch angle lead to a .325xSLG, in line with his brief major league career. Unless Straw can start making harder contact, his elite contact ability and plate discipline alone won’t keep him in the big leagues.

Tommy Pham, on the other hand, is about to go on a hot streak here soon. The underlying metrics are solid across the board, he’s just been unlucky so far. The average exit velocity is at 91.6 mph, and Pham’s wOBA sits at .256 as opposed to .380 xwOBA. Similarly, Pham is slugging .203, compared to a .483 xSLG, a 280(!!) point difference.

Let’s change the weights on the normalized values a bit. Maybe we’d like to emphasize not chasing, staying within the strike zone. We’ll bump up the weight on the normalized chase rate to 50%, and decrease the weights on exit velocity and whiff rates both to 25%.

No-chase

MVP = (1/4) * normalized_exit_velocity + (1/2) * (1 – normalized_chase_rate) + (1/4) * (1-normalized_whiff_rate)

Last NameFirst NameNorm_EVNorm_OZ_SwingNorm_WhiffMVP
Acuna Jr.Ronald0.78950.11960.33570.8036
StrawMyles0.27490.07980.18650.7322
PhamTommy0.54970.08900.44990.7305
MuncyMax0.35670.00000.51750.7098
LowrieJed0.56730.20250.34500.7043
LeMahieuDJ0.45030.17480.30070.7000
TurnerJustin0.52630.19330.35430.6964
TroutMike0.67840.20860.48020.6953
SmithWill0.52050.23930.29140.6876
GrossmanRobbie0.32160.10120.38460.6836
Courtesy MLB Statcast

We see a lot of the same names from the table above. Jed Lowrie has been another recipient of tough luck. His average exit velocity is up over 90mph for the first time in his career, yet his slugging percentage sits at .406 compared to .546 xSLG.

Max Muncy is one of two players to make an appearance in this table after not being in the top 10 in the base case. His average exit velocity of 86.8mph is at its lowest level since 2016, and his 49.2% ground ball rate is at its highest level since 2016 as well. Yet even he might “heat up” in the near future, as his .365 SLG vs .460xSLG would suggest. Muncy’s chase rate of 11.9% is the lowest in the majors, and is a major contributor to his 24.1% walk rate and a .422 on base percentage. No need to press, Max.

Let’s do one more iteration, this time emphasizing hard contact; the weights in this iteration will be 50% on the normalized exit velocity, and 25% on the whiff rates and chase rates respectively.

Hit-it-hard-somewhere

MVP = (1/2) * normalized_exit_velocity + (1/4) * (1 – normalized_chase_rate) + (1/4) * (1-normalized_whiff_rate)

Last NameFirst NameNorm_EVNorm_OZ_SwingNorm_WhiffMVP
Acuna Jr.Ronald0.78950.11960.33570.7809
JudgeAaron0.92980.35280.52210.7462
AlonsoPete0.94150.58280.58280.6794
TroutMike0.67840.20860.48020.6670
StantonGiancarlo1.00000.55830.79020.6629
Guerrero Jr.Vladimir0.74850.25150.61310.6581
LowrieJed0.56730.20250.34500.6468
FreemanFreddie0.66080.33130.41260.6444
PhamTommy0.54970.08900.44990.6401
RamirezJose0.59650.40800.22840.6391
Courtesy MLB Statcast

That would be 3-for-3 for Ronald Acuna Jr. as the top dog. Besides him, some of baseball’s most powerful sluggers make an appearance on the list. I’d like to touch on two of them, Pete Alonso and Giancarlo Stanton.

Pete Alonso’s average exit velocity jumped to 97.3 mph in 2021 from about 91mph in 2019 and 2020. He is doing more damage in the zone – his whiff and chase rates are in line with his career averages, but his zone swing rate jumped 10% from mid to high 60s in 2019/2020 to 75.8% so far in 2021. Alonso swings more in the zone than before, and makes more contact in the zone as well; his zone contact rate is at a career high 85%. That’s a recipe for success and Statcast seems to agree; Alonso is sporting a healthy .500 SLG, yet his xSLG is at .597.

Giancarlo Stanton is hitting the ball as hard as anyone, but there might be trouble lurking on the horizon. Stanton currently has the lowest zone contact percentage of his career, along with a 30%+ chase rate for the first time since 2016, and highest whiff percentage since 2015. Moreover, he has been destroying four-seam fastballs this year, having an xSLG of .773 against the pitch. Looking at breaking balls and off-speed stuff, his xSLG against sliders is .438, against curveballs it is .515, and against changeups .224. So far, Stanton has been seeing about 60% fastballs. It will be interesting to see if he starts seeing more breaking balls going forward and ends up closer to a 50-50 split for fastballs vs non-fastballs.

Besides learning that Ronald Acuna Jr. is an alien, looking at the three underlying “MVP” metrics combined seems like a good starting point to dig deeper into a player’s offensive profile. Adjusting the weights on the individual parameters emphasizes different skillsets of the batter, and allows us to identify candidates for regression or improvement in some of the surface stats.

Jannik Sinner & Lorenzo Musetti: Two Paths, One Destination

As of April 26th 2021, there are two teenagers ranked inside the ATP Top 100: Jannik Sinner and Lorenzo Musetti. They both represent Italy, both are right handed, and both are fantastic movers on the court. Yet there are notable differences between the two as well. Sinner has a relatively flat two handed backhand, while Musetti has an outstanding, spinny one hander. Sinner is calm, almost stoic on the court; Musetti is not afraid to pump his fist and show his emotions. Sinner has already established himself as one of the elite players in our sport – as of this writing he is ranked #18 in the world – while Musetti is just now becoming known to the casual tennis fan.

One other notable difference is the path they took, in terms of the mix of tournaments played, before reaching the Top 100. Just to clarify my terminology for the rest of the article: an “Age X” season refers to the year in which a player turned X years old. For example, Sinner was born in 2001, and so his “Age 17” season would be 2018. Musetti was born in 2002, and his “Age 17” season would therefore be the year 2019.

Let’s start with the tail ends of their respective junior careers.

Career High ITF World Junior RankingRank on January 1st, Going Into Age 17 Season
Jannik Sinner133133
Lorenzo Musetti14
Courtesy itftennis.com

Neither Musetti, nor Sinner, played much junior tennis past January 1st of their age 17 season. Lorenzo Musetti played just three more tournaments, one of which was the 2019 Australian Open junior event that he won. He ended up achieving the #1 world junior ranking following his 2019 French Open appearance. Sinner only competed in one junior event in his age 17 season: the Trofeo Bonfiglio Grade A in Milan, which is traditionally the second strongest junior clay court event, behind only the French Open.

While both players turned their attention to the men’s game before their 17th birthdays, the professional tournaments they had to access to – dictated to a large extent by their junior ranking – were different.

Let’s take a quick detour through the structure of men’s professional tennis. There are – roughly speaking – three levels of tournaments. Tournaments on the lowest rung of the ladder are called “Futures;” these offer the lowest prize money, and award the least ranking point totals for winning matches. Tournaments on the second rung are called “Challengers;” these are organized by the ATP, and while they offer higher prize money and higher point totals for winning matches, players need a higher ranking – again, generally speaking – to be entered into a Challenger draw than into a Futures event. Finally, the highest level of the ladder are the major ATP events you see on TV.

Furthermore, there are, for the purposes of this post, three main ways to enter a tournament. Let’s use a Challenger with 32 players competing in the main draw as an example. Out of the 32 players, 24 will be accepted directly into the main draw based on their ranking, 4 will receive a wild card, and 4 will advance from the qualifying draw. The wild cards and qualifying draws are designed to allow access to higher level tournaments to players, who wouldn’t have been accepted into the main draw based on their ATP ranking alone.

A major advantage of a high junior ranking is the attention from sponsors, national tennis governing bodies, and agencies. Once an agency signs a player, or a national tennis federation becomes invested in his/her success, one way in which they can help that player along is securing wild cards into professional events.

This is the breakdown of the first 10 professional tournaments Musetti and Sinner played, and how they entered them.

Jannik SinnerLorenzo Musetti
Futures Qualifying: Direct Acceptance71
Futures Qualifying: Wild Card2
Futures Main Draw: Wild Card12
Futures Main Draw: Junior Exempt2
Challenger Main Draw: Wild Card4
ATP Qualifying: Wild Card1
Total1010
Courtesy itftennis.com

Before we go any further, I don’t want to claim that one way is “easier” than the other. In the end, the athlete has to perform at a certain level to beat Challenger-level players and make it to the Top 100. I merely want to illustrate how two young stars got to where they are.

As the great philosopher Drake once said, Sinner “started at the bottom, now he’s here.” All 10 of his first professional tournaments were Futures; furthermore, he started in the qualifying draw in 9 of those. Qualifying draws of Futures are where dreams of professional tennis go to die; there is no prize money, and no ATP points are being awarded for winning matches. The goal for any young aspiring professional tennis player is to play themselves out of that level as quickly as possible.

Musetti, on the other hand, played in only one Futures qualifying event in his career. More importantly, 4 of his first 10 tournaments were Challengers, and he entered those by being awarded a wild card into the main draw. Two things are important to note: first, you start earning ATP points right away in the main draws of Futures and Challengers, hopefully reducing your reliance on wild cards in the future. And second, by being exposed to the level of play and environment of Challengers right away, the player in question hopefully feels like “I belong here” and “I can compete with these guys.” Musetti earned those wild cards by being an elite junior prospect and capitalized on his opportunities.

Despite the different tournament mixes in their first 10 professional events, it took both Sinner and Musetti just over 50 tournaments to achieve the coveted double digit ATP ranking. Below are the number of tournaments they competed at – broken down by the rungs of the ladder – before cracking the Top 100.

Jannik SinnerLorenzo Musetti
Futures2912
Challengers1533
ATP 250+117
Total5552
Courtesy itftennis.com

If I had to pick a path, I would pick Musetti’s, simply based on the fact that he spent very little time playing Futures tournaments. In terms of facilities, practice courts, official hotels etc., the Challengers are way closer to the ATP tournament standards than the Futures are. At the same time, I have a ton of respect for Jannik Sinner’s journey. It is not easy to play qualifying events of Futures as a 16/17 year old. The conditions can be rough, winning multiple matches in the qualifying draw before even having a chance to compete for ATP points – one needs a certain level of physical and mental maturity to handle the minor leagues of tennis. The fact that Sinner was able to do that as a teenager is impressive.

In the end, I bet that Musetti will soon join Sinner in the Top 20, and we’ll get to enjoy watching them compete at the highest level of our sport for more than a decade. Slightly different routes, same destination.