Walk Like a Sabermetrician: Royal Mythology

Monday, November 02, 2015

Royal Mythology

Rarely has the performance of a single team led to so many attempts to rationalize, explain, project virtue, and the like as the 2014-15 Royals. Focusing on the 2015 edition, here are just a handful of Royals myths that I have been particularly annoyed at hearing. The "analysis" that follows is not comprehensive nor is it intended to be. That's kind of the point. The level of extraordinary claims that have been made about the Royals should be apparent even with the crudest of inquiries into the objective record.

Myth #1: Whatever the Heck Andy McCullough Tweeted

"The entire point of the Royals is that baseball is a hard game and if you make your opponent do things, sometimes they will screw up"

The Kansas City Royals reached based on error 58 times in 2015. The AL average was 57. In 2014 they had 51 ROE versus a league average of 57.

Myth #2: The Royals Don't Make Mistakes

Errors leave a lot to be desired as a metric, but when traditional thinkers talk about making mistakes, errors are first and foremost on their mind. The 2015 Royals had a mFA of .973; the AL average was .971. The 2014 Royals had a mFA of .968; the AL average was .970.

Myth #3: The Royals had a long World Series drought

There are 30 MLB teams. It should be obvious, then, that 30 years is the expected time between world titles. Thus a streak of thirty years is not particularly long in theory. It's also not long in practice, as it was only the 12th longest drought (the Mets had the 13th longest drought). Last year en route to the pennant, two of the three teams Kansas City beat had (slightly) longer droughts and the other had a slightly shorter drought.

To find the Royals worthy of any particular sympathy, one must give extra credit for how poorly the franchise performed for much of that period. While this is unfortunate for the fans, it seems like such a group would be less traumatized by losing the World Series and more appreciative just to get there. Fan "suffering" is very low on my list of factors in deciding which teams to pull for in the playoffs, but to the extent I consider it, I tend to side with teams that have been good and just have not had the bounces go their way in October. Teams like the Marlins and the Royals who parlay their only two playoff teams in an extended period into pennants and world titles are quite galling to anyone who has rooted for a titleless yet competent franchise.

But more broadly, I think that the media and fans have yet to understand how championships will be distributed over the long haul in leagues that are double or close to it in size from what they were for so many years. Lengthy droughts, the types that the Red Sox, Cubs, or to a lesser extent Indians and Giants have suffered will be quite commonplace. Basic logic tells you that they have to be.

I did a "simulation" (which is a pretentious way of saying I used the RAND() function in Excel) to simulate 1,000 seasons of a thirty-team league in which each team had a 1/30 chance to win the World Series in any given year. Remember, this is the height of competitive balance. The probability of a championship could not be any more evenly distributed. There are no market disadvantages, no bad franchise stewardship, no billy goats. It is theoretically possible that the timing of championships could be more evenly distributed, but admittedly my imagination is insufficient to describe a specific scenario that would force a more even temporal distribution.

After 1,000 years, the average team should have had 33 1/3 titles. The most successful had 45; the two least successful each had 22 (as an aside, and granting that it was a sixteen team universe for an extended period, think about the Yankees' 27 in this context).

For years 501-1000, I calculated the average of the quartiles, as well as the percentage of active droughts as of a given year greater than 30 years. Since droughts for these 500 years are not independent of one another, be cautious with extrapolating those averages to anything else (for what it's worth, the medians are similar).

The average for these seasons was a first quartile drought of 8.4 years; a median drought of 20.2 years; a third quartile drought of 39.8 years, and a maximum drought of 115.0 years. In the average season, 34.4% of droughts exceeded 30 years (note that the current MLB figure is 12/26 = 46.2% of droughts exceeding 30 years, excluding the four subsequent expansion franchises, which suggests but in no way proves that, not surprisingly, the observed title distribution is not as egalitarian as the theoretical one used here).

Freezing it at year 1,000, this is what the drought picture looks like:

Even with new champions in 7 consecutive and 16 out of 20 seasons, a pretty typical 1/3 of droughts exceed 30 years, one team has exceeded the Cubs, and two more have exceeded the Indians.

The longest drought for any team during the millennium was 215 years. The poor fans of Team 6 celebrated a title in year 306, then went through many generations (or not, who knows, it's the future) before finally winning again in year 622. Then they waited another 120 years for good measure. Should baseball survive for 1,000 years with 30 or more teams, think about all of the narratives that the sportswriters of the future will get to craft.

Myth #4: The Royals Need to Be Explained

This is more of a meta-analytical comment than specific to the Royals, but there is an underlying notion, seen even on some sabermetrically-inclined outlets, that the Royals are an anomaly that demands our attention and an explanation. Please note that I am not criticizing the act of questioning ones premises, of attempting to update hypotheses as new data becomes available, of recognizing that we don't know everything about baseball, or anything of the sort. This all laudable. But such inquisition must not be confused with an imperative to find fault in one's null hypotheses either.

But there all too often is a reflexive desire to be too conciliatory, too eager to throw out one's existing knowledge and toolkit in an attempt to explain something that may just be a fluke. Witness "The Year That Base Runs Failed" (an article that demands a thorough undressing that I just do not have the will to give justice to right now). Recently this has seemed to manifest itself more at outlets that rely on 1) boisterous, opinionated writers and 2) daily content production.

When you are boisterous and opinionated, you need your opinions to be right in order to maintain credibility. If you have to blame the tools (Base Runs, W% Estimators, the entirety of sabermetric theory) that you used to justify your initial opinion, that's fair game. On the other hand, my position on the Royals doesn't demand I apologize for it (maybe I should--as I acknowledged above, I could be wrong, and inquiry into why that might be the case is healthy). My position is simply that the Royals were a fairly average team as indicated by their component statistics, but that sometimes teams outplay their component statistics. The Royals made the playoffs and over two seasons went 22-9, but a .500 team would go 22-9 or better with 1.5% probability--it's not likely but it also must happen now and again. You can disagree, but it's inherently a passive argument.

If you need to produce content daily, then you have to write about something, and writing "the sample size precludes us from drawing firm conclusions" over and over again doesn't drive readership. So there's a temptation to overfit your model, to declare that the secret sauce has been found, to cheat on the degree of certainty you require before you declare correlation to be causation, to investigate one positively correlated variable at the expense of other potential explanatory variables, to overreact to a year in which your metric's standard error is higher than it typically is.

Even great sabermetricians can get caught in this trap, and I have never been confused with a great sabermetrician but I have written things along these lines that I am not proud of as well. Bill James and Nate Silver have both, using different but understandable means when considered in the context of their work, failed pretty miserably at predicting playoff success based on historical data. The simple fact of the matter is that there were 32 playoff games (not counting the wildcard games) this season, which is fairly typical. At 30 games/season, you need five seasons to have a sample size the same as that of one major league team-season.

This is particularly problematic when so many of the attempts to explain playoff performance are based on theories about changes in the game. Contact superseding Moneyball, bullpen construction and usage patterns which have been in a constant state of change throughout baseball history...you could never have credible data without the conditions of the game shifting. This is not to say don't try to advance our understanding, it's to say be extremely cautious as you attempt to do so.

So what winds up happening is that a potential explanation ("Contact works, allow it" is a particularly poor paraphrase since it makes it sound like your pitchers should allow contact, but I saw that Colin Cowherd promo to many times not to use it) is honed in on, and maybe there's evidence of some effect, so other potential explanatory variables are ignored and the correlation is exaggerated and soon there's a truism that must be disproved rather than a hypothesis which must be proved.

There's a difference between saying "I don't know" and "No one will ever know". If it seems as if my school of thought arrives at the latter, that's a fair criticism. But I personally would rather be too certain about how much I can't know than to be too quick to think I've learned something new.

2 comments:

AnonymousNovember 3, 2015 at 8:10 AM
Hey P, Love the blog and you highly entertaining Twitter feed. I'm moved to comment on a few of the points here however, because I've been moderately surprised at the backlash against the Royals success in some circles and, it's just a guess, but I feel some of my favorite saber-followers are clouded by the understandable dislike of the owners, the logic-challenged manager, perhaps the unlikable attitude of the players, etc.

If I were your editor (admittedly a pompous construct) here's what I'd send back to you. Let's start with the trivial: On the modeling of droughts, effectively, the Royals have only played in a 30-team league for the past 3 years. Owing to only 14 teams in the AL from 1986-2012, they were in a 28-team league for the other 27 years. Again, that's trivial, but it does make their pennant drought (though not the WS title drought) a little more dramatic. Or, I suppose it only serves to greater underscore your point on their era(s) of incompetence.

In citing the Royals playoff success the last two years you wrote "A .500 team would go 22-9 or better with a 1.5% probability." This is true only if the .500 team is playing 31 games against teams that are league average as a whole. Or stated another way, it's true against teams which are its equal. But look who the Royals have gone through, the Angels, Orioles, and A's in '14 and Toronto and Houston in '15 plus the Giants and Mets. You know the winning percentages of those teams in those seasons, and apply whatever 3rd order win/true talent level you want, but it has to be way, way higher than league average. If it is, than the 1.5% probability is also way too high -- unless you're saying the Royals are even with those teams and really a .560 or .570 (or whatever) team. If so, that would appear to undermine your premise (and I believe the source of your annoyance) that they are, from a true talent perspective, "a fairly average team."

I confess to being quite surprised at the ROE stat that opens your piece. Because on a rate basis it means they are well below average, given the extra balls they hit into play. But JFC it's errors, and I don't even track them because I find the stat so utterly meaningless. So while you did debunk that small element of the premise, I believe the larger argument is this. The Royals won the World Series exactly the way many said they would -- and exactly the way they buried their competition in the AL Central -- by putting pressure on defenses and responding better to that pressure on their own. Consider this: Over just 5 games the Royals hit 30 (164 to 134) more balls into play than the Mets. Yet Kansas City's defense recorded just 10 fewer outs (103 to 113) on those batters who hit a ball into play (includes DPs and baserunner kills) than the Mets. It doesn't take long to realize that after about ten games at this pace you realize, "the Mets need to see 25,30 etc. more balls in play to record the same amount of outs as the Royals!" That, to me, is a repeatable path to victory, and in fact it's how they won all year (and last year.) The fact that during the World Series the miscues often happened late in games, giving the illusion that the Royals have some sort of late-game magic is nonsense -- putting constant pressure on a defense with more balls in play AND doing a better job of converting the ones that you face is not. All season (especially in the first half when they effectively sealed the division by the All-Star break) the Royals were way above average at doing this while the Mets were below. The fact that it played out exactly that way in the World Series is what, I believe, people of my ilk were referring to when they say the Royals force mistakes. It was satisfying to see something play out exactly as modeled.

Thanks for taking the time to expand on your thoughts in your blog. As opposed to drawing conclusions from someone's 140-character musings, it's very helpful.

Joe Peta
@MagicRatSF
ReplyDelete
Replies
pNovember 4, 2015 at 7:07 PM
Joe,

Thanks for your thoughtful feedback. (And sorry for not posting this promptly, I usually forget to check for comments to approve).

On the 30 team issue, you're absolutely right but it is also, as you say, very trivial. In a true, evenly constructed/championship-distributed 30-team league the probability of going 28 years without a championship is 38.7%; in the league/division sizes the Royals faced (accounting that there were 5 teams in the AL Central, which lowers their probability relative to the league which had an average of 4.8 teams/division) was 37.4%.

Your critique of "the probability of a .500 team going 22-9" discussion is fair, but you're reading more into it then I intended (which is the author's error in this case, not the reader's). It was intended more as a general point about a 22-9 record not being significant enough to compel an explanation. Whether it was closer to 0.5%, 2.0%, or anything else doesn't chance my contention on that. When I wrote the phrase "a fairly average team", I had in mind their .520 estimated W% based on Runs Created and Allowed. I don't disregard their actual W%, mind you, but it was actually meant to be complementary relative to what someone might perceive my position to be--that this was a true talent 78 win team winning it all.

It may be a semantic point, but when I hear "putting pressure on the defense" or "forcing mistakes", I think of errors (be they statistical errors or poor decisions like the ninth inning), not balls in play. Certainly the Royals' extreme contact rate is well-documented, but the end result over 162 games was a middling offense. I have yet to see any real evidence that this approach is of particular benefit in the postseason.

While I share the standard sabermetric contempt for the error as a concept, I actually think ROE are fairly important as they are one of the few ways in which the pressure applied to the defense would manifest itself beyond the traditional statistical record (baserunning is the other obvious area). The spread of team ROE over the course of a season (and the impact on run scoring) is roughly equivalent to the spread of team hit batters, which is not huge but also something to consider when bridging the gap between Runs Created and actual runs scored.

In any event, the 2014 Royals outhomered their postseason opponents 11-8, and while in 2015 they were outhomered 17-21, 17 home runs in sixteen playoff games is quite the prodigious output. But the media narrative played up the contact element while usually ignoring that in the playoffs they relied much more heavily on the longball than they had during either regular season.

With respect to the potential that I other saberfolk might be objectivity-challenged when it comes to the Royals, I can only speak for myself but there is certainly some truth there. As an Indians fan, I do find it annoying that during my time as a fan, the teams have shared the same division, the Indians have been a significantly better-run franchise, a significantly better performing franchise, a comparable performing team over the last two seasons in terms of component statistics), have had the best record in MLB 2.5 times, and have the same number of pennants and one less World Series than the Royals to show for it. I imagine Tigers fans might feel the same way. This is not jealousy, at least of anything but the one shiny trophy; I certainly would not trade the Indians fan experience from 1994-2015 for the Royals fan experience. However, I am also quite certain that the Royals would manage to get under my skin even if they played in, say, the NL Central, due to some of the other factors you mentioned.

Again, thanks for the comment. And I now have the sax solo from Jungleland running through my head which is never a bad thing.
ReplyDelete
Replies

Add comment

I reserve the right to reject any comment for any reason.

Walk Like a Sabermetrician

Monday, November 02, 2015

Royal Mythology

2 comments:

Me, Elsewhere

Analysis Links

Reference Links

Blog Archive

OSU Baseball

End of Season Statistics

Win Shares Walkthrough

NL 1876-1881 Series

Labels

About Me