tag:blogger.com,1999:blog-12133335.post3264313151675522098..comments2015-01-26T05:43:03.818-05:00Comments on Walk Like a Sabermetrician: The Curious Case of Brooks Robinson's Batting Runs (rWAR)phttp://www.blogger.com/profile/18057215403741682609noreply@blogger.comBlogger11125tag:blogger.com,1999:blog-12133335.post-30376468856141138532010-11-29T12:04:03.115-05:002010-11-29T12:04:03.115-05:00Thinking about this has reminded me why I don'...Thinking about this has reminded me why I don't particularly like reconciling runs created to equal team runs. Doing it by altering linear weight coefficients makes the downside more obvious than when it is covered up by simply multiplying RC by the ratio of team R to RC (as Bill James does).<br /><br />Rally's approach assumes that the ancillary +/- run figures are without error--but in fact, they are subject to the same kind of error that the coefficient for any event is. All of the weight of reconciliation is borne by the batting events. <br /><br />In theory, the value of a batting event is the average change in RE due to events of that type. By using BsR to estimate that value, we doing some combination of 1) acknowledging that the sample size is insufficient in many cases to use empirical values and 2) making the calculations a lot easier (even if you could use a single team-season RE table with confidence, it would be a royal pain). One can view changing the coefficients due to an error in estimating actual runs scored as a shorthand way of adjusting the RE table--but actually doing so would also change the ancillary figures.<br /><br />The other issue I have is that the process of reconciling has more effect on players that are estimated to create a lot of runs (the percentage change will have a greater effect on them, which can also be seen through the fact that the absolute value of the out will change very little, while the values of the positive events will be more fluid). But I see no obvious reason why this should necessarily be the case.phttp://www.blogger.com/profile/18057215403741682609noreply@blogger.comtag:blogger.com,1999:blog-12133335.post-59156413144494093542010-11-27T00:42:25.770-05:002010-11-27T00:42:25.770-05:00Things aren't going to sum to zero because the...Things aren't going to sum to zero because the numbers are rounded at the player level. So things like +13, -18, etc. are completely expected. That does not explain the -142 for the batting runs AL 1980, at least I doubt it would.<br /><br />That one is on me, for years before 2010 BB-ref just took what I provided and loaded it onto the site. 2010 and on is Sean Forman's implementation. That is the worst figure of the DH era before pitchers took to bat again, some of the other years have differences like 2 or 4. I don't have an explanation but I'll let you know what I can find.Rallyhttp://www.baseballprojection.comnoreply@blogger.comtag:blogger.com,1999:blog-12133335.post-7368076875398302802010-11-26T22:28:16.112-05:002010-11-26T22:28:16.112-05:00The linear weight values are fairly close to what ...The linear weight values are fairly close to what I got. Using those weights I had the non-pitcher average as .173 R/O, so .169 is also in the neighborhood. I have the PF at .99, which I certainly wouldn't want to argue strenuously is more accurate than 1.01.<br /><br />Calculating as you just described, I get -14 versus your -13, so no problem there. It's possible that I overreacted to easily explicable differences.<br /><br />The ancillary category adjustment rubs me the wrong way (just an opinion), as it seems to treat those figures as correct and adjust real runs scored accordingly, placing all the burden for reconciliation on the batting component. <br /><br />I'm still a little concerned about the team/league figures (again with the caveat that this might well be a problem on B-R's end and not in your methodology. Some of the leagues, even those without pitchers batting, have offensive runs that don't sum to zero. For instance, 1980 AL:<br /><br />-142 bat, +5 bsr, -13 roe, -18 dp = -168 (12/ team)<br /><br />Of course, once you drill down to the player level the 168 runs is spread pretty thin, and there are systems that don't reconcile and thus start with similar differences.phttp://www.blogger.com/profile/18057215403741682609noreply@blogger.comtag:blogger.com,1999:blog-12133335.post-38144697079392862612010-11-26T20:54:47.802-05:002010-11-26T20:54:47.802-05:00"Let's say you have a team that scores 80..."Let's say you have a team that scores 800 runs in a pitcher's league, with the total of +15 baserunning/GDP/ROE runs you described. When you reconcile your BsR, are you reconciling to 800? 785 (this is what I gather from the quote above)? Something else?"<br /><br />785. If the team scores 800 in an average park where league average is 750, then I want the bat, baserunning, ROE, GIDP to sum to +50. Is that the best way to do it? I don't know. But that's how I did it.<br /><br />BB-ref has taken the numbers I provided and summed them up for a team up through 2009. For 2010, I believe Sean Forman is just using his linear weights batter runs in the WAR calculation. When we discussed the details, my opinion was that one measure was not better than the other, they usually come to the same results, and since he already calculated one no need to try and implement what may be an overly complex formula.<br /><br />For Brooks I get the following customized LW values for Baltimore:<br />1b .49 2b .80 3b 1.1 hr 1.42 ubb .32 ibb .13<br />hbp .35 out -.11<br /><br />The park factor for 1969 is 1.01 (I just used the baseball-databank figure), looks like it's the multi-year bpf on bb-ref.<br /><br />Then, after adjusting the RC from the above formula, subtract .169 * outs (AB-H only) to get runs above average.Rallynoreply@blogger.comtag:blogger.com,1999:blog-12133335.post-50085203633261759652010-11-26T18:52:50.754-05:002010-11-26T18:52:50.754-05:00There are a number of leagues that were non-pitche...There are a number of leagues that were non-pitcher hitting that don't add up to zero (although the discrepancies are of a much smaller magnitude, although the ones I checked are still around 4.5 runs/team). For example, for the 1987-93 AL, the league batting totals are:<br /><br />98, -99, 65, -99, -20, -74, 11<br /><br />I realize that this may be a B-R problem and not a problem on your end, as you don't provide the team totals on baseballprojection.com.<br /><br /><i>I set the baserun linear weights to deal with non-pitchers, and they zero out for those players. But the league total includes pitchers as well. So something like non-pitchers zero, pitchers -685 at the league level.<br /><br />Another step is to remove any runs from baserunning, ROE, and GIDP from the team totals so that these are not double counted. So if team runs (leaving aside pitcher hitting) is 750 runs, and the team was +15 baserunning, -5 in GIDP, and +5 in ROE, then I assume 735 runs need to be accounted for by the batting event</i><br /><br />Let's say you have a team that scores 800 runs in a pitcher's league, with the total of +15 baserunning/GDP/ROE runs you described. When you reconcile your BsR, are you reconciling to 800? 785 (this is what I gather from the quote above)? Something else?<br /><br />IMO, I'm not sure it's a good idea to use the estimated ancillary components as offsets against known runs scored, although I can see the upside to doing it this way. I think it might be a better approach to use unadjusted batting values, and then apply some sort of corrector to the team's total of batting + ancillary.<br /><br /><i>Colin Wyers has my contact information, I find it a little curious that he didn't ask me about this directly.</i><br /><br />Don't blame Colin--we were just having a discussion on Twitter, and I'm the one that went and escalated it to blog post level.<br /><br /><i>(Also, am I the only person that writes a comment here, hits posts and switches windows before it gets to the CAPTCHA?)</i><br /><br />I turned the CAPTCHA on because all of a sudden I was getting 40 spam Russian spam comments a day. My spam filter was catching them but when I almost deleted an actual comment by accident....I should probably try turning it back off, since I know I hate it on other sites.phttp://www.blogger.com/profile/18057215403741682609noreply@blogger.comtag:blogger.com,1999:blog-12133335.post-71907522917494614362010-11-26T18:44:32.005-05:002010-11-26T18:44:32.005-05:00As for why I didn't contact Rally about it dir...As for why I didn't contact Rally about it directly - I wasn't necessarily meaning to get into a discussion of the issue, we were simply discussing this blog post:<br /><br />http://www.beyondtheboxscore.com/2010/11/26/1834014/gidp-the-underrated-production-killer<br /><br />after it had been linked by Primer and I was rather taken aback by the figures for Brooks. It snowballed from there. I had meant to send you something, but my kid's off school today so I've been rather busy.<br /><br />(Also, am I the only person that writes a comment here, hits posts and switches windows before it gets to the CAPTCHA?)Colin Wyershttp://www.baseballprospectus.comnoreply@blogger.comtag:blogger.com,1999:blog-12133335.post-24982479217515318912010-11-26T18:27:41.607-05:002010-11-26T18:27:41.607-05:00Rally, Patriot corrected himself on the '69 le...Rally, Patriot corrected himself on the '69 league figures in the comments after I pointed the pitchers issue out to him.<br /><br />What I - and I think Patriot as well - are interested in is figuring out how you come to the values you do for Brooks. He obviously put more effort into the matter than I did, but neither of us are able to figure out how to go from the underlying components to the results you give.Colin Wyershttp://www.baseballprospectus.comnoreply@blogger.comtag:blogger.com,1999:blog-12133335.post-51820669757748156162010-11-26T18:13:57.359-05:002010-11-26T18:13:57.359-05:00The leage average team for 1969 shows -57 batting ...The leage average team for 1969 shows -57 batting runs, so the +40 on BB-ref is equivalent to +97.Rallynoreply@blogger.comtag:blogger.com,1999:blog-12133335.post-45686460646834579632010-11-26T18:04:52.445-05:002010-11-26T18:04:52.445-05:00No, there is no value in the team batting runs, be...No, there is no value in the team batting runs, because the intent was to measure the individual, and to put the players in DH leagues on an even playing field with those in non-DH leagues. If I was trying to compare team offense I would have done things differently.<br /><br />And if you want to ask me questions, you can email me at rallymonkey (numeral five) at comcast dot net.<br /><br />Colin Wyers has my contact information, I find it a little curious that he didn't ask me about this directly.Rallyhttp://www.baseballprojection.comnoreply@blogger.comtag:blogger.com,1999:blog-12133335.post-12564395687457851562010-11-26T17:59:39.591-05:002010-11-26T17:59:39.591-05:00When I first read this I thought the 1969 league f...When I first read this I thought the 1969 league figure had to be an error, but it actually isn't.<br /><br />I set the baserun linear weights to deal with non-pitchers, and they zero out for those players. But the league total includes pitchers as well. So something like non-pitchers zero, pitchers -685 at the league level.<br /><br />Another step is to remove any runs from baserunning, ROE, and GIDP from the team totals so that these are not double counted. So if team runs (leaving aside pitcher hitting) is 750 runs, and the team was +15 baserunning, -5 in GIDP, and +5 in ROE, then I assume 735 runs need to be accounted for by the batting events.<br /><br />Sean Forman's batting wins puts Brooks around +50, for a 20+ year career 30 runs is not an uncommon difference.Rallyhttp://www.baseballprojection.comnoreply@blogger.comtag:blogger.com,1999:blog-12133335.post-13569549943948784102010-11-26T13:57:04.892-05:002010-11-26T13:57:04.892-05:00I am learning today that I have a hard time rememb...I am learning today that I have a hard time remembering to account for pitcher hitting--it's been pointed out to me that the league totals remove pitcher hitting from the baseline, but include pitchers on the league level. While this explains some of the problem, it doesn't speak well for the value of the team Batting Run totals.<br /><br />Also, it doesn't fully explain the issue with the 1969 Orioles as a team. The 1969 AL average of BsR-figured RC/O is .173 for non-pitchers. The '69 Orioles made 4053 outs and scored 779 runs, which result in something like 779-.173*4053 = 78 batting runs, still a far cry from the 40 provided by B-R.phttp://www.blogger.com/profile/18057215403741682609noreply@blogger.com