Why AL/NL WARs Differ in a Given Year (Hint: it’s more obvious than I thought)

Recently I made the “shocking” discovery that the AL and NL don’t have the same season WAR totals (on a per-team basis), even before interleague play. Of course I wondered why that is. After much verbal head-scratching on my part, Ed very kindly pointed out that the obvious answer I had been rejecting was, indeed, the answer:

WAR formulas are intentionally tweaked to reflect the relative strength of the leagues; otherwise, player WAR could not be used for any meaningful comparison across different leagues or seasons.

Here’s the explanation that appears on B-R:

[T]the leagues are not always equal in their quality levels as evidenced by things like inter-league play and also player performances when shifting leagues. Taking these differences into account assign slightly different multipliers to the leagues, but centered on 20 for 162 game seasons and 19 for 154 game seasons. One example of this is the post-war integration. The National League integrated far more quickly than the American League and was a higher quality league until the 1970’s.

I still don’t know just how the difference in leagues is assessed. But as an exercise, I tried to see if league strength could be seen in the players who moved between the leagues during the period 1946-68 — the post-war, pre-expansion era, which is where I first noticed the large difference in league WAR. As it turned out, almost all of those who spent roughly 2 full seasons in each league looked better in the AL, relative to that league.

There were only 12 players who had 1,000+ PAs in both the AL and the NL during 1946-68. This list shows their OPS+ in each league for those years only:

  1. Chico Fernandez: AL 72, NL 61, +9 points of OPS+ in the AL
  2. Frank Bolling: AL 91, NL 79, +12
  3. Eddie Bressoud: AL 109, NL 81, +28
  4. Gino Cimoli: AL 84, NL 85, -1
  5. Tito Francona: AL 110, NL 92, +18
  6. Bill Bruton: AL 97, NL 95, +2
  7. Jackie Brandt: AL 103, NL 97, +6
  8. Harvey Kuenn: AL 112, NL 98, +14
  9. Roy Sievers: AL 127, NL 107, +20
  10. Dick Stuart: AL 119, NL 116, +3
  11. Frank Howard: AL 148, NL 125, +23
  12. Frank Robinson: AL 182, NL 150, +32

Average: +14 points of OPS+ in the AL.

And the ERA+ for the 23 pitchers with at least 429 innings* in each league during 1946-68:

  1. Jim Bunning: AL 116, NL 129, -13 points of ERA+ in the AL
  2. John Buzhardt: AL 100, NL 94, +6
  3. Gene Conley: AL 90, NL 107, -17
  4. Moe Drabowsky: AL 105, NL 94, +11
  5. Jack Fisher: AL 99, NL 84, +15
  6. Ron Kline: AL 118, NL 97, +21
  7. Mike McCormick: AL 94, NL 100, -6
  8. Cal McLish: AL 107, NL 91, +16
  9. Don McMahon: AL 141, NL 111, +30
  10. Stu Miller: AL 145, NL 107, +38
  11. Billy O’Dell: AL 128, NL 102, +26
  12. Claude Osteen: AL 111, NL 101, +10
  13. Milt Pappas: AL 113, NL 98, +15
  14. Juan Pizarro: AL 115, NL 88, +27
  15. Robin Roberts: AL 115, NL 113, +2
  16. Johnny Sain: AL 103, NL 109, -6
  17. Johnny Schmitz: AL 112, NL 106, +6
  18. Bob Shaw: AL 103, NL 107, -4
  19. Gerry Staley: AL 140, NL 100, +40
  20. Hoyt Wilhelm: AL 162, NL 130, +32
  21. Stan Williams: AL 111, NL 106, +5
  22. Jim Wilson: AL 98, NL 86, +12
  23. Al Worthington: AL 141, NL 102, +39

Average: +13 points of ERA+ in the AL.

These are small samples, but the consistency and the size of the difference strongly suggests that the NL had a significantly higher level of competition in this period. Eleven of the 12 hitters and 18 of 23 pitchers had higher “+” ratings in the AL.

This doesn’t explain why the AL won 13 of those 23 World Series, but we all know that a 7-game series isn’t as telling as a full season.

___________________

* Why 429 IP? I only wanted to use one page of P-I results, and 429 IP was the total for the 200th guy in the NL in that span.

0 0 votes
Article Rating
Subscribe
Notify of
guest

36 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
Mike L
Mike L
11 years ago

“This doesn’t explain why the AL won 13 of those 23 World Series” The Yankees won ten out of fifteen times. The rest of the AL was 3-5.

e pluribus munu
e pluribus munu
11 years ago
Reply to  Mike L

Which suggests that for much of that period, the distribution of talent was probably less balanced in the AL – though not as different from the NL as the simple tally of Yankee pennants might indicate.

MikeD
MikeD
11 years ago
Reply to  Mike L

Exactly. The NL was the stronger league, but that doesn’t mean the strongest team wasn’t in the AL.

Bells
Bells
11 years ago

Is OPS+, then, not adjusted for league? Baseball-reference seems to suggest it’s not… copy/pasted explanation of the measure:

Statistic Description: OPS+ 100*[OBP/lg OBP + SLG/lg SLG – 1] Adjusted to the player’s ballpark(s)

Just checking, though, as I’m not really familiar with how these things are calculated. Why would OPS+ be only league-relevant, but WAR be adjusted for minor differences in league strength? Perhaps it’s simple, but my brain isn’t working so well today.

bstar
bstar
11 years ago
Reply to  Bells

Yes, I think OPS+ is adjusted for league. At least that’s what Fangraphs says in their explanation of it:

http://www.fangraphs.com/library/index.php/offense/ops/

“…Since OPS+ adjusts for league and park effects, it’s possible to use OPS+ to compare players from different years and on different teams…”

bstar
bstar
11 years ago
Reply to  John Autin

I think you’re right about the interpretation. sorry bells.

Bells
Bells
11 years ago
Reply to  bstar

No, thanks bstar, I had the same interpretation as you and was like ‘wait, wouldn’t that make John A’s comparison tautological’? But I figured I must be wrong (John is usually pretty rigorous), and am glad that I am, because otherwise the mystery would still be there.

Forrest
11 years ago

Interesting. So now that we’re about to have seasons where there’s an interleague game EVERY DAY, will WAR stop being tweaked to reflect the league the player’s in?

e pluribus munu
e pluribus munu
11 years ago
Reply to  John Autin

Wouldn’t any change in the number of games alter the formula for weighting? After all, if interleague games were 50% of all games, the context discrepancy would presumably have shrunk to zero.

e pluribus munu
e pluribus munu
11 years ago
Reply to  John Autin

Right, JA, and another instance of my natural talent for making statements that are both trivially true and irrelevant. In this case I thought “every day” might have gotten confused with the concept of 50/50 balance. I craftily avoided clarity in my response.

kds
kds
11 years ago
Reply to  Forrest

Won’t make that much difference. From 252 inter-league games to 300 out of 2430 for the full schedule. Let’s compare Mays in the NL to Mantle in the AL. I’ve chosen 1955-1962 to get Mantle’s best years, Mays’ best 8 consecutive years were 1958-1965, so this is a little unfair to him. Mays had a few more PA, being healthier, and is just ahead in brWAR 69.2 to 68.1. Mantle was a considerably better hitter, 512 to 392 in Rbat and Mays’ advantage in defense leaves him still 42 behind by 42 RAA, 553 to 511. This leaves Mays behind… Read more »

no statistician but
no statistician but
11 years ago

JA: As usual, I have to quibble a little. The pattern of several of the position players here—Bolling, Francona, Kuenn, Sievers—shows a lower OPS+ in the NL at least in part because they had entered into their declining years as performers when they switched leagues. Stuart’s two big years in the AL don’t match his two best in the NL, and it’s only his clunker 1962 season that draws down his NL numbers. Jackie Brandt spent his prime years in the AL and his early and late years in the NL. The two big dogs at the end of the… Read more »

no statistician but
no statistician but
11 years ago
Reply to  John Autin

Yes—except that Conley actually spent his last years in the AL. Brain cramp.

Doug
Doug
11 years ago

I think comment #11 from kds explains the league differences more succinctly (and with greater precision) than what might be inferred from a grab-bag of test case players with selection difficulties such as those identified by nsb in comment #10.

Would be interesting to see the league-wide numbers kds mentions on a year-by-year basis, to see if they correspond to the perception that the AL has had a superior level of play in recent seasons.

Doug
Editor
11 years ago
Reply to  John Autin

John, This was the kds quote that intrigued me. The difference between replacement level and average was 22 runs/162 games in the NL, but only 18 runs/162 games in the AL. The average AL player was about 4 runs per season worse than the average NL player. It sounds like kds has looked at WAR minus WAA on a league-wide (all players) basis, and then converted to RBAT. Subtracting one from the the other may not be entirely legitimate statistically, but I liked the approach conceptually, based on the notion that replacement level should be about the same for both… Read more »

Bells
Bells
11 years ago
Reply to  John Autin

That’s still what bothers me too, John. The 1956 differences you cite are large, especially considering that a) the leagues were closed circuits save for a 4-7 game playoff series, and b) the number of samples of players switching, although telling a pretty clear story from your original post’s analysis, is really still small. A simple ‘sign test’ of binomial probability would certainly show it unlikely to randomly get 11 out of 12 position players to be better in one league, and without doing the math, 18 out of 23 pitchers being better in the AL is unlikely too, possibly… Read more »

Ed
Ed
11 years ago
Reply to  John Autin

John – From what I can tell, Fangraphs does NOT make this adjustment in their version of WAR. I’m looking at the line in the comparison chart that says “Varies Replacement Level by Quality of Competition”. Would be interesting to know why they chose not to adjust.

http://www.baseball-reference.com/about/war_explained_comparison.shtml

Ed
Ed
11 years ago
Reply to  John Autin

Perhaps John though as far as I’m aware the place in which bWAR “Varies Replacement Level” is between the leagues. I’ve never seen anything that would suggest that players within the same league are assigned different replacement levels based on not facing their own teammates. They may make adjustments for that but making adjustments is different than “varying replacement level”.

Ed
Ed
11 years ago

So nice to be mentioned in a HHS post! 🙂 As for the World Series, I have a few different thoughts. 1) One thing to look at would be total the batter and pitching WAR for each team in the WS and see how often the team with the higher WAR won. As commenters 1, 2 and 15 have noted, just because league A is stronger then league B, doesn’t mean that the top team from league A is stronger than the top team from league B. 2) Another thought is this…one thing we know about the playoffs is that… Read more »

Doug
Editor
11 years ago
Reply to  Ed

The fact that the Yankees so utterly dominated the AL during this period would also be support the premise of a lower quality of play in the AL. Easier to come out on top consistently playing against lesser opponents.

no statistician but
no statistician but
11 years ago
Reply to  Doug

Doug: The Dodgers and Giants together did a fair job of dominating the NL in that era, with 13 titles(Dodgers with 10) to the Yankees’ 15, so I’m not sure your argument is that strong. As for the rather startling 20% difference between leagues in 1956 cited by JA @ #18, I suspect it’s a crock, but I have no way to substantiate my suspicion. The NL had a three team race that went down to the wire, and the Yankees pulled away from the pack in early July, but in terms of competition within the leagues, the rest of… Read more »

Richard Chester
Richard Chester
11 years ago
Reply to  John Autin

Why is Chico Carrasquel on that list? His uncle Alex already had 8 years in the ML prior to his (Chico’s) arrival.

kds
kds
11 years ago

If you go to a players “value” section where the WAR computations are shown and highlight a part of the players career, you will get not only the totals of each column for that period, but also averages per year and per 162 games. So you can just look at Rrep per 162 games played. That is what I did for Mays/Mantle. For pitchers it is a bit more difficult since there isn’t a Rrep column. But you do have RAA and RAR, so the difference should be Rrep. (Have to be a little careful since different pitchers can have… Read more »

kds
kds
11 years ago

Now that I’ve done this work, I reread the explanations at B-ref of how they figure WAR. In the section on position players, down near the bottom, there is a table showing the # of replacement runs for a full time player for each league in each year, 1871-2012. Interestingly, the NL has a substantial lead in 1946, the year before the start of integration. The advantage for the NL continues to 1969.

Richard Chester
Richard Chester
11 years ago
Reply to  kds

See my post #72 under the “Talk About Athletics and Tigers ALDS Game 1” blog.