Roger Clemens, 1996, and the other unluckiest pitching seasons of all-time

In 1996, Roger Clemens had an offseason by his standards, off enough that it may have been the spur to get him on steroids.  It certainly earned him a ticket out of Boston, off to a new team and a career rebirth in Toronto, and at the time, his departure might not have seemed unwarranted. By traditional metrics, 1996 was but a mediocre prelude for Clemens to winning back-to-back Cy Young awards and going 41-13 with a 2.33 ERA over 1997 and 1998. Clemens went 10-13 with a 3.63 ERA for the Red Sox in 1996, walking the most batters of his career with 106. Pushing 35, he looked to be on the decline, a shell of his once-dominant self.

Clemens did lead the American League in strikeouts in 1996 with 257. And in hindsight, we also know that he led the AL in strikeouts per nine innings with 9.5 and finished second in WAR with 7.7. In fact, it’s one of the best  losing seasons for a starting pitcher in baseball history.

One of my colleagues here, Doug, did a post a few days ago on if Matt Cain was the unluckiest pitcher ever. The post got me thinking. Doug looked at Cain’s career numbers compared to other unlucky hurlers, so I decided to take another look and compile some of unluckiest seasons for pitchers in baseball history.

The first is the ten highest WARs posted by starting pitchers with sub .500 winning percentages:

Rk Player SO WAR ▾ W-L% Year Age Tm G GS CG SHO W L SV IP H R ER BB ERA ERA+
1 Ed Walsh 258 8.7 .474 1910 29 CHW 45 36 33 7 18 20 5 369.2 242 87 52 61 1.27 189
2 Jon Matlack 195 8.6 .464 1974 24 NYM 34 34 14 7 13 15 0 265.1 221 82 71 76 2.41 149
3 Phil Niekro 262 8.5 .444 1977 38 ATL 44 43 20 2 16 20 0 330.1 315 166 148 164 4.03 111
4 Dave Roberts 135 8.5 .452 1971 26 SDP 37 34 14 2 14 17 0 269.2 238 79 63 61 2.10 157
5 Roger Clemens 257 7.7 .435 1996 33 BOS 34 34 6 2 10 13 0 242.2 216 106 98 106 3.63 139
6 Turk Farrell 203 7.4 .333 1962 28 HOU 43 29 11 2 10 20 4 241.2 210 91 81 55 3.02 124
7 Nap Rucker 151 7.4 .462 1912 27 BRO 45 34 23 6 18 21 4 297.2 272 101 73 72 2.21 151
8 Ned Garver 85 7.1 .419 1950 24 SLB 37 31 22 2 13 18 0 260.0 264 120 98 108 3.39 146
9 Irv Young 156 7.0 .488 1905 27 BSN 43 42 41 7 20 21 0 378.0 337 146 122 71 2.90 106
10 Bert Blyleven 219 6.7 .448 1976 25 TOT 36 36 18 6 13 16 0 297.2 283 106 95 81 2.87 125


And here’s a list that looks at the ten best ERA+ scores for starting pitchers with losing records and at least 162 innings pitched:

Rk Player SO ERA+ ▾ W-L% IP Year Age Tm G GS CG SHO GF W L SV H R ER BB ERA
1 Ed Siever 36 197 .421 188.1 1902 27 DET 25 23 17 4 2 8 11 1 166 73 40 32 1.91
2 Ed Walsh 258 189 .474 369.2 1910 29 CHW 45 36 33 7 7 18 20 5 242 87 52 61 1.27
3 Ben Sheets 264 162 .462 237.0 2004 25 MIL 34 34 5 0 0 12 14 0 201 85 71 32 2.70
4 Hal Newhouser 103 162 .364 183.2 1942 21 DET 38 23 11 1 14 8 14 5 137 73 50 114 2.45
5 Joe Magrane 100 161 .357 165.1 1988 23 STL 24 24 4 3 0 5 9 0 133 57 40 51 2.18
6 Dave Koslo 64 160 .440 212.0 1949 29 NYG 38 23 15 0 12 11 14 4 193 72 59 43 2.50
7 Curt Schilling 194 159 .471 168.0 2003 36 ARI 24 24 3 2 0 8 9 0 144 58 55 32 2.95
8 Ned Garvin 94 159 .238 193.2 1904 30 TOT 25 24 16 2 1 5 16 0 155 85 37 80 1.72
9 Dave Roberts 135 157 .452 269.2 1971 26 SDP 37 34 14 2 3 14 17 0 238 79 63 61 2.10
10 [tie] Dolf Luque 140 156 .471 291.0 1925 34 CIN 36 36 22 4 0 16 18 0 263 109 85 78 2.63
10 [tie] Dutch Leonard 92 156 .414 225.2 1948 39 PHI 34 30 16 1 2 12 17 0 226 85 63 54 2.51


And here are the ten best strikeouts per nine inning rates for starting pitchers with losing records and 162 innings pitched:

Rk Player SO SO/9 ▾ W-L% IP Year Age Tm G GS CG SHO GF W L SV H R ER BB ERA ERA+
1 Nolan Ryan 270 11.48 .333 211.2 1987 40 HOU 34 34 0 0 0 8 16 0 154 75 65 87 2.76 142
2 Curt Schilling 194 10.39 .471 168.0 2003 36 ARI 24 24 3 2 0 8 9 0 144 58 55 32 2.95 159
3 Nolan Ryan 327 10.35 .486 284.1 1976 29 CAL 39 39 21 7 0 17 18 0 193 117 106 183 3.36 99
4 Randy Johnson 241 10.31 .462 210.1 1992 28 SEA 31 31 6 2 0 12 14 0 154 104 88 144 3.77 105
5 Sandy Koufax 197 10.13 .381 175.0 1960 24 LAD 37 26 7 2 7 8 13 1 133 83 76 100 3.91 101
6 Ben Sheets 264 10.03 .462 237.0 2004 25 MIL 34 34 5 0 0 12 14 0 201 85 71 32 2.70 162
7 Nolan Ryan 260 9.97 .435 234.2 1978 31 CAL 31 31 14 3 0 10 13 0 183 106 97 148 3.72 98
8 Andy Benes 189 9.87 .300 172.1 1994 26 SDP 25 25 2 2 0 6 14 0 155 82 74 51 3.86 107
9 Jonathan Sanchez 177 9.75 .400 163.1 2009 26 SFG 32 29 1 1 2 8 12 0 135 82 77 88 4.24 101
10 Jake Peavy 215 9.56 .440 202.1 2006 25 SDP 32 32 2 0 0 11 14 0 187 93 92 62 4.09 99


Is this to suggest every man on these lists got screwed by his respective team? Maybe not. A number of factors can influence a pitcher’s win-loss record, and WAR, ERA+ and K/9 are all relative metrics that have varied between different eras in baseball history. Still, they offer a glimpse at pitchers who might have thrived in better environs.

0 0 votes
Article Rating
Subscribe
Notify of
guest

100 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
AlbaNate
AlbaNate
11 years ago

Maybe better than K/9 for that last list would be K/BB. Ryan usually had a lot of both.

Doug
Editor
11 years ago

Nobody is on all three lists, but Walsh, Roberts, Schilling and Sheets are on two of them.

What a difference a decade made for Ryan. Comparing his 1987 and 1976 seasons, his K/9, a league-leading 10.4 in 1976, was still 10% higher as a 40 year-old in 1987. And, but his BB/9, even at an elevated 3.7, was down by more than a third. Net result: ERA+ up 43%.

Doug
Editor
11 years ago
Reply to  Doug

Other big differennce – Ryan had 21 CG in 1976, none in 1987.

Evan
Evan
11 years ago
Reply to  Graham

Ryan was 8-16 with a league leading ERA in 1987. Clearly his team wasn’t scoring runs for him. It being the NL and his team being behind he was probably pinch hit for in many games he might have completed had his team scored a few more runs. It doesn’t explain 0, but lack of run support should definitely suppress CG in the NL.

no statistician but
no statistician but
11 years ago

My problem with accepting WAR (Wins Above Replacement) as a very accurate assessment of pitching—if you accept it for what it purports to be—is exemplified by Clemons 7.7 in this season. I simply can’t accept the notion that any credible replacement for Clemons would have a record of 2 and 20 on a team that won more games than it lost. WAR seems to favor pitchers of a particular range of skills and/or circumstance, and I don’t argue that it tells us something about those things. I don’t think it says much about who is the most valuable pitcher in… Read more »

Neil L.
Neil L.
11 years ago

No stat, you are not a heretic at all. You must hold our feet to the fire and give the metrics a reality check.

When you put Clemon’s WAR in such stark replacement terms, it does expose the limitations of pitcher’s WAR.

I’m still a believer, though, not a heretic.

BryanM
BryanM
11 years ago

No S but.. ; Scepticism is always an appropriate reaction when one is confronted with a complex chain of math reasoning, so Keep on with it. That said, Accepting that WAR for pitchers has limitations, and it does, It is not trying to say that a replacement pitcher would have gone 2-20 in Clemens particular starts; It says nothing about what would have happened with another pitcher in those individual games. There are several gaps to close to get to what it is saying. First, it is talking about team wins , not wins attributed to the starting pitcher; Clemens… Read more »

no statistician but
no statistician but
11 years ago
Reply to  BryanM

Let’s see what minds greater than ours have to say, not on precisely this topic, but on the nature of what lies beneath the strategies and assumptions involved in something like “a complex chain of math reasoning.” John Stuart Mill: “Strange it is that men should admit the validity of the arguments for free discussion, but object to their being ‘pushed to an extreme’; not seeing that unless the reasons are good for an extreme case, they are not good for any case.” It is the last half of Mill’s argument that I believe applies here. Alfred Korzybski: “The map… Read more »

Neil L.
Neil L.
11 years ago

No stat, I haven’t a dose of philosphophy and logic like your post since my university days. 🙂

So which side do you come down on with respect to WAR? Are we relying on it blindly to provide a degree of certainty that it is not capable of providing or is it still a step forward from traditional counting stats?

no statistician but
no statistician but
11 years ago
Reply to  Neil L.

My opinion, since you ask, is that the problem lies with uncritical acceptance of it as a perfect measure for all things. It works far better for batting stats, I feel, than pitching and fielding, although, even with batting, where there is far more that is genuinely quantifiable, someone still is deciding what weight to give, what to include or ignore. Used alongside traditional stats but not as a replacement, WAR gives a view that can help clarify what went on in general terms, just as OPS+ and the other mathematical formulas (if formulas is the right term) do. Several… Read more »

BryanM
BryanM
11 years ago

No Stat — the map is not the territory, but anyone trying to negotiate an unknown territory would prefer a map to no map, and of course a good map to a bad map. I agree with what i think are your main points, and have a minor quibble with what I think is a side issue. 1) WAR for pitchers is a not-very-relaible estimate of the value of the player’s contribution over time ,born of the strong desire by some people to sum up complex issues in a single number . We have little or no idea whether a… Read more »

BryanM
BryanM
11 years ago

this is a reply to #63 – I somehow screwed up… No Stat — the map is not the territory, but anyone trying to negotiate an unknown territory would prefer a map to no map, and of course a good map to a bad map. I agree with what i think are your main points, and have a minor quibble with what I think is a side issue. 1) WAR for pitchers is a not-very-relaible estimate of the value of the player’s contribution over time ,born of the strong desire by some people to sum up complex issues in a… Read more »

John Autin
Editor
11 years ago
Reply to  BryanM

Having only skimmed this thread, I’ll have to assume that the statement “every statistic is an estimate” makes sense in the context of the exchange. Taken by itself, it’s a head-scratcher.

Ed
Ed
11 years ago
Reply to  BryanM

Interesting discussion guys! Bryan..that’s a great point re: the lack of confidence intervals. As someone whose taken multiple classes in advanced stats, that thought never crossed my mind. I’m now going to hang my head in shame.

no statistician but
no statistician but
11 years ago
Reply to  BryanM

Mill wasn’t talking about logic but about truth. Otherwise, I don’t disagree with most of what you’re saying, but I have one proviso:
lots of players have great half-seasons or great months interspersed with mediocrity, so even as a measure of “a period as short as a season” WAR as a map can fall short of depicting the territory with accuracy.

Duke
Duke
9 months ago

If a minor league replacement had the same poor run support, poor bullpen work in his starts, and same poor defense behind him then yes his W-L record would be something horrible like 2-20. Baseball Reference explains why Clemens had a 7.7 WAR that season. His innings, his run allowed vs the runs scored of his opposition, the quality of his defense, the parks he pitched in (Fenway, Camden, Tiger Stadium etc). It makes sense. Clemens had a solid year. People were really dumb to think he was done at 33 when he lead in K’s, K/9, FIP and struck… Read more »

Doug
Editor
11 years ago

I wonder if the D-Backs after the 2003 season were thinking the same way as the Red Sox after 1996. Schilling was soon to be 37, had lost time to injury in the year, and did have that 8-9 W-L. But, he also just had the best ERA+ season of his career, his 3rd best K/9 and 4th best WHIP. Schilling also pitched into the 6th inning of ALL 24 of his starts, and into the 8th inning in 14 of them. Nevertheless, Arizona just couldn’t resist Boston’s package offer of Michael Goss, Casey Fossum, Brandon Lyon and Jorge De… Read more »

DaveKingman
DaveKingman
11 years ago
Reply to  Doug

Quantity for Quality. Chuckle. Snort.

Re: Larry Himes trade of Greg Maddux from the Cubs to the Braves.

Ed
Ed
11 years ago
Reply to  DaveKingman

Huh??? Maddux signed as a free agent with the Braves. He wasn’t traded.

Evil Squirrel
11 years ago

When I think of Clemens and unlucky seasons, I always immediately think of his start to the 2005 season. 11 starts, 76 innings pitched, 1.30 ERA, and a 3-3 record to show for it. The Astros scored a whopping 18 runs for him in those 11 starts, and he got three straight no decisions in 1-0 games in which he went 7 scoreless innings. Clemens would go on to lead the NL with a 1.87 ERA and a 226 ERA+ (both personal bests), but his mediocre 13-8 record thanks to that unlucky start probably cost him the Cy Young that… Read more »

bstar
11 years ago
Reply to  Evil Squirrel

It’s only been seven years, but I think if a pitcher puts up a 223 ERA+ and only wins 13 games in 2012, he’ll win the Cy hands down. Just ask Greinke and King Felix.

Voomo Zanzibar
Voomo Zanzibar
11 years ago
Reply to  Evil Squirrel

What version of karma, Evil? Hindu, Buddhist, Falun Gong, Sikh, or Western Casual? ________________________ I’m a big Niekro fan, and it certainly was bad luck for him to be on a team with a 78 ops+, (and he didn’t help that cause (-6 ops+)), but this seems like a case where WAR is more of a Counting Stat than an indicator of quality. The Braves had 13 starters that year. Knucksie had twice the IP of the #2 guy, and more than thrice the #3. He led the league in almost every counting stat, both good and bad, except for… Read more »

Neil L.
Neil L.
11 years ago
Reply to  Voomo Zanzibar

Voomo, a great comment, if I fully catch your meaning.

Under what conditions, do you think WAR for pitchers exhibits all the weaknessness of a counting stat like wins or losses?

bstar
11 years ago
Reply to  Voomo Zanzibar

Of course, WAR is a counting stat for hitters, too.

Voomo Zanzibar
Voomo Zanzibar
11 years ago
Reply to  bstar

@21 Exactly. So what would the equivalent be for a hitter? Niekro pitched 330 innings. Here’s the NL starters for 1977: http://www.baseball-reference.com/leagues/NL/1977-starter-pitching.shtml 41 guys started at least 25 games. The low inning count was 150. 8 guys topped 240 240 Sutton 252 Reuschel 258 Ed Halicki 261 Tom Seaver 267 J.R. Richard 283 Steve Carlton 302 Steve Rogers 330 Philip Henry Niekro Just eyeballing it I’m saying that the Average Full Time Starter pitched 200 innings. There were a handful of elite workhorses and Niekro was 10% beyond the one other anomaly. Niekro pitched 1.6x the average. 12 team league… Read more »

Neil L.
Neil L.
11 years ago
Reply to  Voomo Zanzibar

~Bows in awe to Voomo.~

Wow, how do you pack so much provocative knowledge into one comment?

Voomo Zanzibar
Voomo Zanzibar
11 years ago
Reply to  Voomo Zanzibar

I am procrastinating in a major way right now, that’s how.

Here’s another delightful waste of time, if you’re into it. Just set up a free seven player challenge at fanduel for tomorrow’s afternoon games:

http://www.fanduel.com/entry/PVHGXX

Ed
Ed
11 years ago
Reply to  Voomo Zanzibar

I hear where you’re coming from Zoomo. But doesn’t Niekro deserve some credit for being able to throw more innings than other pitchers? Also, as others on here have taught me, there’s not a direct relationship between ERA+ and WAR. The latter takes into account defense and supposedly the Braves had a particularly bad defense back then.

Richard Chester
Richard Chester
11 years ago
Reply to  Voomo Zanzibar

Just for comparison purposes since 1920 there have been 27 seasons of pitchers with > 330 IP. They all have WAR of 5.1 or higher. Niekro ranks 12th in WAR (8.5), 26th in ERA+ (111), 27th in wins (16) and tied for 1st in losses (20).

Doug
Editor
11 years ago
Reply to  Voomo Zanzibar

I’m still trying to understand how Niekro could have a 111 ERA+ with a 4.03 ERA. When the NL average ERA that year was 3.91.

Can anybody explain that?

Doug
Editor
11 years ago
Reply to  Voomo Zanzibar

Thanks, Graham.

That would make sense in homer-friendly Fulton County Stadium. The pitching park factor that year was a stratospheric 117.

Ed
Ed
11 years ago
Reply to  Voomo Zanzibar

Highest ERA with an ERA+ above 110…Pedro Astacio with the 1999 Rockies. (5.04 ERA, 115 ERA+).

At the 100 ERA+ threshold, the highest ERA was 5.26 by Tommy Thomas with the 1926 Browns. (5.26 ERA, 102 ERA+).

Evil Squirrel
11 years ago
Reply to  Voomo Zanzibar

What version of karma, Evil?
Hindu, Buddhist, Falun Gong, Sikh, or Western Casual?

I don’t know. I’m lazy, so I only use Instant Karma…

kds
kds
11 years ago
Reply to  Evil Squirrel

LoL. Of course, Joe Morgan always insisted that sabermetric karma could not run over his “I was a player so I know everything about baseball analysis” dogma.

vivaeljason
vivaeljason
11 years ago

Man…Ned Garvin had an especially rough 1904. A 1.72 ERA was really good even in the deadball era…but to go 5-16?!? AND ON MULTIPLE TEAMS?!? Jeez.

Mike L
Mike L
11 years ago

The Niekro 1977 season jumps out at me. 8.5 WAR with a ERA of 4.03 and an ERA+ of 111?

Neil L.
Neil L.
11 years ago

Graham, a very technical post. I am trying to wrap my head around what Roger Clemons’ position on the first list means. First off, he didn’t appear on either of the other lists. Second, his ERA for the 1996 season was the second highest on your list and was the highest for a power pitcher. My point, Graham, is that was he not the author of his own unluckiness? His walk total and ERA, at least partially, contributed to his poor record. Granted that he could not control his team’s offense in his starts, but your WHIP will catch up… Read more »

Duke
Duke
9 months ago
Reply to  Neil L.

The AL ERA in 1996 was 5.10. To this day, the run environment of 1996 in the AL was the 3rd highest ever behind 1936 and 1930.So 3.63 was actually a solid ERA for that environment. Boston’s defense was absolutely horrible that season. Clemens had horrible run support and the bullpen blew a ton of games. His K rate was 9.5, which led the league. He led in FIP, 3.43. His HR/9 (0.7) was about 40% less than the league’s rate of 1.2 per 9. And he pitched 242 innings. Clemens 10 – 13 record is the unluckiest record in… Read more »

Neil L.
Neil L.
11 years ago

Not at all, Graham. Don’t backpeddle in the slightest. We all refine our understanding of the game by having something to reflect on and react to. Your blog, with its lists, is a nice brain-builder. It requires a careful look at the various columns in the tables to try and discern cause and effect. I personally think that the WAR’s of pitchers in the 1900-1920 era is artificially boosted by the number of innings they pitched. By artificially, I mean the quality of batters they faced. Even though they come up in P-I searches, I don’t think it is a… Read more »

kds
kds
11 years ago
Reply to  Graham

Walter Johnson says to stay off the tracks when the “Big Train” is coming.

Shping
Shping
11 years ago

Ahhh, yes, my first time back here in many months and it’s great to see that the well-rounded baseball discussions continue! And to see that the WAR debate continues as well. Thanks NoStat for the heretical/helpful example that helps validate some of my misgivings too. And yet the WAR proponents continue to respond in polite, reasonable fashion. Curse them and their open-mindedness 🙂 If the WAR-ists really want input, tho, my incredibly mundane suggestion/first step would be to multiply it by 10. It just sounds more impressive to compare an 87 score to a 77 and is easier to get… Read more »

Neil L.
Neil L.
11 years ago
Reply to  Shping

Shping, (is that code for shipping? 🙂 ), does memory fail me or were part of the B-Ref cutting-edge community?

Voomo Zanzibar
Voomo Zanzibar
11 years ago
Reply to  Shping

@22 No, I have no idea what the recipe is for WAR. Looked it up once, and concluded that even with my pretty good head for math that I couldn’t calculate it in any kind of real-time meaningful way. And what is the fun in that? …Suddenly I am nostalgic for the GWRBI. Remember that one? The guy who hit that run-scoring forceout in the 2nd inning would sometimes get the GWRBI? I was sitting with the Bleacher Creatures on June 11th, 1988 when Billy Martin started Rick Rhoden at DH. Rhoden had a sac fly that was the GWRBI.… Read more »

Shping
Shping
11 years ago
Reply to  Voomo Zanzibar

Hee hee — gotta love the GWRBI. I might have to go look up an ’80s boxscore for old times sake, just to see, underneath the line score, “GWRBI: Kittle(7)”

Will we ever see Saves the same way?

Ed
Ed
11 years ago
Reply to  Shping

Shping – I agree 100% about the need for WAR publishers to be more transparent with how the numbers are calculated. The fact that they haven’t is a big black mark in my opinion.

Shping
Shping
11 years ago
Reply to  Ed

For some reason, i’m suddenly thinking about Tom Hanks in “Castaway”, trying to figure out the sounds of the coconuts falling from trees, saying, “What is that?!?”

bstar
11 years ago
Reply to  Shping

Well, shping, WAR does stand for Wins Above Replacement, so an 8.7 WAR player contributed ~9 more wins to his team than a replacement player would have. That’s pretty easy to digest and understand. It helps us to better quantify, “How much is player X actually helping his team?”. If you multiply it by 10, you would have to change the name of the stat and “87” would just become an arbitrary score. I, like you, was slow to warm to the WAR concept. But once you spend time with it, you learn to think in WAR terms. For example,… Read more »

Shping
Shping
11 years ago
Reply to  bstar

I know, i know, thanks. I can see how it becomes a matter of habit. I’m trying to embrace it. Still trying. Getting better with OPS. But i still need to be able to grasp exactly what the stat is saying in a way that WAR doesn’t do for me.

And i still say moving the decimal would help WAR. Just like we know that a “500” team is actually a 50 percent team that wins half its games, i think it would work the same way.

Shping
Shping
11 years ago
Reply to  Graham

Ok, the percent thing is another way of looking at it.

But i think i’ll still invoke the name of the almighty Bill James here. Didnt he once do some kind of rating system for new stats, based on categories like Accuracy, Reliability, Trendability, Digestibility, something like that? Ring a bell with anyone? (WAR, for instance, would probably score high in specificity or comparability, but low in transparency ).

Perhaps i’ll do some research on that and report back at a later date. Meanwhile, we got another weekend of baseball to enjoy!

Ed
Ed
11 years ago
Reply to  Graham

Along the same line, here’s an interesting article by Bill Simmons on some of the new baseball stats. I think he does a good job of capturing how the “typical” fan reacts to these sorts of things:

http://sports.espn.go.com/espn/page2/story?page=simmons/100402

bstar
11 years ago
Reply to  Graham

Ed, that was a fun ride of an article. I have never read much Simmons for whatever reason; his humor really stood out a lot. Is he always this good of a writer?

Ed
Ed
11 years ago
Reply to  Graham

Glad you enjoyed it. Yeah, he’s a very good writer and quite humorous. You have to put up with his Boston homersim (which he fully admits to) and he makes a lot of tv/popular cultural references that I don’t get since I don’t follow pop culture that much. Still, I think he represents an interesting viewpoint – that of the “common man” who’s slowly starting to get the more advanced metrics, not just for baseball but for other sports as well. Sadly, he doesn’t write as many articles as he used to. In the past, he would write 2-3 articles… Read more »

kds
kds
11 years ago
Reply to  Graham

We have general habits of using 1/100, i.e. %, but there is nothing necessary about that. Same for BA, OBP, SLG where we use hundredths. Since these are all rate numbers it doesn’t matter what we call it. 25%, .250 or .01 (base2) are all the same thing. But WAR is a counting stat, not a rate stat. It would make no sense to multiply HR or RBI or pitchers’ Wins by 10. We want to use WAR for things like analysing trades, free agent signings. Multiplying by 10 never helps, and makes it more complicated to use WAR with… Read more »

kds
kds
11 years ago
Reply to  Graham

I meant thousandths for BA, OBP, SLG. (A BA of 3 hundred is 300/1000.)

kds
kds
11 years ago
Reply to  Graham

The Bill James article on rating stats by Importance, Reliability, and Intelligibility is in the 1987 Abstract. He does not look at any “higher order” stats like WAR. Because few had been developed by that time. Of course he basically invented WAR in the article “MVP” pg 164-167 of that volume.

Hartvig
Hartvig
11 years ago
Reply to  Shping

My issue with OPS/OPS+ is that it tends to undervalue players like Rickey Henderson and Tim Raines and even though I know when I look at their numbers I still get this visceral “Oh… well…” feeling in my gut.

My issue with WAR is a) it’s incredible complexity and b) the fact that there is more than one formula and, on occasion at least, the outcomes vary wildly.

Shping
Shping
11 years ago
Reply to  Hartvig

Thanks for the input guys — enjoyed all of the info and references. Like i said, i’m trying to embrace WAR. Lots of objections still, as are being discussed here, along with simple issues of lifelong habits, stubborness and fact that we all need to remind ourselves from time to time that no stat of any type is perfect, the ultimate 3.1444 or e=mc2 that ultimately explains everything! 🙂 I don’t think or hope that anyone is actually saying that about WAR, right? But i think we’re guilty on both sides of the WAR arguments at times of either trying… Read more »

kds
kds
11 years ago
Reply to  Shping

rWAR for a starting pitcher. Take his RA, not earned runs. Compare this to his adjusted replacement level. Gross replacement level is about 1.25* league average RA. (Maybe a little less, (1.2?) I’m reverse engineering this from various sources.) You then adjust for park. And for his team’s defense. So a bad defense, in a hitter’s park, in a high scoring league, will give you a higher replacement level than if you reversed all those factors. 1996 was a high scoring year in the AL. Fenway had a one year park factor of 106. The Sox’s defense suxed that year… Read more »

Neil L.
Neil L.
11 years ago
Reply to  kds

I don’t think I’ve read as succinct a description of RAR as yours, kds.

One newbie question, though. Why is the gross replacement level for pitchers about 1.2-1.25 times the league average runs allowed? Why is not the league average itself?

Is the bump up of RA the differene between a replacemnet pitcher from the minors and an average major-leaguer?

May be a dumb question but thanks in advance for your patience.

kds
kds
11 years ago
Reply to  Neil L.

Your second paragraph basically has the correct answer to the first. Replacement level is what can get, from your minors or free agent signing, waiver deals, etc., for the league minimum. You should expect these to be worse than league average. A big question, not totally settled is where is replacement level? Tom Tango (insidethebook.com) thinks that replacement level for a starting pitcher is W/L = .380. I think this means that if you took the theoretical replacement level pitcher, had him pitch for a gazillion innings, (to avoid sample size issues), looked at his runs allowed, gave his “team”… Read more »

Shping
Shping
11 years ago

…But i also had to ask Voomo @ 14 and the EvilSquirrel @10:

Which kind of karma would bring Clemens back as, say, Steve Bartman, or Mario Mendoza?, or young Moonlight Graham? I’ll pick that one. (Samsara?)

Shping
Shping
11 years ago

Hey Neil — Ha ha, thanks i guess. I dont think i could ever describe myself as cutting edge, but i was definitely a BB-Ref devotee/participator for awhile in 2010-11. And i does like to ramble on at times and think about this amazing game of baseball and all the glorious history and stats and comparisons and random possibilities, and hear others do same — so be forewarned, i’ll be visiting again! — defending Braun’s honor as often as necessary too!

Neil L.
Neil L.
11 years ago
Reply to  Shping

Shping, be a regular here.

bstar
11 years ago
Reply to  Shping

I will happily defend Ryan Braun’s honor with you, shping. Guilty until proven 100% innocent turns my stomach.

Shping
Shping
11 years ago

Wow, an invitation even. Ok, no problem.

Teach me something about triples this year. Possibilities, trends, ballpark tendencies, something. Kind of a pet curiosity of mine.

kds
kds
11 years ago
Reply to  Shping

Triples have the highest Home Field Advantage of any batting event. Probably because home outfielders are much better at judging the wall and how the ball will bounce. They have a high variability in park factor also. Partly because of park size and partly because of the way the shape and texture of the outfield walls affects the number of odd bounces.

Neil L.
Neil L.
11 years ago
Reply to  kds

Why are triples on a slow but steady decline as a percentage of all plate appearances? Shouldn’t new “retro” parks have the dimensional quirkiness to create more triples?

I can vaguely remember having a few discussions about triples and their frequency back in B-Ref, but don’t recall any firm conclusions being reached.

Ed
Ed
11 years ago

Braun has honor???? 🙂

Ed
Ed
11 years ago
Reply to  Ed

Oops…in reply to Shping #26.

Shping
Shping
11 years ago

Hmmmm, not sure if or where the sarcasm is with Braun — i’ll accept your offer though bstar! — and we can save that topic for another time.

How bout that Ben Sheets as one of the unluckiest pitchers of all time, on two lists? He was dominating in Milw. Wonder if he’ll ever pitch again.

Could we possibly see a list of the luckiest picthers of all time? I’ll bet Pete Vuckovich is on that one, bless his heart!

Shping
Shping
11 years ago

I won’t deny that WAR can be useful, especially when it suits my purposes. 🙂 Sheets was unlucky.

Poor Matt Cain too. Easy guy to root for.

John Williams
John Williams
11 years ago

With WAR, ERA+ and K/9 as newer metrics; could be see a starting pitching with .500 or even a losing record win or finish in the top 5 or so for a Cy Young? Félix Hernández won in 2010 with 13-12 2.27. Below is a list of the .500 and under starting pitchers with votes for the Cy Young Award. 12 are on the list with 9 in the NL and half since 2000. Also, 9 were finished 6th place or lower in the voting and none in the top three. 2 are Hall of Famers (Ryan and Perry) and… Read more »

BryanM
BryanM
11 years ago

John Autin @68 A definition of statistical inference is the drawing of conclusions about a population by studying a sample (usually a random sample ) a true statistic (say the mean, or Standard deviation of the sample ) is always an estimate or guess about the population . (the mean of the sample is not the mean of the population) WAR is a detailed calculation, which is an estimate ,or guess, about the impact of a player on a team’s wins it doesn’t measure wins, it measures other things , and uses those things to guess about wins. The thread… Read more »

bstar
11 years ago
Reply to  BryanM

Bryan, I think WAR for starting pitchers is pretty solid. If you look at the all-time leaders in b-ref WAR, this list really, really passes the sniff test. There aren’t a whole lot of surprises; whomever, by general opinion, you think should be at the top is at the top and it just flows down from there. It’s actually pretty hard to look at one pitcher and say, “Wow, his WAR total looks way too high/low.” It’s a pretty solid list overall. What about relievers? Despite a lot of people making noise that perhaps leverage index should be weighted more… Read more »

BryanM
BryanM
11 years ago
Reply to  bstar

bstar, so do I . I made the same point about the career list that you do , back in @67 (thread starts @5) , when the sample size gets smaller ,like a season, the estimate gets less accurate, but in my opinion, still pretty good. The debate here is about how good . In 1996 Clemens went 10-13 for the Sawks with a 7.7 WAR , got no Cy votes , a young Andy Pettitte went 21-8 with a 5.7 WAR, 2nd in CY. Now was Clemens better than Pettitte in 1996, or not? Without confidence intervals, we can’t… Read more »

Duke
Duke
9 months ago
Reply to  BryanM

Clemens was the best pitcher in the AL back in 1996. His peripherals were better than anyone else. K/9 was the second highest in his career at that point.( 9.5) He allowed 40% less homeruns than league average, and led in FIP. He did walk alot of guys..but the Sox defense was horrible that season. Why anyone thought Clemens was “done” at 33, when he lead the league in Ks, K/9 and struck out 20 in a game is just another example of how dumb Duquette was, and how over reliant fans and media were with W-L record Under today’s… Read more »

Neil L.
Neil L.
11 years ago
Reply to  BryanM

Trying to distil your thoughts down about WAR as a reliable statistic for pitcher quality, Bryan, while not being perfect, do you think it is the best we’ve got?

BryanM
BryanM
11 years ago
Reply to  Neil L.

Neil — I thinks it’s close for starters , perhaps RA+ would be a little better — ( add back the unearned runs and recalculate ERA+ ) ERA is biased in favor of weaker pitchers. If the Phillies give the other team 4 outs with Doc on the mound , it may not matter – if a bad pitcher allows 3 unearned after an error, his ERA looks fine, he’s still pitched badly, the team still loses.
For relievers , i think WHIP (include HBP?) is better because of the whole inherited runner issue. What are your thoughts/

Neil L.
Neil L.
11 years ago
Reply to  BryanM

At risk of sounding like an HHS “recruiter”, will you post more frequently here or are you too involved at other baseball sites?

Neil L.
Neil L.
11 years ago
Reply to  BryanM

Don’t want to give you a superficial reply, Bryan, so I won’t respond in detail until tomorrow. I’m finding my own thinking is being clarified by your posts.

I’m working a little mini-study involving teams blowing four saves in their first eight games …. trying to find how rare/common it is.

I hope to put the results up in a few minutes, perhaps Doug’s blog. (I realize it will be a hijack, but there is nowhere else to send it)

Thanks.

John Autin
Editor
11 years ago
Reply to  Neil L.

Neil, remember that not all blown saves happen in the late innings. Every reliever who enters with a lead* is technically working on a save opportunity, even if he’s a LOOGY working in the 6th inning.
__________
*And when a prior pitcher is eligible for the win.

John Autin
Editor
11 years ago
Reply to  Neil L.

To clarify … I sense that you are seeking a cohort for Toronto’s early-season experience of blowing 4 late leads. My point is that searching for “blown save” is not necessarily going to pick out the games you’re after. If you want to catch late-inning blown saves, you might consider adding a WPA requirement — say, minus-0.25 or less.

Neil L.
Neil L.
11 years ago
Reply to  Neil L.

I’m just using the official version of the stat, JA. Of course, what you say in true.

John Autin
Editor
11 years ago
Reply to  BryanM

Bryan, thanks for the capsule summary.

I can tell you know your way around the subject and the language (probably better than I do), so you don’t need me to tell you that using the word “statistic” with no modifier in a sentence like “every statistic is an estimate” is liable to create confusion among an audience to whom “statistics” include simple counts of events like home runs and wins.

So, um, I won’t say that. 🙂

BryanM
BryanM
11 years ago
Reply to  John Autin

John , Thanks for that . What our exchange has shown me is that I need to be more careful in my terminology. When we say that Albert Pujols drove in 99 runs last year, calling that a “statistic” is just fine, and it’s clearly not an estimate (but what about Hack Wilson’s 190 or 191? ) . Emphasizing “guess” in my posts is a way of protesting against the sort of argument that goes “A had more blue jellybeans than B last year, had almost twice as many red marbles, and was only picked off once, so clearly A… Read more »

BryanM
BryanM
11 years ago
Reply to  John Autin

Part deux .. a long digression. I remember precisely when the light went on for me about counting one thing when you are really interested in something related, but different. Fall of 1970, the Orioles were on a tour of Japan after the season , and with the no-split-screen TV technology of the day, they were interviewing Earl Weaver in the dugout with the game in progress. Earl had his back to the field when Frank Robinson’s bat made an unmistakeable sound. Weaver, who was in mid sentence, interrupted himself to say “there’s two” in a calm voice without looking… Read more »

BryanM
BryanM
11 years ago
Reply to  BryanM

there was a man on base at the time

Shping
Shping
11 years ago

Great allegory BryanM.

And i thought you were simply going to tell the story about Weaver, sitting in his underwear in office and smoking a stogie during the natl anthem, telling a reporter, “Don’t worry kid. We do this every day”!

That aside, it seems like Weaver was definitely ahead of his time in a lot of what we call sabermetric thinking. (“The heck with wasting an out on a sacrifice; i’ll take my chances with the lefty hitting a 3-run homer” is a deceptively simple yet advanced point of view.)

Andy
Admin
11 years ago

Graham has come out on HHS on fire, generating tons of comments on his weekly posts…wow!

Neil L.
Neil L.
11 years ago
Reply to  Andy

I’ve learned a lot about WAR, Andy, from Graham’s post and the intelligent debate in the comments.

A lot of things I felt hesitant to ask about WAR for pitchers.

Kudos, Graham!! ~thumbs up, high five~

BryanM
BryanM
11 years ago
Reply to  Andy

Graham has hit rich vein of ore with a simple proposition ; Find one stat that “says” not very good, correlate it with one that says “real good” ,add an emotion-soaked adjective like “unlucky” and stand back. This thread is/was great and I have learned a lot from Graham and the other posters,

Much thanks, Graham