Category Archives: Probabilistic Model of Range

November 22, 2016

Pitchers and Defense

Baseball Think Factory links to a developing debate between Joe Posnanski and Sean Forman about the defensive adjustments to pitcher WAR at Baseball Reference. As someone who dabbled in measuring defense, figuring out how much some defense helped or hurt a particular pitcher is a tough problem.

The Posnanski piece wonders why rWAR rates Justin Verlander so much higher than Rick Porcello. The difference comes down to a defensive adjustment. The Red Sox rate as an excellent defense, the Tigers as a poor defensive team. Posnanski’s argument is that the Tigers played great defense behind Verlander, as his BABIP was .256 versus the team BABIP of .302. Forman’s argument is that the team’s defense is the team’s defense, and there’s no reason to think they got good because Verlander was on the mound. There is a lot of random variation in baseball, because nearly every set of individual stats is a small sample size.

When I worked on the Probabilistic Model of Range, one thing I liked to do was look at the expected outs a pitcher produced versus the actual outs recorded. Here’s a post from the end of the 2011 season, when Verlander won the Cy Young Award:

Notice that Justin Verlander and Josh Beckett both produced high expected DERs, but the defense magnified that by turning so many more batted balls into outs. Verlander pitched very well, but his defense gave him that extra push into the Cy Young.

Maybe that happened again. Maybe Verlander is just so good at inducing poor contact that even a bad defensive team looks good. I would love to know the expectations for Verlander’s and Porcello’s balls in play.

May 25, 2015

Crediting Pitchers and Fielders

Tom Tango and Phil Birnbaum discuss crediting pitchers and fielder properly on outs and hits on balls in play.

When I worked on the Probabilistic Model of Range (PMR), I liked to list PMR behind pitchers. The display would show the probability of the pitcher’s balls in play being turned into outs, and the actual percentage of outs recorded by the fielders. I ran it as a way of seeing pitchers who were helped or hurt by their defense, but it also showed which pitchers were doing a good job of inducing easy or tough to field balls in play.

It strikes me that this could be used as a way to properly weight credit to the pitcher and the fielder. Pitchers who induce easier to field batted balls simply get more credit that those who make life tough for the fielders. I’m sure some clever person could work this out.

March 2, 2014

Thought Experiment

Given that new defensive data will be available, how do we use it to measure range and positioning? Range in this case represents the ability to make plays by moving to a ball, positioning represents the ability to be in the right place at the right time.

A concept that will be useful to positioning for grounders and low line drives is shortest distance to the ball. Imagine the fielder standing at a fixed point on the field when the ball is put into play (not necessarily true, but a good first approximation). With home plate as the origin (the intersection of the foul lines), we can draw a line from the origin through the position of the fielder. We can also draw a line that represents the path of the ball. The shortest distance to the ball will be along a line from the fielder that intersects the path of the ball, perpendicular to the path of the ball.

Players who position themselves well should have a high probability of a low distance to the ball. That’s not quite enough, however. The straight away positions for the players are based on the distribution of batted balls, set where there is the highest local probability of a ball being hit. By standing at a straight away position, a fielder is giving himself a good chance to being close to a batted ball in his zone of opportunity.

The closeness probabilities for a fielder should be compared to the closeness probabilities if the fielder always played straight away. In other words, can a fielder to better than the default.

For balls higher in the air, we simply want to measure where the ball landed compared to where the fielder started, but once again we want to compare the result to a straightaway fielder.

Distance to the ball also comes into the range calculation. For that we want to measure the probability of making a play given the distance to the ball. The assumption is that the further away from the ball, the lower the probability of making a play. If we calculate the rate of decline in probability as a function of distance, players with great range should have a rate that is less negative. If this relationship was linear, we would say that players with great range have a less negative slope.

We could then rate players in a given season on a two dimensional scale of range and positioning. The fielders who performed best at PMR or UZR would likely do well in both positioning and range. Those at the other end might be poor at both positioning and range. Players with good range but poor positioning might look good, since their range can cover the weakness of position. Players with good position and bad range might look average. They would get bonuses for the times they were positioned well, but lose points for easy plays they were not close enough to convert.

Positioning may be a team skill more than a player skill. It will be interesting to see if we can separate out the two. For example, we might look for players who position themselves better than the team as a whole. We might look for players who are consistently good or bad from team to team. Do more experienced players position themselves better than less experienced players?

I look forward to seeing the research that falls out of this data.

March 22, 2012

More on Shifting

Earlier today I linked to a Bill James piece on using the shift against David Ortiz.

Having thought about it some more, this struck me as interesting:

Sounds a little low, but let’s go with it. To show the unique importance of David Ortiz in this discussion: The Shift has been used against Ortiz, over the two years, 486 times, not counting the strikeouts and walks and such like, but the most times that any team has USED the shift over the two years is 437, by Tampa Bay, and no other team is even over 300. It’s a definition of dominating a category. Remember all those articles pointing out that Babe Ruth hit more home runs than any other TEAM in 1920? Same thing; the number for an individual is greater than the number for any team. We should also remember to give The Fielding Bible credit for giving us hard data as to the number of times in a season Tampa Bay uses the shift.

Now, James discusses specifically using the shift against Ortiz, but there’s nothing special, as far as I can tell, about Ortiz as a left-handed pull hitter. Is he all that different from others who see the shift, like Mark Teixeira or Jason Giambi? My answer at the moment is I don’t know.

I do find it interesting, however, that Tampa Bay uses the shift extensively and ranks near the top in defensive metrics. The objective probabilistic model of range (PMR) rates them the best defensive team in the majors in 2011, as does UZR. Shifting for the Rays doesn’t seem to be hurting them, since they’re making plays most teams don’t. I suspect the Rays believe it helps, or they wouldn’t do it so often. I’d like to see if they always shift against specific batters, or if the batter/pitcher combination makes a difference.

March 8, 2012

Objective PMR, Pitcher Support

One nice thing about the Probabilistic Model of Range (PMR) is that it can be used to explore more than fielding skill. Applied to the defense behind a pitcher, it can be a measure of how much that hurler was helped or hurt by his defense. The following chart shows objective PMR stats for defenses with a particular pitcher on the mound in 2011, minimum 300 balls in play:

Objective PMR, performance behind pitchers. Model built on data from 2005-2010, visiting teams only.
Pitcher In Play Actual Outs Predicted Outs DER Predicted DER Index
Josh Tomlin 525 387 356.4 0.737 0.679 108.6
Josh Beckett 511 382 356.2 0.748 0.697 107.3
Justin Verlander 635 477 445.5 0.751 0.702 107.1
Jeremy Hellickson 560 431 404.4 0.770 0.722 106.6
Randy Wolf 636 447 420.0 0.703 0.660 106.4
Doug Fister 669 478 453.7 0.714 0.678 105.4
Ricky Romero 620 457 434.8 0.737 0.701 105.1
Kyle Lohse 525 375 357.3 0.714 0.681 105.0
Kyle Kendrick 346 251 239.1 0.725 0.691 105.0
Jeff Niemann 407 292 278.4 0.717 0.684 104.9
Randy Wells 393 277 264.2 0.705 0.672 104.8
Chris Narveson 473 328 313.6 0.693 0.663 104.6
Blake Beavan 332 235 224.9 0.708 0.678 104.5
Joe Saunders 652 469 448.8 0.719 0.688 104.5
Ervin Santana 665 474 454.3 0.713 0.683 104.3
James Shields 655 478 458.4 0.730 0.700 104.3
Fausto Carmona 606 420 403.2 0.693 0.665 104.2
Gavin Floyd 569 405 389.0 0.712 0.684 104.1
Jered Weaver 649 481 462.0 0.741 0.712 104.1
Ian Kennedy 563 405 389.0 0.719 0.691 104.1
Jhoulys Chacin 502 363 349.3 0.723 0.696 103.9
Alexi Ogando 500 360 347.0 0.720 0.694 103.7
Tommy Hanson 316 225 217.0 0.712 0.687 103.7
Carlos Villanueva 339 242 233.6 0.714 0.689 103.6
Johnny Cueto 428 310 299.5 0.724 0.700 103.5
Alfredo Aceves 326 246 238.0 0.755 0.730 103.4
Javier Vazquez 549 388 375.1 0.707 0.683 103.4
Luke Hochevar 615 440 425.6 0.715 0.692 103.4
Philip Humber 499 357 345.8 0.715 0.693 103.2
Cole Hamels 542 391 378.9 0.721 0.699 103.2
Dan Haren 703 502 486.9 0.714 0.693 103.1
Ivan Nova 530 371 360.0 0.700 0.679 103.1
Jeremy Guthrie 629 445 431.5 0.707 0.686 103.1
Bronson Arroyo 650 464 450.2 0.714 0.693 103.1
Carlos Carrasco 368 256 248.6 0.696 0.675 103.0
Jair Jurrjens 453 327 317.6 0.722 0.701 103.0
Guillermo Moscoso 398 300 292.2 0.754 0.734 102.7
David Price 606 428 416.9 0.706 0.688 102.7
Tim Stauffer 524 372 362.2 0.710 0.691 102.7
Tim Hudson 612 434 423.6 0.709 0.692 102.5
Chris Carpenter 664 445 434.1 0.670 0.654 102.5
Wandy Rodriguez 505 350 342.1 0.693 0.677 102.3
Jordan Lyles 303 204 199.4 0.673 0.658 102.3
Freddy Garcia 466 323 316.0 0.693 0.678 102.2
Jason Hammel 531 371 363.5 0.699 0.685 102.1
Homer Bailey 386 270 264.4 0.699 0.685 102.1
Ted Lilly 516 375 367.2 0.727 0.712 102.1
Carlos Zambrano 437 304 297.7 0.696 0.681 102.1
John Lannan 568 385 377.9 0.678 0.665 101.9
Cliff Lee 594 417 409.3 0.702 0.689 101.9
Josh Collmenter 456 338 331.7 0.741 0.727 101.9
Vance Worley 355 250 245.4 0.704 0.691 101.9
J.A. Happ 411 279 274.0 0.679 0.667 101.8
Dillon Gee 465 330 324.5 0.710 0.698 101.7
Michael Pineda 446 324 319.5 0.726 0.716 101.4
Jon Lester 491 347 342.5 0.707 0.697 101.3
Jake Peavy 338 230 227.3 0.680 0.673 101.2
Mike Leake 454 319 315.1 0.703 0.694 101.2
Dustin Moseley 376 264 261.0 0.702 0.694 101.1
Kyle McClellan 457 323 319.5 0.707 0.699 101.1
Bud Norris 468 324 320.9 0.692 0.686 101.0
Shaun Marcum 532 381 377.6 0.716 0.710 100.9
C.J. Wilson 609 422 418.4 0.693 0.687 100.9
Bruce Chen 482 341 338.4 0.707 0.702 100.8
Scott Baker 375 261 258.9 0.696 0.691 100.8
Brandon McCarthy 531 365 362.5 0.687 0.683 100.7
Edwin Jackson 613 402 399.5 0.656 0.652 100.6
Livan Hernandez 563 382 379.7 0.679 0.674 100.6
Colby Lewis 571 414 411.4 0.725 0.720 100.6
Justin Masterson 643 436 433.7 0.678 0.675 100.5
A.J. Burnett 541 370 368.7 0.684 0.682 100.4
Brett Cecil 376 272 270.9 0.723 0.720 100.4
Tim Wakefield 482 343 341.6 0.712 0.709 100.4
Wade Davis 596 422 420.4 0.708 0.705 100.4
Brandon Morrow 471 320 318.6 0.679 0.677 100.4
Hiroki Kuroda 566 399 397.2 0.705 0.702 100.4
Carl Pavano 753 509 507.0 0.676 0.673 100.4
Anthony Swarzak 346 238 237.4 0.688 0.686 100.3
Chris Volstad 512 344 343.1 0.672 0.670 100.3
Erik Bedard 353 243 242.4 0.688 0.687 100.3
Jake Westbrook 574 387 385.9 0.674 0.672 100.3
Jason Marquis 405 274 274.4 0.677 0.677 99.9
Jordan Zimmermann 466 325 325.2 0.697 0.698 99.9
Jake Arrieta 330 234 234.3 0.709 0.710 99.9
Ryan Dempster 564 376 376.4 0.667 0.667 99.9
Clayton Kershaw 540 383 383.9 0.709 0.711 99.8
Gio Gonzalez 532 369 370.2 0.694 0.696 99.7
Matt Cain 303 217 217.6 0.716 0.718 99.7
Jason Vargas 640 445 446.6 0.695 0.698 99.6
Joel Pineiro 513 340 341.4 0.663 0.666 99.6
Tyler Chatwood 468 312 313.4 0.667 0.670 99.5
Zachary Britton 475 324 325.5 0.682 0.685 99.5
Chad Billingsley 513 350 352.2 0.682 0.687 99.4
Brett Myers 604 414 416.3 0.685 0.689 99.4
Derek Holland 586 394 396.7 0.672 0.677 99.3
Mark Buehrle 681 472 475.2 0.693 0.698 99.3
Felix Hernandez 649 440 443.8 0.678 0.684 99.1
R.A. Dickey 611 429 433.1 0.702 0.709 99.1
Francisco Liriano 383 267 269.8 0.697 0.704 99.0
Trevor Cahill 634 428 433.3 0.675 0.683 98.8
Roy Halladay 650 449 454.9 0.691 0.700 98.7
Matt Harrison 574 393 398.4 0.685 0.694 98.6
Nick Blackburn 497 324 328.9 0.652 0.662 98.5
Zack Greinke 425 279 283.4 0.656 0.667 98.4
Bartolo Colon 495 337 342.7 0.681 0.692 98.3
Ubaldo Jimenez 525 348 354.5 0.663 0.675 98.2
Yovani Gallardo 554 382 389.2 0.690 0.703 98.2
Brad Penny 623 420 428.0 0.674 0.687 98.1
Mike Pelfrey 624 425 433.7 0.681 0.695 98.0
Chris Capuano 517 346 353.4 0.669 0.684 97.9
CC Sabathia 670 443 454.1 0.661 0.678 97.6
Anibal Sanchez 500 332 340.3 0.664 0.681 97.6
Max Scherzer 551 373 382.5 0.677 0.694 97.5
Jaime Garcia 570 377 387.2 0.661 0.679 97.4
Brad Bergesen 332 225 231.4 0.678 0.697 97.2
John Danks 521 352 362.6 0.676 0.696 97.1
Madison Bumgarner 300 203 209.1 0.677 0.697 97.1
Daniel Hudson 595 402 415.0 0.676 0.697 96.9
Mat Latos 481 335 345.9 0.696 0.719 96.9
Clayton Richard 314 211 217.7 0.672 0.693 96.9
Jo-Jo Reyes 479 317 327.6 0.662 0.684 96.8
Matt Garza 523 347 358.3 0.663 0.685 96.8
Ricky Nolasco 634 414 427.5 0.653 0.674 96.8
Aaron Cook 343 223 230.3 0.650 0.671 96.8
Brian Duensing 502 330 341.8 0.657 0.681 96.6
Rick Porcello 587 388 401.7 0.661 0.684 96.6
John Lackey 540 351 363.9 0.650 0.674 96.5
Derek Lowe 607 397 411.2 0.654 0.677 96.5
Alfredo Simon 357 238 246.7 0.667 0.691 96.5
Aaron Harang 477 334 346.3 0.700 0.726 96.5
Brandon Beachy 342 231 239.7 0.675 0.701 96.4
Jeff Francis 649 433 449.6 0.667 0.693 96.3
Phil Coke 356 233 242.2 0.654 0.680 96.2
Jonathon Niese 494 320 333.1 0.648 0.674 96.1
Danny Duffy 316 209 218.7 0.661 0.692 95.6
Roy Oswalt 430 294 310.6 0.684 0.722 94.7
Felipe Paulino 380 248 263.1 0.653 0.692 94.3

Notice that Justin Verlander and Josh Beckett both producEd High expected DERs, but the defense magnified that by turning so many more batted balls into outs. Verlander pitched very well, but his defense gave him that extra push into the Cy Young.

At the other end of the spectrum, the Phillies defense hurt Roy Oswalt. His balls in play were relatively easy to field, but the Philadelphia defense let Roy down. A team with a good defense might do well to lure Oswalt to their mound.

Those predicted DER numbers should not be taken as an absolute, however. Parks play a big factor in those. We would expect them to be lower in Coors, for example, than in PETCO. So the index represents how much the defense helped or hurt the pitcher, but the predicted DERs should not be used by themselves to rate the hurler.

In objective PMR, where we don’t measure the direction or the speed of the ball off the bat, it could be that someone like Justin Verlander was helping his defense, instead of the other way around. When batters have trouble making contact against a pitcher, they probably have trouble making good contact as well. It’s possible that the balls in play against Justin were simply easier to field.

If you like the objective PMR series, please consider donating to the Baseball Musings Pledge Drive.

March 7, 2012

Objective PMR, Pitchers

The series on objective probabilistic model of range (PMR) continues by looking at pitchers. Like catchers, range is a small component of a pitcher’s defensive value. I’ll show teams as a whole at the position, plus individuals who were on the field for 300 balls in play. First the teams:

Objective PMR, team pitchers, 2011. Model built on data from 2005-2010, visiting teams only.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
SDN 3083 217 173.1 0.070 0.056 125.4
CLE 3389 200 166.5 0.059 0.049 120.1
LAN 2666 195 163.9 0.073 0.061 118.9
ARI 2960 183 158.3 0.062 0.053 115.6
COL 3053 213 187.4 0.070 0.061 113.7
NYN 3089 224 199.9 0.073 0.065 112.1
BAL 3180 189 169.7 0.059 0.053 111.4
ATL 2960 214 195.1 0.072 0.066 109.7
TOR 3106 173 160.9 0.056 0.052 107.5
MIL 2938 188 175.8 0.064 0.060 107.0
KCA 3241 192 181.6 0.059 0.056 105.7
OAK 3163 167 159.8 0.053 0.051 104.5
TBA 3010 153 146.6 0.051 0.049 104.3
SFN 1447 91 87.9 0.063 0.061 103.5
CHA 3292 190 184.2 0.058 0.056 103.2
SEA 3408 170 165.3 0.050 0.049 102.8
FLO 3114 179 178.4 0.057 0.057 100.3
ANA 3147 166 168.6 0.053 0.054 98.5
CHN 3002 167 172.2 0.056 0.057 97.0
SLN 3192 178 186.4 0.056 0.058 95.5
PIT 1587 90 94.6 0.057 0.060 95.2
BOS 2956 134 141.0 0.045 0.048 95.1
DET 3346 149 157.7 0.045 0.047 94.5
PHI 3053 169 179.0 0.055 0.059 94.4
TEX 3166 153 162.6 0.048 0.051 94.1
CIN 3046 160 180.7 0.053 0.059 88.5
HOU 2882 164 186.5 0.057 0.065 88.0
NYA 3251 163 186.4 0.050 0.057 87.5
MIN 2983 166 190.7 0.056 0.064 87.0
WAS 3085 172 200.5 0.056 0.065 85.8

It’s tough to get a ball through the Padres mound.

The individuals:

Objective PMR, individual pitchers, 2011. Model built on data from 2005-2010, visiting teams only. 300 balls in play, minimum.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
R.A. Dickey 452 52 29.5 0.115 0.065 176.1
Tim Stauffer 453 45 25.8 0.099 0.057 174.6
Jeremy Guthrie 446 35 21.6 0.078 0.048 162.2
Ryan Dempster 439 38 24.1 0.087 0.055 157.9
Josh Tomlin 382 26 17.0 0.068 0.045 152.6
Hiroki Kuroda 436 38 26.3 0.087 0.060 144.7
Doug Fister 566 37 25.9 0.065 0.046 142.8
Livan Hernandez 407 35 24.9 0.086 0.061 140.3
Jake Westbrook 476 41 29.2 0.086 0.061 140.2
Justin Verlander 495 30 21.4 0.061 0.043 140.0
Jair Jurrjens 300 27 19.4 0.090 0.065 139.3
Jhoulys Chacin 392 37 26.9 0.094 0.069 137.7
Dustin Moseley 313 24 17.5 0.077 0.056 137.0
Jeff Niemann 317 22 16.1 0.069 0.051 136.8
Clayton Kershaw 355 30 22.0 0.085 0.062 136.5
Mat Latos 375 26 19.3 0.069 0.052 134.5
Jonathon Niese 375 36 27.8 0.096 0.074 129.6
Bruce Chen 322 24 18.5 0.075 0.058 129.5
Josh Collmenter 313 18 14.2 0.058 0.045 126.5
Matt Harrison 418 31 25.0 0.074 0.060 124.0
Randy Wolf 456 30 24.3 0.066 0.053 123.3
Dan Haren 512 32 26.0 0.063 0.051 123.2
Bronson Arroyo 476 32 26.0 0.067 0.055 122.8
Joe Saunders 474 35 28.7 0.074 0.060 122.1
Ricky Romero 432 35 28.7 0.081 0.066 122.0
Michael Pineda 352 18 14.8 0.051 0.042 121.6
Shaun Marcum 378 25 20.8 0.066 0.055 120.4
Jason Hammel 408 28 23.6 0.069 0.058 118.6
Carl Pavano 534 37 31.3 0.069 0.059 118.4
Ted Lilly 317 22 18.7 0.069 0.059 117.6
Ian Kennedy 405 23 19.6 0.057 0.048 117.3
Javier Vazquez 387 22 19.0 0.057 0.049 115.9
Bud Norris 341 24 20.9 0.070 0.061 114.8
Justin Masterson 522 31 27.2 0.059 0.052 114.1
Yovani Gallardo 411 28 25.1 0.068 0.061 111.8
Josh Beckett 363 19 17.1 0.052 0.047 111.2
Edwin Jackson 492 27 24.3 0.055 0.049 111.1
Mark Buehrle 503 35 31.6 0.070 0.063 110.6
Anibal Sanchez 387 24 21.7 0.062 0.056 110.4
Ubaldo Jimenez 400 25 22.8 0.063 0.057 109.6
Freddy Garcia 364 19 17.4 0.052 0.048 109.4
Felix Hernandez 556 30 27.4 0.054 0.049 109.3
Kyle Lohse 380 22 20.3 0.058 0.053 108.5
James Shields 489 27 24.9 0.055 0.051 108.3
Brandon McCarthy 424 22 20.3 0.052 0.048 108.3
Tim Wakefield 330 17 15.7 0.052 0.048 108.0
David Price 425 23 21.6 0.054 0.051 106.6
Chris Narveson 348 22 20.7 0.063 0.059 106.3
Gio Gonzalez 392 22 21.0 0.056 0.053 104.9
Nick Blackburn 380 23 22.0 0.061 0.058 104.7
Zachary Britton 360 23 22.2 0.064 0.062 103.6
Jeff Francis 468 31 30.0 0.066 0.064 103.4
Tyler Chatwood 330 19 18.4 0.058 0.056 103.1
Brad Penny 519 25 24.4 0.048 0.047 102.5
Fausto Carmona 497 27 26.5 0.054 0.053 101.8
Chris Capuano 358 25 24.7 0.070 0.069 101.0
Jason Vargas 448 22 21.9 0.049 0.049 100.6
Zack Greinke 329 20 20.0 0.061 0.061 99.9
Derek Holland 438 25 25.1 0.057 0.057 99.6
Chad Billingsley 378 24 24.1 0.063 0.064 99.4
Dillon Gee 346 21 21.2 0.061 0.061 99.2
John Lackey 405 17 17.2 0.042 0.042 98.8
John Danks 394 23 23.4 0.058 0.059 98.5
Tim Hudson 485 34 34.9 0.070 0.072 97.3
Trevor Cahill 532 28 28.8 0.053 0.054 97.2
Brandon Morrow 342 14 14.4 0.041 0.042 97.1
Jered Weaver 422 18 18.7 0.043 0.044 96.1
Derek Lowe 500 33 34.3 0.066 0.069 96.1
Chris Carpenter 531 27 28.3 0.051 0.053 95.5
Roy Halladay 525 29 30.6 0.055 0.058 94.9
Daniel Hudson 434 21 22.1 0.048 0.051 94.9
Luke Hochevar 476 25 26.4 0.053 0.055 94.7
Cole Hamels 418 25 27.0 0.060 0.065 92.5
Bartolo Colon 390 19 20.6 0.049 0.053 92.3
Mike Pelfrey 469 26 28.2 0.055 0.060 92.2
Wade Davis 413 17 18.9 0.041 0.046 90.0
C.J. Wilson 446 23 25.9 0.052 0.058 88.8
Roy Oswalt 327 17 19.2 0.052 0.059 88.4
Jeremy Hellickson 395 15 17.0 0.038 0.043 88.4
Johnny Cueto 335 19 21.5 0.057 0.064 88.4
Jo-Jo Reyes 321 16 18.1 0.050 0.056 88.3
John Lannan 457 28 31.9 0.061 0.070 87.8
Gavin Floyd 437 20 22.9 0.046 0.052 87.5
Cliff Lee 440 23 26.7 0.052 0.061 86.2
Ricky Nolasco 476 24 27.9 0.050 0.059 85.9
Ervin Santana 462 21 24.6 0.045 0.053 85.4
Aaron Harang 374 16 19.1 0.043 0.051 83.9
Jordan Zimmermann 314 16 19.1 0.051 0.061 83.6
Ivan Nova 438 22 27.2 0.050 0.062 80.7
Jon Lester 352 16 20.0 0.045 0.057 80.1
Rick Porcello 501 20 25.0 0.040 0.050 80.0
Chris Volstad 412 20 25.1 0.049 0.061 79.8
Joel Pineiro 384 16 20.3 0.042 0.053 78.9
Mike Leake 346 17 21.9 0.049 0.063 77.8
Matt Garza 418 18 23.3 0.043 0.056 77.2
Kyle McClellan 343 16 20.8 0.047 0.061 77.0
Philip Humber 383 16 20.9 0.042 0.055 76.4
Wandy Rodriguez 356 18 25.2 0.051 0.071 71.4
Alexi Ogando 414 12 17.2 0.029 0.042 69.7
A.J. Burnett 449 18 26.6 0.040 0.059 67.7
Colby Lewis 423 12 18.3 0.028 0.043 65.5
Brett Myers 445 19 29.0 0.043 0.065 65.5
CC Sabathia 430 20 30.7 0.047 0.071 65.2
Brian Duensing 345 16 26.6 0.046 0.077 60.1
Carlos Zambrano 331 10 17.9 0.030 0.054 55.9
Jaime Garcia 452 16 30.6 0.035 0.068 52.3
Jason Marquis 318 11 21.3 0.035 0.067 51.5
Max Scherzer 448 9 19.1 0.020 0.043 47.0

Like catchers, the small number of balls that are fielded by pitchers means you should take these numbers with a big grain of salt. It is interesting to note that with all his other problems, A.J. Burnett did little with his glove to help himself. On the other hand, few on the Yankees staff shone on defense.

Their cross town rivals, the Mets, did very well. I wonder if R.A.Dickey’s knuckle ball induces a lot of easy comebackers?

March 6, 2012

Objective PMR, Catchers

The series on objective probabilistic model of range (PMR) continues by looking at catchers. Compared to the players standing behind the pitcher, range is a small component of a catcher’s defensive value. I’ll show teams as a whole at the position, plus individuals who were on the field for 750 balls in play. First the teams:

Objective PMR, team catchers, 2011. Model built on data from 2005-2010, visiting teams only.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
BAL 2754 59 45.8 0.021 0.017 128.8
TBA 3195 53 44.2 0.017 0.014 120.0
SEA 3171 55 46.2 0.017 0.015 119.2
PHI 3274 68 57.9 0.021 0.018 117.4
ANA 2794 49 42.5 0.018 0.015 115.3
CLE 2935 49 42.9 0.017 0.015 114.1
TEX 2721 51 45.7 0.019 0.017 111.6
OAK 2981 55 49.8 0.018 0.017 110.4
SLN 3294 56 50.8 0.017 0.015 110.2
FLO 3326 59 54.6 0.018 0.016 108.1
TOR 2830 44 41.4 0.016 0.015 106.2
NYN 3445 62 59.4 0.018 0.017 104.3
CHN 3084 53 50.9 0.017 0.017 104.1
SFN 1589 28 27.0 0.018 0.017 103.8
KCA 2889 45 43.7 0.016 0.015 103.1
MIL 3374 52 50.7 0.015 0.015 102.5
ARI 3269 61 59.9 0.019 0.018 101.8
ATL 3342 49 48.2 0.015 0.014 101.6
LAN 3015 58 57.2 0.019 0.019 101.3
BOS 2847 44 43.9 0.015 0.015 100.3
NYA 2855 46 47.8 0.016 0.017 96.2
HOU 3148 50 53.0 0.016 0.017 94.4
SDN 2957 49 52.0 0.017 0.018 94.3
DET 2573 36 38.3 0.014 0.015 94.0
MIN 2709 43 47.2 0.016 0.017 91.1
PIT 1679 26 28.9 0.015 0.017 90.1
WAS 3303 48 53.4 0.015 0.016 89.9
CIN 3267 50 59.4 0.015 0.018 84.2
COL 3401 40 51.8 0.012 0.015 77.3
CHA 2829 34 47.3 0.012 0.017 72.0

Note that catchers are involved in a small number of plays, so in addition to range being a small part of their value, the number for them are going to be highly variable. That said, the Orioles combination of Matt Wieters and Craig Tatum did a nice job for the Orioles. The White Sox, on the other hand did not get much range from their catchers. A.J. Pierzynski and Ramon Castro were both rather old.

The individuals:

Objective PMR, individual catchers, 2011. Model built on data from 2005-2010, visiting teams only. 750 balls in play, minimum.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Rob Johnson 1025 26 18.7 0.025 0.018 139.2
Lou Marson 1365 28 20.2 0.021 0.015 138.3
Miguel Olivo 2331 46 33.3 0.020 0.014 138.2
Mike Napoli 945 21 15.4 0.022 0.016 136.8
Josh Thole 1853 44 32.4 0.024 0.017 135.9
J.P. Arencibia 2048 39 30.2 0.019 0.015 129.1
Hank Conger 845 16 12.8 0.019 0.015 125.2
Geovany Soto 2218 44 36.2 0.020 0.016 121.5
Matt Wieters 2128 41 34.5 0.019 0.016 118.9
Miguel Montero 2641 56 48.4 0.021 0.018 115.6
Carlos Ruiz 2339 47 42.3 0.020 0.018 111.2
Ramon Hernandez 1491 29 26.3 0.019 0.018 110.1
Russell Martin 2075 37 33.6 0.018 0.016 110.1
Rod Barajas 1522 32 29.2 0.021 0.019 109.6
Kurt Suzuki 2327 43 39.3 0.018 0.017 109.5
J.R. Towles 857 16 14.8 0.019 0.017 108.0
Yorvit Torrealba 1569 29 27.0 0.018 0.017 107.3
Kelly Shoppach 1384 22 20.6 0.016 0.015 106.7
Brian McCann 2450 37 34.7 0.015 0.014 106.5
John Jaso 1329 19 17.9 0.014 0.013 106.2
John Buck 2572 45 42.5 0.017 0.017 105.9
Dioner Navarro 912 18 17.2 0.020 0.019 104.9
Jeff Mathis 1341 22 21.3 0.016 0.016 103.3
Yadier Molina 2604 41 40.2 0.016 0.015 102.1
Jarrod Saltalamacchia 1734 27 26.6 0.016 0.015 101.5
Matt Treanor 1144 17 17.1 0.015 0.015 99.5
Alex Avila 2058 30 30.7 0.015 0.015 97.8
Humberto Quintero 1438 23 23.8 0.016 0.017 96.8
Jason Varitek 1069 16 16.8 0.015 0.016 95.4
David Ross 850 12 12.6 0.014 0.015 95.3
Carlos Santana 1570 21 22.7 0.013 0.014 92.5
Drew Butera 1341 21 23.3 0.016 0.017 90.2
Wilson Ramos 2216 31 35.2 0.014 0.016 88.0
Brayan Pena 1068 13 15.4 0.012 0.014 84.7
A.J. Pierzynski 1972 27 32.6 0.014 0.017 82.9
Jonathan Lucroy 2365 29 35.1 0.012 0.015 82.6
Carlos Corporan 815 11 13.8 0.013 0.017 80.0
Chris Iannetta 2208 26 33.2 0.012 0.015 78.3
Ronny Paulino 1239 15 20.3 0.012 0.016 73.9
Nick Hundley 1271 17 23.0 0.013 0.018 73.8
Koyie Hill 756 8 12.9 0.011 0.017 62.0
Ryan Hanigan 1530 16 28.3 0.010 0.018 56.6
Jose Molina 782 5 11.2 0.006 0.014 44.5

You can see that indeed, Wieters did a good job behind the plate, although Miguel Olivo comes out on top among starters. Mike Napoli’s good result shows how a small sample size can fool you, however. I don’t think most observers seem him as a great defensive catcher. Take this particular category with a grain of salt.

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at 20 Sunset Rd., Newark, DE 19711.

March 1, 2012

Objective PMR, First Basemen

The series on objective probabilistic model of range (PMR) continues by looking at first basemen. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team first basemen, 2011. Model built on data from 2005-2010, visiting teams only.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
NYN 3180 350 283.0 0.110 0.089 123.7
LAN 2739 289 251.6 0.106 0.092 114.9
OAK 3163 347 306.0 0.110 0.097 113.4
BOS 2977 341 302.4 0.115 0.102 112.8
ANA 3160 341 317.1 0.108 0.100 107.5
SLN 3192 300 281.5 0.094 0.088 106.6
HOU 2958 288 270.1 0.097 0.091 106.6
MIN 3124 308 289.3 0.099 0.093 106.5
CIN 3052 308 294.7 0.101 0.097 104.5
WAS 3166 307 297.1 0.097 0.094 103.3
SFN 1445 142 139.7 0.098 0.097 101.7
COL 3038 305 302.0 0.100 0.099 101.0
CLE 3338 326 325.2 0.098 0.097 100.2
FLO 3111 295 297.4 0.095 0.096 99.2
NYA 3169 297 301.4 0.094 0.095 98.5
SEA 3178 284 292.0 0.089 0.092 97.3
KCA 3210 291 300.5 0.091 0.094 96.8
TBA 2988 298 309.7 0.100 0.104 96.2
PHI 3079 288 305.0 0.094 0.099 94.4
MIL 2966 271 289.0 0.091 0.097 93.8
CHA 3266 278 298.1 0.085 0.091 93.3
CHN 3016 252 275.7 0.084 0.091 91.4
BAL 3161 279 306.4 0.088 0.097 91.1
ATL 2968 268 295.6 0.090 0.100 90.6
TOR 3194 268 297.7 0.084 0.093 90.0
SDN 2887 254 283.6 0.088 0.098 89.6
DET 3103 276 310.2 0.089 0.100 89.0
PIT 1612 128 147.0 0.079 0.091 87.1
ARI 2953 242 278.6 0.082 0.094 86.8
TEX 3038 234 273.3 0.077 0.090 85.6

We found a defense position where the Mets are good! Of course, it’s the least important position in terms of defense. Oakland’s great defense at the position hardly made up for their combined .219/.294/.316 batting line in 2011.

The individuals:

Objective PMR, individual first basemen, 2011. Model built on data from 2005-2010, visiting teams only. 1000 balls in play, minimum.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Daric Barton 1227 149 120.2 0.121 0.098 124.0
James Loney 2331 251 212.2 0.108 0.091 118.3
Adrian Gonzalez 2768 316 279.7 0.114 0.101 113.0
Mark Trumbo 2705 302 274.3 0.112 0.101 110.1
Albert Pujols 2732 263 240.0 0.096 0.088 109.6
Todd Helton 2140 226 211.7 0.106 0.099 106.7
Carlos Lee 1240 122 114.4 0.098 0.092 106.6
Joey Votto 2964 302 286.2 0.102 0.097 105.5
Brett Wallace 1634 157 149.2 0.096 0.091 105.3
Carlos Santana 1299 129 123.4 0.099 0.095 104.5
Mark Teixeira 2788 272 265.7 0.098 0.095 102.4
Gaby Sanchez 2899 275 277.3 0.095 0.096 99.2
Eric Hosmer 2482 227 234.3 0.091 0.094 96.9
Justin Smoak 1970 178 183.9 0.090 0.093 96.8
Matt LaPorta 1880 179 185.6 0.095 0.099 96.4
Mike Morse 1593 145 150.4 0.091 0.094 96.4
Ryan Howard 2724 262 274.6 0.096 0.101 95.4
Prince Fielder 2850 263 278.6 0.092 0.098 94.4
Paul Konerko 2221 185 201.0 0.083 0.090 92.1
Carlos Pena 2666 224 243.4 0.084 0.091 92.0
Derrek Lee 1789 166 180.4 0.093 0.101 92.0
Mitch Moreland 1715 142 156.4 0.083 0.091 90.8
Justin Morneau 1074 94 104.6 0.088 0.097 89.9
Casey Kotchman 2492 233 259.5 0.093 0.104 89.8
Adam Lind 2115 178 198.2 0.084 0.094 89.8
Freddie Freeman 2752 243 272.2 0.088 0.099 89.3
Lyle Overbay 1191 93 109.0 0.078 0.092 85.3
Miguel Cabrera 2864 242 286.4 0.084 0.100 84.5

It looks like Prince Fielder will be an improvement over Miguel Cabrera at first base for the Tigers, but no range at first may be terrible range at third base. Albert Pujols may have lost a step, as he is usually head and shoulders above the other first basemen defensively. He remains very good, however, especially given his bat. Adrian Gonzalez may have moved ahead of Albert in terms of total package, offense and defense, among first basemen.

The Baseball Musings Pledge Drive continues through March. Please make a donation!

February 29, 2012

Objective PMR, Leftfielders

The series on objective probabilistic model of range (PMR) continues by looking at leftfielders. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team leftfielders, 2011. Model built on data from 2005-2010, visiting teams only.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
NYA 2947 363 297.9 0.123 0.101 121.8
ARI 2870 335 304.0 0.117 0.106 110.2
TBA 3455 358 325.0 0.104 0.094 110.1
KCA 3740 353 323.7 0.094 0.087 109.1
LAN 2766 272 253.8 0.098 0.092 107.2
WAS 2546 305 288.1 0.120 0.113 105.9
SEA 3586 350 331.2 0.098 0.092 105.7
ATL 2983 285 270.5 0.096 0.091 105.4
DET 3106 349 332.2 0.112 0.107 105.0
SFN 1492 150 143.3 0.101 0.096 104.7
BOS 3106 313 302.9 0.101 0.098 103.3
HOU 3304 259 252.8 0.078 0.076 102.5
MIL 3285 263 257.8 0.080 0.078 102.0
ANA 3816 350 343.5 0.092 0.090 101.9
CIN 3424 290 285.0 0.085 0.083 101.8
TEX 3116 309 305.7 0.099 0.098 101.1
MIN 2807 330 329.1 0.118 0.117 100.3
BAL 3638 353 356.7 0.097 0.098 99.0
CHA 3520 310 317.1 0.088 0.090 97.8
CLE 3817 319 329.0 0.084 0.086 97.0
SDN 2880 273 282.5 0.095 0.098 96.6
PIT 1662 128 133.5 0.077 0.080 95.9
FLO 3273 282 294.7 0.086 0.090 95.7
NYN 2664 276 290.0 0.104 0.109 95.2
TOR 3572 297 313.1 0.083 0.088 94.9
CHN 2998 247 262.5 0.082 0.088 94.1
PHI 2507 242 257.1 0.097 0.103 94.1
OAK 3261 285 310.9 0.087 0.095 91.7
SLN 3294 262 287.6 0.080 0.087 91.1
COL 3336 220 263.5 0.066 0.079 83.5

I can’t say I’m surprised to see the Yankees and Rays near the top. Brett Gardner is known for his defense, and Sam Fuld made some amazing plays for the Rays before Desmond Jennings moved into the lineup.

The individuals:

Objective PMR, individual leftfielders, 2011. Model built on data from 2005-2010, visiting teams only. 1000 balls in play, minimum.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Brett Gardner 2387 294 237.4 0.123 0.099 123.9
Gerardo Parra 2051 244 216.6 0.119 0.106 112.7
Vernon Wells 2408 241 214.8 0.100 0.089 112.2
Sam Fuld 1505 162 146.7 0.108 0.097 110.4
Josh Hamilton 1472 153 140.7 0.104 0.096 108.7
Ryan Braun 2826 237 219.8 0.084 0.078 107.8
Alex Gordon 3419 313 292.1 0.092 0.085 107.2
Martin Prado 1656 165 158.4 0.100 0.096 104.2
Tony Gwynn 1165 110 107.4 0.094 0.092 102.5
Nolan Reimold 1524 146 143.4 0.096 0.094 101.8
Jonny Gomes 1269 112 110.8 0.088 0.087 101.1
Desmond Jennings 1000 100 98.9 0.100 0.099 101.1
Carl Crawford 2342 235 233.2 0.100 0.100 100.8
Carlos Lee 1509 113 114.6 0.075 0.076 98.6
Delmon Young 1886 208 214.2 0.110 0.114 97.1
Jason Bay 1922 204 210.3 0.106 0.109 97.0
David Murphy 1278 123 127.0 0.096 0.099 96.8
Logan Morrison 2295 197 207.9 0.086 0.091 94.7
Juan Pierre 3282 278 295.0 0.085 0.090 94.2
Michael Brantley 1396 114 121.5 0.082 0.087 93.8
Alfonso Soriano 2062 169 181.2 0.082 0.088 93.3
Raul Ibanez 1983 195 209.1 0.098 0.105 93.2
Eric Thames 1132 91 99.1 0.080 0.088 91.8
Josh Willingham 1862 155 170.5 0.083 0.092 90.9
Ryan Ludwick 1864 155 173.6 0.083 0.093 89.3
Matt Holliday 2310 182 204.0 0.079 0.088 89.2
Carlos Gonzalez 1094 76 87.4 0.069 0.080 86.9

Gardner and Fuld were indeed very good, and Desmond Jennings held his own. I’m impressed with Carlos Lee’s ranking. At this point, I would think he would be old and slow, but it looks like he gets to most of the balls that he should catch. Juan Pierre, Alfonso Soriano, Raul Ibanez, and Matt Holliday all lost a step.

February 29, 2012

Objective PMR, Rightfielders

The series on objective probabilistic model of range (PMR) continues by looking at rightfielders. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team rightfielders, 2011. Model built on data from 2005-2010, visiting teams only.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
CLE 3675 356 298.5 0.097 0.081 119.2
CHN 3301 329 296.1 0.100 0.090 111.1
FLO 3130 321 289.3 0.103 0.092 110.9
OAK 2719 331 306.7 0.122 0.113 107.9
SDN 3411 315 293.1 0.092 0.086 107.5
TOR 3101 343 322.8 0.111 0.104 106.2
ARI 3522 322 303.1 0.091 0.086 106.2
MIN 3014 334 316.1 0.111 0.105 105.7
NYA 3035 332 319.4 0.109 0.105 104.0
SLN 3538 313 302.1 0.088 0.085 103.6
HOU 3433 307 297.6 0.089 0.087 103.2
BOS 3337 326 316.8 0.098 0.095 102.9
WAS 2956 329 320.6 0.111 0.108 102.6
CHA 3094 297 292.6 0.096 0.095 101.5
ATL 3449 308 303.6 0.089 0.088 101.4
KCA 3616 348 344.1 0.096 0.095 101.1
TEX 2773 327 329.0 0.118 0.119 99.4
MIL 3439 304 307.7 0.088 0.089 98.8
CIN 3639 306 313.2 0.084 0.086 97.7
DET 3089 304 314.8 0.098 0.102 96.6
ANA 3269 310 322.3 0.095 0.099 96.2
TBA 2890 312 324.2 0.108 0.112 96.2
COL 3521 276 288.3 0.078 0.082 95.7
LAN 3232 278 295.7 0.086 0.091 94.0
PIT 1774 144 155.4 0.081 0.088 92.7
PHI 3117 259 281.1 0.083 0.090 92.1
BAL 3662 312 339.9 0.085 0.093 91.8
SFN 1635 144 159.5 0.088 0.098 90.3
NYN 3354 280 323.1 0.083 0.096 86.7
SEA 3439 278 325.0 0.081 0.095 85.5

The Mets just keep coming up as a bad defensive team. At least that’s an area they likely can improve without a huge cost.

The individuals:

Objective PMR, individual rightfielders, 2011. Model built on data from 2005-2010, visiting teams only. 1000 balls in play, minimum.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Kosuke Fukudome 2564 263 221.9 0.103 0.087 118.5
Shin-Soo Choo 1895 172 153.6 0.091 0.081 112.0
Mike Stanton 2644 271 245.6 0.102 0.093 110.3
Chris Denorfia 1090 98 88.9 0.090 0.082 110.2
David DeJesus 1831 231 209.8 0.126 0.115 110.1
Jayson Werth 2411 277 259.4 0.115 0.108 106.8
Jason Heyward 2308 210 197.6 0.091 0.086 106.3
Justin Upton 3346 306 288.0 0.091 0.086 106.3
Nick Swisher 2516 273 259.8 0.109 0.103 105.1
Jose Bautista 2188 233 221.7 0.106 0.101 105.1
J.D. Drew 1387 142 136.0 0.102 0.098 104.4
Nelson Cruz 1667 216 207.0 0.130 0.124 104.4
Michael Cuddyer 1317 139 134.7 0.106 0.102 103.2
Will Venable 1512 133 129.7 0.088 0.086 102.5
Jeff Francoeur 3385 328 320.4 0.097 0.095 102.4
Carlos Quentin 1784 177 174.3 0.099 0.098 101.5
Corey Hart 2627 236 232.7 0.090 0.089 101.4
Ben Francisco 1007 86 86.0 0.085 0.085 100.0
Seth Smith 2037 166 167.5 0.081 0.082 99.1
Torii Hunter 2637 259 262.7 0.098 0.100 98.6
Jay Bruce 3399 285 290.3 0.084 0.085 98.2
Andre Ethier 2507 221 226.7 0.088 0.090 97.5
Hunter Pence 3083 259 271.8 0.084 0.088 95.3
Lance Berkman 2103 160 169.3 0.076 0.081 94.5
Matthew Joyce 2005 211 228.1 0.105 0.114 92.5
Nick Markakis 3544 301 328.9 0.085 0.093 91.5
Carlos Beltran 2039 169 193.4 0.083 0.095 87.4
Ichiro Suzuki 3225 263 306.8 0.082 0.095 85.7
Magglio Ordonez 1056 87 104.7 0.082 0.099 83.1

Asian players round out the extremes, with Shin-Soo Choo and Kosuke Fukudome at the top, and Ichiro at the bottom. This may be another example of Ichiro losing a step as he ages. The good number posted by Jayson Werth may mean he’s not so bad if and when the Nationals stick him in centerfield. Amazingly, Carlos Beltran comes in worse than Lance Berkman. Maybe the Cardinals are moving the wrong player to first base. 🙂

February 28, 2012

Objective PMR, Third Baseman

The series on objective probabilistic model of range (PMR) continues by looking at third basemen. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team third baseman, 2011. Model built on data from 2005-2010, visiting teams only.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
CLE 3325 454 383.9 0.137 0.115 118.3
PIT 1615 233 202.8 0.144 0.126 114.9
TOR 3190 468 407.2 0.147 0.128 114.9
DET 3105 406 354.6 0.131 0.114 114.5
TBA 3007 427 374.2 0.142 0.124 114.1
ANA 3171 408 364.9 0.129 0.115 111.8
TEX 3038 396 377.9 0.130 0.124 104.8
NYA 3174 379 368.5 0.119 0.116 102.8
CIN 3052 376 367.0 0.123 0.120 102.4
KCA 3202 393 385.4 0.123 0.120 102.0
ARI 2953 398 390.8 0.135 0.132 101.8
BOS 2976 391 385.4 0.131 0.129 101.5
SFN 1447 178 176.8 0.123 0.122 100.7
WAS 3205 383 380.4 0.120 0.119 100.7
SDN 2888 343 352.4 0.119 0.122 97.3
MIL 2959 339 348.9 0.115 0.118 97.2
PHI 3085 399 415.7 0.129 0.135 96.0
LAN 2733 323 337.4 0.118 0.123 95.7
SEA 3326 373 389.8 0.112 0.117 95.7
COL 3037 335 352.8 0.110 0.116 95.0
ATL 2974 343 362.3 0.115 0.122 94.7
SLN 3193 356 391.0 0.111 0.122 91.0
BAL 3162 324 363.2 0.102 0.115 89.2
CHA 3380 369 414.6 0.109 0.123 89.0
OAK 3169 341 387.6 0.108 0.122 88.0
MIN 2963 343 396.4 0.116 0.134 86.5
NYN 3214 354 410.8 0.110 0.128 86.2
FLO 3118 313 368.9 0.100 0.118 84.8
CHN 3021 297 358.8 0.098 0.119 82.8
HOU 2959 305 369.2 0.103 0.125 82.6

I was somewhat worried about Miami having poor left-side defense, but they were already poor last year. Also, Chone Figgins‘s return to third base did not appear to help the Mariner’s defense.

The individuals:

Objective PMR, individual third baseman, 2011. Model built on data from 2005-2010, visiting teams only. 1000 balls in play, minimum.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Jack Hannahan 1900 267 212.3 0.141 0.112 125.8
Brandon Inge 1562 219 179.8 0.140 0.115 121.8
Evan Longoria 2305 330 285.1 0.143 0.124 115.7
Alberto Callaspo 2474 327 286.8 0.132 0.116 114.0
Lonnie Chisenhall 1068 150 132.1 0.140 0.124 113.6
Adrian Beltre 2023 277 254.8 0.137 0.126 108.7
Alex Rodriguez 1659 206 192.9 0.124 0.116 106.8
Mike Moustakas 1664 211 203.0 0.127 0.122 103.9
Scott Rolen 1098 133 129.2 0.121 0.118 102.9
Ryan Roberts 1858 244 241.4 0.131 0.130 101.1
Ryan Zimmerman 2065 246 244.1 0.119 0.118 100.8
Wilson Betemit 1582 186 185.6 0.118 0.117 100.2
Placido Polanco 2187 294 295.9 0.134 0.135 99.4
Kevin Youkilis 1913 241 243.3 0.126 0.127 99.1
Casey McGehee 2570 296 300.1 0.115 0.117 98.6
Kevin Kouzmanoff 1213 143 151.9 0.118 0.125 94.2
Chase Headley 1877 213 232.7 0.113 0.124 91.5
David Freese 1497 164 180.7 0.110 0.121 90.8
Chone Figgins 1536 160 180.0 0.104 0.117 88.9
Brent Morel 2456 265 300.4 0.108 0.122 88.2
Daniel Descalso 1397 151 172.4 0.108 0.123 87.6
Mark Reynolds 2144 209 239.5 0.097 0.112 87.3
Ty Wigginton 1146 111 128.3 0.097 0.112 86.5
Danny Valencia 2667 303 355.7 0.114 0.133 85.2
Chris Johnson 1869 195 233.0 0.104 0.125 83.7
Scott Sizemore 1706 176 210.2 0.103 0.123 83.7
David Wright 2102 225 269.2 0.107 0.128 83.6
Chipper Jones 2016 208 249.3 0.103 0.124 83.4
Greg Dobbs 1602 159 195.0 0.099 0.122 81.5
Aramis Ramirez 2590 245 309.3 0.095 0.119 79.2

Jack Hannahan isn’t much of an offensive player, and when that’s the case you better be a wizard with the glove. He fit that bill in 2011. With Inge and Longoria ranked second and third, and I’m very happy with the results this produced. I am somewhat surprised Aramis Ramirez ranked so low, although he is getting up in age. Fangraphs ranks his fielding low, however, so maybe it’s not that surprising.

February 28, 2012

Objective PMR, Centerfielders

The series on objective probabilistic model of range (PMR) continues by looking at centerfielders. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
PIT 1477 209 184.0 0.142 0.125 113.6
CHA 3206 424 389.9 0.132 0.122 108.7
MIN 2968 459 422.9 0.155 0.142 108.5
MIL 2760 412 381.4 0.149 0.138 108.0
SLN 2792 391 361.9 0.140 0.130 108.0
HOU 2696 388 359.5 0.144 0.133 107.9
COL 2300 388 362.6 0.169 0.158 107.0
PHI 2610 366 343.3 0.140 0.132 106.6
CHN 2882 362 344.6 0.126 0.120 105.1
NYN 2677 389 376.3 0.145 0.141 103.4
SEA 2513 434 421.7 0.173 0.168 102.9
SDN 3021 390 383.4 0.129 0.127 101.7
CIN 2799 377 373.4 0.135 0.133 101.0
ATL 2926 369 367.3 0.126 0.126 100.5
OAK 2361 395 393.6 0.167 0.167 100.4
FLO 3232 384 384.7 0.119 0.119 99.8
DET 3376 422 425.6 0.125 0.126 99.2
ANA 3505 416 420.5 0.119 0.120 98.9
TBA 3204 420 426.9 0.131 0.133 98.4
SFN 1357 195 199.4 0.144 0.147 97.8
BOS 3338 407 420.0 0.122 0.126 96.9
WAS 2771 396 410.4 0.143 0.148 96.5
CLE 3701 390 404.4 0.105 0.109 96.4
TEX 2714 398 415.4 0.147 0.153 95.8
NYA 2624 387 411.3 0.147 0.157 94.1
LAN 2784 337 360.0 0.121 0.129 93.6
BAL 2996 421 452.3 0.141 0.151 93.1
TOR 3064 392 432.2 0.128 0.141 90.7
ARI 3067 382 423.4 0.125 0.138 90.2
KCA 3188 373 426.3 0.117 0.134 87.5

The individuals:

Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Andrew McCutchen 1325 198 167.0 0.149 0.126 118.6
Jon Jay 1033 153 130.9 0.148 0.127 116.9
Denard Span 1251 206 176.5 0.165 0.141 116.7
Jordan Schafer 1225 181 158.4 0.148 0.129 114.2
Carlos Gomez 1119 179 157.2 0.160 0.141 113.8
Dexter Fowler 1656 283 258.0 0.171 0.156 109.7
Nyjer Morgan 1354 202 186.3 0.149 0.138 108.5
Shane Victorino 1998 284 262.7 0.142 0.131 108.1
Marlon Byrd 1884 248 234.0 0.132 0.124 106.0
Ben Revere 1563 237 224.4 0.152 0.144 105.6
Alex Rios 2712 349 332.5 0.129 0.123 105.0
Cameron Maybin 2386 318 303.5 0.133 0.127 104.8
Coco Crisp 1800 311 300.1 0.173 0.167 103.6
Franklin Gutierrez 1310 237 228.9 0.181 0.175 103.5
Angel Pagan 1965 278 269.4 0.141 0.137 103.2
Drew Stubbs 2540 345 339.4 0.136 0.134 101.7
Michael Bourn 2677 354 349.3 0.132 0.130 101.3
Austin Jackson 2915 372 367.6 0.128 0.126 101.2
Rick Ankiel 1449 223 221.9 0.154 0.153 100.5
Chris Coghlan 1273 145 145.7 0.114 0.114 99.5
Colby Rasmus 2124 290 295.8 0.137 0.139 98.0
B.J. Upton 2911 382 391.2 0.131 0.134 97.6
Jacoby Ellsbury 3101 377 386.9 0.122 0.125 97.4
Ezequiel Carrera 1091 112 115.3 0.103 0.106 97.2
Michael Brantley 1108 118 122.0 0.106 0.110 96.7
Peter Bourjos 3054 351 363.3 0.115 0.119 96.6
Grady Sizemore 1138 119 125.0 0.105 0.110 95.2
Curtis Granderson 2457 354 383.5 0.144 0.156 92.3
Adam Jones 2693 377 408.7 0.140 0.152 92.3
Matt Kemp 2680 320 346.9 0.119 0.129 92.2
Chris Young 2939 367 405.8 0.125 0.138 90.4
Rajai Davis 1390 174 194.7 0.125 0.140 89.4
Melky Cabrera 2759 316 371.9 0.115 0.135 85.0

The thing that surprises me the most is Peter Bourjos’s ranking. He’s supposed to be really fast, but it could be his judgement hasn’t developed yet. FanGraphs does rate him much higher, however.

The Giants are going to be in big trouble if it turns out that Melky Cabrera’s hitting was a one-year wonder. They’ll end up with a centerfielder who can neither hit nor field.

February 27, 2012

Objective PMR, Second Basemen

The series on objective probabilistic model of range (PMR) continues by looking at second basemen. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team second baseman, 2011. Model built on data from 2005-2010, visiting teams only.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
FLO 3122 589 493.3 0.189 0.158 119.4
PHI 3099 528 471.3 0.170 0.152 112.0
NYA 3208 539 490.6 0.168 0.153 109.9
SLN 3208 510 466.2 0.159 0.145 109.4
WAS 3219 503 471.9 0.156 0.147 106.6
CHA 3282 546 521.5 0.166 0.159 104.7
TEX 3048 512 490.4 0.168 0.161 104.4
ARI 2955 449 429.9 0.152 0.145 104.4
CHN 3024 480 460.7 0.159 0.152 104.2
BOS 3018 496 483.3 0.164 0.160 102.6
SEA 3189 514 502.0 0.161 0.157 102.4
NYN 3223 503 492.8 0.156 0.153 102.1
TOR 3213 509 502.3 0.158 0.156 101.3
OAK 3168 519 512.8 0.164 0.162 101.2
HOU 2964 437 432.4 0.147 0.146 101.1
TBA 3018 494 490.7 0.164 0.163 100.7
MIL 2969 440 439.0 0.148 0.148 100.2
COL 3044 468 468.2 0.154 0.154 100.0
CIN 3064 463 464.1 0.151 0.151 99.8
MIN 3249 532 536.1 0.164 0.165 99.2
ATL 3035 463 467.7 0.153 0.154 99.0
LAN 2748 422 432.0 0.154 0.157 97.7
ANA 3196 527 540.1 0.165 0.169 97.6
KCA 3224 507 524.8 0.157 0.163 96.6
CLE 3366 519 538.9 0.154 0.160 96.3
DET 3132 486 510.9 0.155 0.163 95.1
BAL 3416 492 521.3 0.144 0.153 94.4
SDN 2890 438 467.2 0.152 0.162 93.8
SFN 1457 211 226.5 0.145 0.155 93.2
PIT 1622 209 238.3 0.129 0.147 87.7

While the Yankees and Marlins didn’t produce much range at shortstop, they made up for it with great range at second base. Philadelphia remains high despite Chase Utley‘s injury. Employing veteran second basemen didn’t help the Giants and Padres much.

The individuals:

Objective PMR, individual second basemen, 2011. Model built on data from 2005-2010, visiting teams only.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Omar Infante 2730 526 433.5 0.193 0.159 121.3
Chase Utley 1778 313 266.3 0.176 0.150 117.5
Skip Schumaker 1608 262 229.5 0.163 0.143 114.2
Robinson Cano 2954 495 450.5 0.168 0.153 109.9
Darwin Barney 2265 378 344.2 0.167 0.152 109.8
Ian Kinsler 2694 466 430.3 0.173 0.160 108.3
Danny Espinosa 3096 489 454.9 0.158 0.147 107.5
Gordon Beckham 2918 494 463.1 0.169 0.159 106.7
Mark Ellis 2313 395 371.5 0.171 0.161 106.3
Dustin Pedroia 2874 476 458.8 0.166 0.160 103.7
Jeff Keppinger 1055 165 159.7 0.156 0.151 103.3
Jemile Weeks 1857 303 294.3 0.163 0.158 103.0
Alexi Casilla 1003 176 172.1 0.175 0.172 102.2
Aaron Hill 2588 410 401.9 0.158 0.155 102.0
Justin Turner 1461 221 218.2 0.151 0.149 101.3
Orlando Cabrera 1582 260 257.4 0.164 0.163 101.0
Ben Zobrist 2206 359 358.2 0.163 0.162 100.2
Dan Uggla 2943 455 455.6 0.155 0.155 99.9
Chris Getz 1961 316 318.2 0.161 0.162 99.3
Dustin Ackley 1706 267 269.4 0.157 0.158 99.1
Brandon Phillips 2783 412 418.9 0.148 0.151 98.4
Howie Kendrick 1999 326 332.8 0.163 0.166 98.0
Orlando Hudson 1848 305 312.9 0.165 0.169 97.5
Rickie Weeks 2035 286 295.9 0.141 0.145 96.6
Kelly Johnson 2583 364 377.1 0.141 0.146 96.5
Robert Andino 1744 250 259.7 0.143 0.149 96.3
Jamey Carroll 1020 153 159.9 0.150 0.157 95.7
Neil Walker 1550 202 226.9 0.130 0.146 89.0

I’m very impressed that Chase held up so well despite his age and injuries. Orlando Hudson lost a lot of ground over the years. Looking at the list, there are quite a few second basemen who hit and field well, especially Utley, Cano, Kinsler, and Pedroia. We’ll see how far up the list Jemile Weeks and Dustin Ackley move playing a full season.

February 27, 2012

Objective PMR, Shortstops

The series on objective probabilistic model of range (PMR) continues by looking at shortstops. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team shortstops, 2011. Model built on data from 2005-2010, visiting teams only.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
BAL 3200 585 505.6 0.183 0.158 115.7
CIN 3084 537 482.9 0.174 0.157 111.2
TEX 3057 539 491.0 0.176 0.161 109.8
ATL 3010 525 482.3 0.174 0.160 108.9
CHN 3045 519 477.8 0.170 0.157 108.6
SEA 3195 540 498.9 0.169 0.156 108.2
SFN 1479 249 230.7 0.168 0.156 107.9
ANA 3196 537 510.6 0.168 0.160 105.2
COL 3060 514 488.7 0.168 0.160 105.2
SDN 2917 484 461.5 0.166 0.158 104.9
DET 3128 501 478.3 0.160 0.153 104.7
PIT 1628 278 265.8 0.171 0.163 104.6
CHA 3288 526 506.3 0.160 0.154 103.9
HOU 2984 495 477.4 0.166 0.160 103.7
TBA 3245 486 476.2 0.150 0.147 102.1
KCA 3225 539 529.0 0.167 0.164 101.9
WAS 3218 518 508.3 0.161 0.158 101.9
PHI 3262 507 498.5 0.155 0.153 101.7
ARI 3211 494 489.9 0.154 0.153 100.8
TOR 3212 505 503.8 0.157 0.157 100.2
MIL 2992 461 462.3 0.154 0.155 99.7
LAN 2776 443 449.9 0.160 0.162 98.5
SLN 3238 489 505.4 0.151 0.156 96.8
BOS 3021 469 489.2 0.155 0.162 95.9
MIN 3261 505 538.7 0.155 0.165 93.7
OAK 3185 476 515.2 0.149 0.162 92.4
FLO 3381 440 481.2 0.130 0.142 91.4
CLE 3374 469 535.2 0.139 0.159 87.6
NYN 3271 486 558.0 0.149 0.171 87.1
NYA 3190 426 499.3 0.134 0.157 85.3

As you can see, it was a bad year to be a shortstop in New York. You can also see that in the individual numbers:

Objective PMR, individual shortstops, 2011. Model built on data from 2005-2010, visiting teams only.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Paul Janish 1528 272 238.3 0.178 0.156 114.2
Brendan Ryan 2346 408 364.6 0.174 0.155 111.9
Alex Gonzalez 2667 476 427.2 0.178 0.160 111.4
J.J. Hardy 2508 433 392.1 0.173 0.156 110.4
Rafael Furcal 1479 246 226.0 0.166 0.153 108.9
Starlin Castro 2975 505 466.8 0.170 0.157 108.2
Elvis Andrus 2672 467 432.8 0.175 0.162 107.9
Edgar Renteria 1368 232 216.1 0.170 0.158 107.4
Clint Barmes 2122 361 337.4 0.170 0.159 107.0
Ronny Cedeno 1243 218 204.5 0.175 0.164 106.6
Reid Brignac 1550 246 231.5 0.159 0.149 106.3
Troy Tulowitzki 2565 429 407.2 0.167 0.159 105.3
Alexei Ramirez 3119 503 479.7 0.161 0.154 104.9
Erick Aybar 2784 461 442.6 0.166 0.159 104.2
Jimmy Rollins 2681 426 414.4 0.159 0.155 102.8
Alcides Escobar 3090 518 506.6 0.168 0.164 102.3
Jason Bartlett 2449 395 386.7 0.161 0.158 102.2
Yuniesky Betancourt 2661 421 412.9 0.158 0.155 102.0
Jhonny Peralta 2708 422 415.8 0.156 0.154 101.5
Ian Desmond 2930 464 463.7 0.158 0.158 100.1
Stephen Drew 1650 248 247.7 0.150 0.150 100.1
Yunel Escobar 2474 378 388.1 0.153 0.157 97.4
Tsuyoshi Nishioka 1134 180 185.8 0.159 0.164 96.9
Ryan Theriot 1702 259 268.8 0.152 0.158 96.3
Marco Scutaro 1920 301 314.2 0.157 0.164 95.8
Emilio Bonifacio 1338 180 190.5 0.135 0.142 94.5
Willie Bloomquist 1074 157 166.8 0.146 0.155 94.1
Cliff Pennington 2800 412 452.7 0.147 0.162 91.0
Hanley Ramirez 1726 220 244.4 0.127 0.142 90.0
Jamey Carroll 1018 142 162.8 0.139 0.160 87.2
Asdrubal Cabrera 3085 423 488.3 0.137 0.158 86.6
Jose Reyes 2471 363 421.9 0.147 0.171 86.0
Derek Jeter 2294 303 358.7 0.132 0.156 84.5

Jim Leyland praised Jhonny Peralta’s defense the other day, and this chart shows he has a point. I don’t think there are that many surprises at the top of the list. Cliff Pennington has to be a disappointment. He’s not a great hitter, so he needs to be a great fielder to be valuable. It really is amazing that no matter how you measure Jeter’s defense, he just doesn’t get to that many balls.

The thing that really stands out, however, is the left-side of the Marlins infield. Jose Reyes and Hanley Ramirez were among the worst shortstops in the majors in 2011. Now they are going to be patrolling that whole side together. Mark Buehrle might be in for an unpleasant surprise this year.

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at 20 Sunset Rd., Newark, DE 19711.

February 26, 2012 February 26, 2012

Objective PMR, 2011

I recently downloaded the MYSQL version of the retrosheet database, and am ready to look at the objective probabilistic model of range (PMR). It’s objective since it does not include the subjective vector and hard hit parameters that reporters enter in both the STATS and BIS data. In other words, the models for objective PMR just use batter handedness, pitcher handedness, batted ball type, and park to build the model. I’m using data from 2005-2011 for visiting teams only for model. Note that this means that 2011 fielding has no influence on the models.

As with the normal PMR model, I’m measuring the defensive efficiency record (DER) of the team against the expected DER. The following table shows the results for the 30 teams.

Objective DER, Teams, 2011. Model built on data from 2005-2010.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
TBA 4157 3007 2917.7 0.723 0.702 103.1
ANA 4413 3106 3030.1 0.704 0.687 102.5
CLE 4462 3083 3024.5 0.691 0.678 101.9
CIN 4053 2867 2820.5 0.707 0.696 101.7
BOS 4167 2926 2884.9 0.702 0.692 101.4
WAS 4259 2967 2930.7 0.697 0.688 101.2
ATL 4054 2825 2792.6 0.697 0.689 101.2
TEX 4161 2923 2891.1 0.702 0.695 101.1
ARI 4075 2867 2838.0 0.704 0.696 101.0
SEA 4298 3000 2972.1 0.698 0.692 100.9
SLN 4162 2855 2833.0 0.686 0.681 100.8
TOR 4324 3002 2981.6 0.694 0.690 100.7
PHI 4016 2828 2808.9 0.704 0.699 100.7
MIL 3955 2732 2712.7 0.691 0.686 100.7
LAN 3735 2619 2601.6 0.701 0.697 100.7
FLO 4140 2862 2842.5 0.691 0.687 100.7
SDN 3921 2764 2748.8 0.705 0.701 100.6
NYA 4277 2936 2922.6 0.686 0.683 100.5
DET 4232 2932 2922.7 0.693 0.691 100.3
CHA 4337 2978 2971.6 0.687 0.685 100.2
CHN 3985 2706 2699.4 0.679 0.677 100.2
SFN 1992 1389 1390.7 0.697 0.698 99.9
HOU 3966 2694 2698.4 0.679 0.680 99.8
PIT 2118 1447 1450.1 0.683 0.685 99.8
COL 4022 2760 2765.1 0.686 0.688 99.8
KCA 4426 3045 3059.1 0.688 0.691 99.5
MIN 4491 3042 3066.5 0.677 0.683 99.2
OAK 4224 2917 2942.4 0.691 0.697 99.1
BAL 4416 3018 3060.8 0.683 0.693 98.6
NYN 4286 2926 2993.4 0.683 0.698 97.7

If you look at the ranking of the teams using UZR/150, there is some agreement between the two systems. Four of the top five teams in objective PMR also ranke in the top five in UZR/150. There is a glaring difference, however, and that’s the Cleveland Indians. UZR ranks the Indians as the 2nd poorest fielding team in the majors, while objective PMR ranks them second. Where are the differences?

UZR/150 rankings versus objective PMR rankings for Indians positions, 2011
Position UZR/150 Rank Objective PMR Rank
Pitcher N/A 2
Catcher N/A 6
First base 28 13
Second base 30 25
Third base 5 1
Shortstop 28 28
Leftfield 21 20
Centerfield 27 23
Rightfield 12 1

As you can see, UZR does not rank ranges for pitchers and catchers, and PMR rated both those positions well for the Indians. Apart from that, most positions are in decent agreement. Both systems rank the shortstops poorly and the third basemen well. The two biggest discrepancies occur at first base and rightfield. I’m not arguing that one is right and one is wrong. It could be that a bias exists in that direction on the field. Maybe it’s the camera angle, so the balls to that side of the field look easier to field than they actually are. On the other hand, the vectors and batted ball velocities may add important information to the model that knocks the Indians down.

It could also be both those effects are in place, and the Indians real fielding value lies somewhere between the two.

The big differences make for the most interesting comparisons. I’d love to be able to dig deeper into this to see what the distribution of fly balls and ground balls is for the Indians, to see which model is doing a better job of creating a model.

I’ll continue this series looking at teams and fielders by position.

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at 20 Sunset Rd., Newark, DE 19711.

May 23, 2011

Range and Positioning

I did a poor job of blogging Mitchel Litchman’s talk on Sunday, but there were some points about which I’d like to comment.

The first is that Mitchel stopped penalizing the other fielder when an out is made on a shared play. A shared play is when there is some probability that multiple fielders might catch a ball (think of a fly ball to right-center). Mitchel noted that he’s gone back and forth on this over the years, but finally decided against the penalty. His argument was that the batted ball locations we get from the human scoring systems are not precise, and the reason one fielder makes the play compared to the other is that he was closer to the ball to start.

The Probabilistic Model of Range (PMR) does give the fielder not making the out the penalty, as I believe there is information there. Back in the 1990s, STATS, Inc. published Zone Ratings, and Ken Griffey, Jr. did poorly. That surprised people, because they always saw him on ESPN making these fantastic catches at the wall. John Dewan asked me to look into this, and I found the problem was that Griffey was the opposite of a ball hog. He allowed his left and right fielders to take balls that he should have been catching. Now, it’s possible this was a good strategy; Griffey could play deep and make the catches at the wall while the LF and RF took the short flies. It made him look bad compared to other centerfielders, who tried to catch everything. Ken had the reputation when he was young of not trying on every play, however, so maybe it was Ken just didn’t want to go after balls and let his side fielders take up the slack. Since there weren’t an over abundance of balls dropping in for hits in front of him, I somewhat discount the playing deep theory.

PMR does not try to turn the numbers into runs, like UZR. In a system that assigns run values, I agree that the fielder not making the out should not be penalized. After a few minutes thinking about this, I would penalize the fielder. The run values are built into the model. So when a fielder doesn’t get to a ball, the run values for the outs are built into his model. They’re just not his outs. He doesn’t get charged with a hit, he gets charged with the run expectation for the batted ball, including outs made by other fielders. I think I was right the first time. To paraphrase Dennis Moore, this redistribution of runs in trickier than I thought.

My guess is that there’s not much difference, and that over a large enough sample all these things even out.

Update: See update at the end of the post.

(By the way, I showed there that shared plays account for a pretty small amount of balls in play.)

Mitchel’s point that who catches the ball tells us about the positioning is a fascinating argument, and he takes that a step further with errors. Most systems, like PMR, treat a ball in play as a binary outcome. Either an out was produced or not. It doesn’t matter if it was scored a hit, error, or failed fielder’s choice, the batter was either out or safe. Mitchell notes that if a fielder makes an error on a bin of parameters with a low probability for making an out, he must have been positioned near where the ball was hit. Again, the human marking of where a ball was hit likely has a high variance. If you pick a spot on the field, and look at where reporters mark the ball, you probably get a circle with that point as the center. Since errors tend to happen on balls that should be easy to field, it’s likely the ball was hit closer to the fielder than the probability model indicates. Therefore, Mitchel imposes a bigger penalty on errors.

In other words, the error is telling us that the model we’re using shifted. The problem, of course, is that it doesn’t capture hits that happen when the batter makes a similar shift and is not charged with an error. He gets charged with a low probability event, when he should be charged with a higher one. I might even argue that the fact that the fielder shifted and made the correct decision in the shift should lessen the penalty, since his process was right, even though the result was bad.

What I really take away from this, however, is that it might be possible to build a probabilistic model of positioning. For example, if you look at centerfielders against left and right handed batters, and you see him making more plays on one side of the field based on handedness or pull percentage of the fielders, you can infer that the fielder moves to one side of the field or the other. If you see the distributions look the same, however, he’s probably standing in one place and not moving much.

(PMR accounts for handedness of both the batter and the pitcher, so this should be built into the model.)

Mitchel is trying to tackle the big problem in defense, that range as we measure it is really a combination of the ability to move and the ability to position fielders. Until we get FIELDf/x, we have a very difficult time separating the two.

My one other comment on Mitchel’s talk was that With or Without You (WOWY) was showing much larger run difference between good and bad fielders than UZR. Mitchel threw out a couple of theories why, but I think it just may be a sample size problem. Since the start of the 1996 season, Jeter played 2323 games. That means he played about 94% of the Yankees games in that time. So the sample without Jeter is 150 games, and I don’t know if Mitchel’s data goes back that far. Most fielders who play enough to qualify as bad at the position must do something else good to stay in the lineup, so in general the without you component is small. Mitchel likely took that into account, but that was the first thing that crossed my mind.

Update: Here are my formal thoughts on the way runs should be charged under PMR. Let’s propose a play that looks like this to the team, a fly ball halfway between the default positions for the center and right fielder:

  • Out: 25%, -0.07 runs per ball in play
  • Single 50%, .235 runs per ball in play
  • Double 20%, .154 runs per ball in play
  • Triple 5%, .0545 runs per ball in play

So the expected runs per ball in play is .3735, or 37.35 runs per 100 balls in play. In other words, batter should really try to put the ball in play in that spot. We then add up the run value for the actual events on those balls. If the outfielders are good, the team should sum to less than 37.35 runs. If they are poor, more runs.

This is a shared vector, however, and of the 100 balls in play, 15 are caught by the centerfielder, 10 by the rightfielder. Since the centerfielder gets to 60% of the outs, I’ll assume he gets 60% of the run expectation, and rightfielder 40%. So when a ball falls in for a hit, the CF and RF split the run value for that event 60-40.

Now what happens when the ball is caught. The program could do that same thing, and split the run value of the out 60-40, or give all the run value to the fielder who caught the ball and a zero to the fielder who didn’t. If you split, you’re not penalizing the fielder for who didn’t catch the ball. If you go with the full value of the out to the fielder who catches it, you are penalizing the fielder who doesn’t catch the ball.

If the actual data is around a 60-40 split, it doesn’t matter which system you use, because the results will come out the same. It does matter, matter, however, if the CF is catching 80% of those balls. Mitchel would argue that the scoring of the location of the ball is off, or the outfielders are positioned differently, so don’t penalize the RF for something that isn’t his fault. There is the possibility, however, that the CF is making those plays because he’s a much better outfielder than the RF, in which case the penalty should be applied, and the outfielder making the catch should get full credit for the out. This is one of those known unknowns.

Maybe the right way to do this is to give the fielder making the catch a little more credit. I just feel there is information that we are throwing away if we split the credit on a catch proportionally.

March 2, 2011

Five-Year PMR, First Basemen

The five year look at PMR using an objective probabilistic model of range continues with first basemen. Here is the data for the teams at first over the five year span:

Team first baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
SLN 17214 1796 1608.4 0.104 0.093 111.7
SFN 16122 1571 1421.7 0.097 0.088 110.5
CHN 15206 1539 1405.7 0.101 0.092 109.5
ANA 15603 1642 1515.5 0.105 0.097 108.3
HOU 15980 1618 1521.1 0.101 0.095 106.4
TOR 16348 1641 1554.2 0.100 0.095 105.6
SEA 16560 1537 1467.1 0.093 0.089 104.8
PIT 16826 1510 1443.8 0.090 0.086 104.6
SDN 16124 1620 1556.1 0.100 0.097 104.1
NYN 15938 1517 1463.6 0.095 0.092 103.6
CHA 16428 1455 1431.6 0.089 0.087 101.6
BOS 15700 1568 1544.1 0.100 0.098 101.5
NYA 15989 1488 1475.2 0.093 0.092 100.9
BAL 16368 1496 1482.9 0.091 0.091 100.9
FLO 15932 1492 1491.8 0.094 0.094 100.0
TBA 15733 1532 1551.2 0.097 0.099 98.8
TEX 16633 1471 1489.5 0.088 0.090 98.8
OAK 16196 1484 1510.8 0.092 0.093 98.2
KCA 16341 1447 1503.0 0.089 0.092 96.3
DET 16542 1483 1541.2 0.090 0.093 96.2
CIN 16315 1527 1587.8 0.094 0.097 96.2
LAN 16233 1422 1485.1 0.088 0.091 95.8
CLE 16856 1441 1519.1 0.085 0.090 94.9
MIN 15940 1391 1469.6 0.087 0.092 94.7
PHI 16534 1529 1618.2 0.092 0.098 94.5
MIL 16049 1434 1543.8 0.089 0.096 92.9
ARI 16358 1496 1616.5 0.091 0.099 92.5
WAS 16362 1491 1618.9 0.091 0.099 92.1
ATL 16077 1500 1630.7 0.093 0.101 92.0
COL 17060 1511 1644.1 0.089 0.096 91.9

I’m not at all surprised by St. Louis ranking number one. The years I computed the more advance PMR stats, Albert Pujols was head and shoulders above the other first basemen. The Giants have not been great offensively at the position during the time span, but they at least made up a bit for it with defense. The Yankees, with a balance between Jason Giambi and Mark Teixeira over the period come out league average. Next come the players, regulars and semi regulars:

Individual first baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Albert Pujols 15592 1659 1460.7 0.106 0.094 113.6
Kendry Morales 5075 553 502.5 0.109 0.099 110.0
Derrek Lee 11528 1169 1074.3 0.101 0.093 108.8
Lance Berkman 11643 1191 1102.7 0.102 0.095 108.0
Mark Teixeira 14352 1449 1360.2 0.101 0.095 106.5
Lyle Overbay 13371 1352 1273.3 0.101 0.095 106.2
Adrian Gonzalez 15367 1558 1483.0 0.101 0.097 105.1
Adam LaRoche 14185 1350 1283.9 0.095 0.091 105.1
Casey Kotchman 9441 947 903.3 0.100 0.096 104.8
Daric Barton 6616 632 605.3 0.096 0.091 104.4
Kevin Millar 7349 672 649.4 0.091 0.088 103.5
Kevin Youkilis 9955 1022 987.7 0.103 0.099 103.5
Nick Johnson 6023 616 597.9 0.102 0.099 103.0
Joey Votto 8267 837 814.3 0.101 0.099 102.8
Paul Konerko 12850 1134 1111.5 0.088 0.087 102.0
Richie Sexson 6934 625 614.2 0.090 0.089 101.8
Carlos Delgado 8863 835 831.7 0.094 0.094 100.4
Aubrey Huff 4968 458 464.3 0.092 0.093 98.6
Carlos Pena 10393 1022 1038.7 0.098 0.100 98.4
James Loney 11073 1008 1024.5 0.091 0.093 98.4
Ross Gload 5009 462 472.5 0.092 0.094 97.8
Justin Morneau 12564 1127 1175.0 0.090 0.094 95.9
Ryan Howard 15200 1413 1487.3 0.093 0.098 95.0
Miguel Cabrera 8644 777 823.5 0.090 0.095 94.4
Mike Jacobs 6755 565 599.1 0.084 0.089 94.3
Sean Casey 5168 431 458.2 0.083 0.089 94.1
Todd Helton 13067 1175 1271.6 0.090 0.097 92.4
Ryan Garko 7416 616 667.0 0.083 0.090 92.3
Billy Butler 6015 524 568.0 0.087 0.094 92.3
Prince Fielder 15328 1355 1473.4 0.088 0.096 92.0
Nomar Garciaparra 4041 324 362.5 0.080 0.090 89.4
Conor Jackson 5935 524 591.3 0.088 0.100 88.6
Scott Hatteberg 4560 378 433.0 0.083 0.095 87.3
Adam Dunn 4546 375 439.0 0.082 0.097 85.4
Jason Giambi 5062 380 448.5 0.075 0.089 84.7

I don’t have too much to say about this list. Albert Pujols comes out on top, which is what I expected. The people lower on the list tend to be bigger, heavier, older players. No one should be surprised to see Giambi and Dunn near the bottom. I thought Carlos Pena would rank higher and Lance Berkman lower.

I would like to point out something about Albert Pujols that shows his ranking is correct, even without a model. Albert was the first fielder to touch the ball on 1659 outs from 2006-2010. The closest an entire team came to that number was 1642 by all the Angels first basemen. When someone signs him for next season, they are getting the total package, great hitter, incredible fielder.

February 28, 2011

Five-Year PMR, Leftfielders

The five year look at PMR using an objective probabilistic model of range continues with leftfielders. Here is the data for the teams in left over the five year span:

Team leftfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
CHN 15774 1536 1439.3 0.097 0.091 106.7
ATL 15636 1494 1401.5 0.096 0.090 106.6
PIT 18139 1594 1497.5 0.088 0.083 106.4
KCA 18757 1781 1679.8 0.095 0.090 106.0
NYA 15628 1586 1504.5 0.101 0.096 105.4
TBA 17771 1722 1640.0 0.097 0.092 105.0
MIL 17605 1541 1470.1 0.088 0.084 104.8
BAL 18601 1775 1703.9 0.095 0.092 104.2
ARI 15518 1619 1554.6 0.104 0.100 104.1
COL 17608 1462 1415.8 0.083 0.080 103.3
LAN 15747 1416 1379.9 0.090 0.088 102.6
SFN 15583 1566 1526.4 0.100 0.098 102.6
WAS 13948 1620 1601.8 0.116 0.115 101.1
BOS 15990 1512 1501.2 0.095 0.094 100.7
SEA 18929 1668 1659.2 0.088 0.088 100.5
DET 16254 1624 1618.3 0.100 0.100 100.4
SDN 15744 1574 1578.5 0.100 0.100 99.7
NYN 15106 1529 1544.6 0.101 0.102 99.0
ANA 18139 1617 1642.7 0.089 0.091 98.4
TEX 16842 1603 1630.1 0.095 0.097 98.3
OAK 17009 1614 1649.7 0.095 0.097 97.8
CLE 19103 1585 1622.7 0.083 0.085 97.7
TOR 17621 1372 1439.9 0.078 0.082 95.3
MIN 17584 1584 1665.9 0.090 0.095 95.1
SLN 16933 1455 1548.5 0.086 0.091 94.0
CHA 18013 1555 1663.2 0.086 0.092 93.5
CIN 18020 1512 1627.7 0.084 0.090 92.9
PHI 14144 1380 1485.7 0.098 0.105 92.9
FLO 16420 1423 1548.2 0.087 0.094 91.9
HOU 18038 1318 1489.6 0.073 0.083 88.5

From this one might expect Alfonso Soriano to be a pretty good leftfielder. He’s okay, but I guess everyone else who filled in was really good:

Individual leftfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Jay Payton 4591 492 420.2 0.107 0.092 117.1
Matt Diaz 4924 537 461.7 0.109 0.094 116.3
David DeJesus 5747 569 504.3 0.099 0.088 112.8
Johnny Damon 4957 494 456.4 0.100 0.092 108.2
Eric Byrnes 4067 445 411.5 0.109 0.101 108.1
Carl Crawford 14539 1447 1347.8 0.100 0.093 107.4
Ryan Braun 9738 859 819.1 0.088 0.084 104.9
Juan Pierre 5897 560 540.4 0.095 0.092 103.6
Jason Bay 13991 1275 1237.0 0.091 0.088 103.1
Luke Scott 4081 360 352.5 0.088 0.086 102.1
Shannon Stewart 4258 388 382.5 0.091 0.090 101.4
Alfonso Soriano 11797 1163 1148.8 0.099 0.097 101.2
David Murphy 4343 421 419.5 0.097 0.097 100.3
Matt Holliday 15908 1346 1353.0 0.085 0.085 99.5
Scott Podsednik 8245 763 768.9 0.093 0.093 99.2
Juan Rivera 6407 583 589.7 0.091 0.092 98.9
Emil Brown 4408 403 408.7 0.091 0.093 98.6
Fred Lewis 5194 478 492.6 0.092 0.095 97.0
Jason Michaels 5343 436 450.7 0.082 0.084 96.7
Garret Anderson 8106 701 734.6 0.086 0.091 95.4
Luis Gonzalez 6334 553 581.7 0.087 0.092 95.1
Chris Coghlan 4039 364 383.0 0.090 0.095 95.0
Adam Dunn 9522 833 890.6 0.087 0.094 93.5
Barry Bonds 4055 350 376.5 0.086 0.093 93.0
Raul Ibanez 14482 1253 1349.4 0.087 0.093 92.9
Manny Ramirez 8953 750 808.1 0.084 0.090 92.8
Josh Willingham 10019 911 983.5 0.091 0.098 92.6
Adam Lind 4535 356 388.7 0.079 0.086 91.6
Carlos Quentin 4582 390 428.8 0.085 0.094 91.0
Pat Burrell 7730 704 778.7 0.091 0.101 90.4
Craig Monroe 4170 351 392.7 0.084 0.094 89.4
Chris Duncan 4449 376 423.7 0.085 0.095 88.7
Delmon Young 8084 694 802.6 0.086 0.099 86.5
Carlos Lee 14907 1069 1238.1 0.072 0.083 86.3

At least the Royals are above average somewhere in the field. It’s not surprise that Carl Crawford is the best long term leftfielder and the slugging Carlos Lee is the worst. It also appears moving Ryan Braun off third base was the right strategy.

FanGraphs agrees on the top ranking for Carl Crawford, and also rates Soriano and Matt Murton highly.

February 26, 2011

Five-Year PMR, Rightfielders

The five year look at PMR using an objective probabilistic model of range continues with rightfielders. Here is the data for the teams in right over the five year span:

Team rightfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
TEX 17063 1816 1664.4 0.106 0.098 109.1
ARI 18222 1596 1486.0 0.088 0.082 107.4
FLO 15005 1744 1628.4 0.116 0.109 107.1
CLE 16448 1718 1623.7 0.104 0.099 105.8
CIN 19014 1859 1768.9 0.098 0.093 105.1
SDN 18477 1642 1565.8 0.089 0.085 104.9
TOR 15576 1522 1462.0 0.098 0.094 104.1
SFN 16350 1756 1692.9 0.107 0.104 103.7
HOU 18435 1702 1646.5 0.092 0.089 103.4
WAS 15208 1819 1767.7 0.120 0.116 102.9
KCA 18252 1767 1739.5 0.097 0.095 101.6
MIN 17388 1639 1617.1 0.094 0.093 101.4
OAK 14344 1705 1684.4 0.119 0.117 101.2
PHI 16848 1667 1653.6 0.099 0.098 100.8
CHA 14957 1592 1583.7 0.106 0.106 100.5
BOS 17028 1569 1561.2 0.092 0.092 100.5
TBA 14795 1697 1694.1 0.115 0.115 100.2
NYA 15578 1657 1653.8 0.106 0.106 100.2
SLN 19101 1590 1607.5 0.083 0.084 98.9
LAN 17788 1538 1563.4 0.086 0.088 98.4
NYN 14262 1637 1689.5 0.115 0.118 96.9
MIL 18218 1684 1749.2 0.092 0.096 96.3
BAL 17931 1674 1740.7 0.093 0.097 96.2
ATL 17318 1538 1606.5 0.089 0.093 95.7
SEA 17452 1657 1745.2 0.095 0.100 94.9
ANA 15609 1604 1691.5 0.103 0.108 94.8
PIT 18249 1650 1747.4 0.090 0.096 94.4
CHN 16072 1632 1733.3 0.102 0.108 94.2
DET 15483 1533 1645.3 0.099 0.106 93.2
COL 18078 1404 1536.7 0.078 0.085 91.4

A comparison of FanGraphs range runs shows that four of the top eight teams are shared between the two lists. The top teams on the objective PMR list tend to have younger outfielders:

Individual rightfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Nelson Cruz 6996 809 706.4 0.116 0.101 114.5
Alexis Rios 8847 859 783.4 0.097 0.089 109.7
Jay Bruce 6701 686 629.4 0.102 0.094 109.0
Randy Winn 8227 940 867.5 0.114 0.105 108.4
Justin Upton 8906 799 738.7 0.090 0.083 108.2
Nick Swisher 5275 639 599.4 0.121 0.114 106.6
Hunter Pence 10667 1035 976.6 0.097 0.092 106.0
Jeremy Hermida 7305 843 797.4 0.115 0.109 105.7
Shin-Soo Choo 6720 702 664.2 0.104 0.099 105.7
Carlos Quentin 4297 417 397.1 0.097 0.092 105.0
Ryan Ludwick 8676 751 721.7 0.087 0.083 104.1
Jayson Werth 7953 828 798.4 0.104 0.100 103.7
Gabe Gross 4208 472 459.6 0.112 0.109 102.7
Austin Kearns 8814 1034 1008.4 0.117 0.114 102.5
Jermaine Dye 10036 1090 1067.6 0.109 0.106 102.1
J.D. Drew 12711 1157 1133.0 0.091 0.089 102.1
Mark Teahen 5593 553 544.8 0.099 0.097 101.5
Brian Giles 10894 900 908.1 0.083 0.083 99.1
Kosuke Fukudome 4981 515 521.4 0.103 0.105 98.8
Ichiro Suzuki 10925 1098 1116.5 0.101 0.102 98.3
Ken Griffey 5169 452 461.7 0.087 0.089 97.9
Corey Hart 11591 1081 1121.0 0.093 0.097 96.4
Milton Bradley 4013 437 455.1 0.109 0.113 96.0
Vladimir Guerrero 6460 670 697.8 0.104 0.108 96.0
Michael Cuddyer 10977 940 983.7 0.086 0.090 95.6
Geoff Jenkins 4295 388 406.2 0.090 0.095 95.5
Nick Markakis 16303 1503 1574.9 0.092 0.097 95.4
Shawn Green 4313 423 446.3 0.098 0.103 94.8
Jeff Francoeur 15257 1462 1541.6 0.096 0.101 94.8
Andre Ethier 9976 850 899.7 0.085 0.090 94.5
Bobby Abreu 13317 1294 1369.1 0.097 0.103 94.5
Jose Guillen 8014 747 791.7 0.093 0.099 94.4
Xavier Nady 6061 583 643.0 0.096 0.106 90.7
Juan Encarnacion 4275 344 380.6 0.080 0.089 90.4
Brad Hawpe 13340 1024 1144.4 0.077 0.086 89.5
Magglio Ordonez 10859 1022 1156.1 0.094 0.106 88.4

The bottom of the list does tend to be on the older side. I’m always surprised when Ichiro doesn’t do that well in PMR, especially in rightfield. However, he’s no longer a young man, so given average standing, it’s not bad at all. Range Runs likes Ichiro at lot better than objective PMR. Nelson Cruz, Randy Winn and Jay Bruce rate highly there, too, so there’s agreement as well.

February 24, 2011

Five-Year PMR, Third Basemen

The five year look at PMR using an objective probabilistic model of range continues with third baseman. Here is the data for the teams at third over the five year span:

Team third baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
DET 16601 2278 1985.8 0.137 0.120 114.7
BOS 15756 2127 1963.4 0.135 0.125 108.3
BAL 16429 2070 1965.7 0.126 0.120 105.3
SEA 17219 2155 2056.5 0.125 0.119 104.8
WAS 16758 2079 1994.8 0.124 0.119 104.2
ANA 15651 1882 1827.8 0.120 0.117 103.0
TBA 15762 1991 1934.9 0.126 0.123 102.9
OAK 16234 2040 1985.6 0.126 0.122 102.7
SLN 17200 2120 2066.4 0.123 0.120 102.6
PIT 16839 2200 2153.5 0.131 0.128 102.2
LAN 16161 2005 1986.0 0.124 0.123 101.0
MIL 16036 1954 1936.2 0.122 0.121 100.9
SFN 15375 1926 1911.8 0.125 0.124 100.7
KCA 16406 1892 1878.3 0.115 0.114 100.7
PHI 16533 2100 2104.9 0.127 0.127 99.8
CIN 16375 1895 1900.1 0.116 0.116 99.7
CHN 15218 1844 1866.0 0.121 0.123 98.8
ATL 16062 1875 1899.5 0.117 0.118 98.7
MIN 15919 1984 2020.2 0.125 0.127 98.2
NYA 15941 1859 1893.5 0.117 0.119 98.2
NYN 15839 1936 1990.1 0.122 0.126 97.3
COL 16985 1885 1953.7 0.111 0.115 96.5
HOU 15943 1922 1997.4 0.121 0.125 96.2
CLE 16767 2048 2131.1 0.122 0.127 96.1
TOR 16319 1977 2060.7 0.121 0.126 95.9
CHA 17128 2136 2248.1 0.125 0.131 95.0
SDN 16032 1846 1960.5 0.115 0.122 94.2
ARI 16245 1851 1966.5 0.114 0.121 94.1
TEX 16725 1831 1966.0 0.109 0.118 93.1
FLO 16004 1811 1955.9 0.113 0.122 92.6

If a team used Adrian Beltre over the the last five seasons, they were pretty good at third base. Detroit leads the pack, thanks to Brandon Inge:

Individual third basemen PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Brandon Inge 12901 1772 1534.5 0.137 0.119 115.5
Evan Longoria 7774 1054 943.6 0.136 0.121 111.7
Scott Rolen 12222 1579 1461.0 0.129 0.120 108.1
Eric Chavez 4895 639 597.5 0.131 0.122 106.9
Melvin Mora 11353 1459 1369.8 0.129 0.121 106.5
Mike Lowell 9823 1298 1221.1 0.132 0.124 106.3
Nick Punto 4365 598 562.4 0.137 0.129 106.3
Jack Hannahan 4222 543 512.6 0.129 0.121 105.9
Adrian Beltre 14499 1840 1745.0 0.127 0.120 105.4
Chone Figgins 7441 905 859.0 0.122 0.115 105.3
Ryan Zimmerman 14647 1817 1745.1 0.124 0.119 104.1
Ian Stewart 5311 639 614.6 0.120 0.116 104.0
Pedro Feliz 12884 1672 1606.9 0.130 0.125 104.0
Andy LaRoche 5619 732 704.9 0.130 0.125 103.8
Alex Gordon 6409 753 726.3 0.117 0.113 103.7
Jhonny Peralta 4124 528 513.9 0.128 0.125 102.7
Joe Crede 7505 1032 1009.2 0.138 0.134 102.3
Mark Teahen 5416 646 640.9 0.119 0.118 100.8
Chipper Jones 10575 1243 1245.2 0.118 0.118 99.8
Alex Rodriguez 12766 1476 1508.3 0.116 0.118 97.9
Ty Wigginton 4642 546 557.9 0.118 0.120 97.9
Casey Blake 10279 1257 1289.0 0.122 0.125 97.5
Jose Bautista 6348 802 825.7 0.126 0.130 97.1
David Wright 14931 1808 1871.4 0.121 0.125 96.6
Aramis Ramirez 11546 1351 1405.1 0.117 0.122 96.2
Edwin Encarnacion 11086 1264 1318.5 0.114 0.119 95.9
Chad Tracy 4557 522 547.6 0.115 0.120 95.3
Pablo Sandoval 4718 559 591.1 0.118 0.125 94.6
Kevin Kouzmanoff 11031 1268 1346.5 0.115 0.122 94.2
Mark Reynolds 9924 1130 1205.9 0.114 0.122 93.7
Troy Glaus 8065 923 989.4 0.114 0.123 93.3
Garrett Atkins 9743 1045 1120.7 0.107 0.115 93.2
Hank Blalock 4054 457 490.9 0.113 0.121 93.1
Casey McGehee 4211 467 508.5 0.111 0.121 91.8
Miguel Cabrera 6241 715 789.2 0.115 0.126 90.6
Michael Young 5549 597 663.5 0.108 0.120 90.0
Jorge Cantu 4607 485 562.2 0.105 0.122 86.3

Of all the positions run so far, this one matches my expectations the best. The only person near the top of the list that strikes me as not belonging is Melvin Mora, and indeed FanGraphs ranks him low in terms of range runs.

You can also see how much Adrian Beltre will help the Rangers defensively. For every 100 outs Young would field, Beltre should pick up 116 to 117. Over 200 Young outs, Beltre will turn over a full game’s worth.

February 23, 2011

Five-Year PMR, Centerfielders

The five year look at fielding using an objective probabilistic model of range continues with centerfielders. As always, we start with the complete team data:

Team centerfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
NYN 14066 2194 1981.0 0.156 0.141 110.8
ANA 15635 2198 2109.0 0.141 0.135 104.2
MIN 15363 2219 2130.9 0.144 0.139 104.1
SEA 12671 2257 2171.5 0.178 0.171 103.9
HOU 14431 2113 2049.3 0.146 0.142 103.1
FLO 16302 2163 2101.4 0.133 0.129 102.9
DET 16457 2195 2141.6 0.133 0.130 102.5
CLE 16721 2108 2061.6 0.126 0.123 102.2
CIN 14823 2172 2132.6 0.147 0.144 101.8
CHN 14225 1975 1944.1 0.139 0.137 101.6
SDN 16476 2117 2090.9 0.128 0.127 101.2
ATL 15114 1949 1926.2 0.129 0.127 101.2
COL 12832 1949 1931.8 0.152 0.151 100.9
PHI 13987 2008 1995.2 0.144 0.143 100.6
WAS 14229 2228 2221.1 0.157 0.156 100.3
BOS 16397 2063 2059.8 0.126 0.126 100.2
SLN 14393 1942 1937.5 0.135 0.135 100.2
SFN 13812 2116 2116.5 0.153 0.153 100.0
NYA 12545 2093 2101.0 0.167 0.167 99.6
KCA 16621 2147 2160.6 0.129 0.130 99.4
BAL 14513 2229 2244.9 0.154 0.155 99.3
TBA 15993 2195 2210.7 0.137 0.138 99.3
CHA 15444 2045 2064.3 0.132 0.134 99.1
PIT 15893 2057 2092.6 0.129 0.132 98.3
MIL 14597 2089 2144.3 0.143 0.147 97.4
ARI 16596 2040 2097.5 0.123 0.126 97.3
LAN 15463 1869 1926.8 0.121 0.125 97.0
OAK 12127 2074 2146.9 0.171 0.177 96.6
TOR 14658 1880 1957.1 0.128 0.134 96.1
TEX 16113 2060 2165.3 0.128 0.134 95.1

The Mets are just off the chart in centerfield, thanks to Carlos Beltran.

996815061385.90.1510.139108.7715410991015.70.1540.142108.2854511901100.70.1390.129108.16158889828.40.1440.135107.35042913862.40.1810.171105.94643691660.60.1490.142104.67094940903.90.1330.127104.04290615596.00.1430.139103.21151116431592.80.1430.138103.14150727707.30.1750.170102.84150535522.10.1290.126102.51342518871843.30.1410.137102.45666943921.60.1660.163102.31175715131488.70.1290.127101.6799911751161.20.1470.145101.26094844835.00.1380.137101.1976413531338.70.1390.137101.1762211851174.90.1550.154100.96687913906.60.1370.136100.7712310381032.80.1460.145100.51044515451543.60.1480.148100.11147515401544.60.1340.13599.75741898900.40.1560.15799.7716510011004.50.1400.14099.64389565568.00.1290.12999.56447802809.60.1240.12699.17686951969.00.1240.12698.1820710561076.10.1290.13198.11166615021539.20.1290.13297.66051765791.30.1260.13196.74883643665.50.1320.13696.64874595616.60.1220.12796.55253758788.00.1440.15096.21251115721646.20.1260.13295.54990636675.90.1270.13594.14142602649.10.1450.15792.7
Individual centerfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Carlos Beltran
Willy Taveras
Coco Crisp
Carlos Gomez
Franklin Gutierrez
Alexis Rios
Andruw Jones
Denard Span
Torii Hunter
Rajai Davis
Alfredo Amezaga
Curtis Granderson
Melky Cabrera
Grady Sizemore
Shane Victorino
Jim Edmonds
B.J. Upton
Adam Jones
Gary Matthews
Michael Bourn
Aaron Rowand
Mike Cameron
Corey Patterson
Marlon Byrd
Joey Gathright
Juan Pierre
Matt Kemp
Nate McLouth
Chris Young
Cody Ross
Josh Hamilton
Jacoby Ellsbury
David DeJesus
Vernon Wells
Andrew McCutchen
Mark Kotsay

The Twins should be proud that they produced fielding gems Torii Hunter and Denard Span. I’m somewhat surprised Jacoby Ellsbury does poorly, but maybe the Red Sox realized that when they moved him to left at the start of the 2010 season. Josh Hamilton clearly should not be the regular CF for the Rangers.

FanGraphs’ Range Runs doesn’t like Wells or Kotsay, but rates Beltran a lot lower. In general, however, it looks like there is more agreement here than among the second baseman.

February 22, 2011

Five Year PMR, Second Basemen

My five-year survey of fielders using objective PMR continues with second basemen. (You can find all Probabilistic Model of Range posts here.) We start with the composite team view:

Team second baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
NYA 16081 2674 2462.3 0.166 0.153 108.6
COL 17011 2796 2590.3 0.164 0.152 107.9
ARI 16276 2672 2498.8 0.164 0.154 106.9
PHI 16614 2596 2481.2 0.156 0.149 104.6
TEX 16691 2721 2611.7 0.163 0.156 104.2
OAK 16203 2653 2563.1 0.164 0.158 103.5
TOR 16418 2663 2590.4 0.162 0.158 102.8
CLE 16901 2687 2616.5 0.159 0.155 102.7
NYN 16069 2476 2428.2 0.154 0.151 102.0
HOU 15999 2467 2427.4 0.154 0.152 101.6
BOS 15779 2509 2478.1 0.159 0.157 101.2
SEA 16591 2562 2531.3 0.154 0.153 101.2
ATL 16587 2585 2554.0 0.156 0.154 101.2
MIL 16105 2373 2351.5 0.147 0.146 100.9
FLO 16105 2493 2474.4 0.155 0.154 100.8
LAN 16188 2573 2555.3 0.159 0.158 100.7
WAS 16839 2518 2517.7 0.150 0.150 100.0
ANA 15624 2549 2554.1 0.163 0.163 99.8
DET 16562 2593 2598.1 0.157 0.157 99.8
MIN 16082 2630 2635.0 0.164 0.164 99.8
CIN 16377 2478 2484.3 0.151 0.152 99.7
CHA 16459 2541 2552.9 0.154 0.155 99.5
CHN 15242 2340 2379.5 0.154 0.156 98.3
SFN 15403 2308 2360.4 0.150 0.153 97.8
SLN 17217 2555 2620.3 0.148 0.152 97.5
TBA 15843 2408 2472.2 0.152 0.156 97.4
BAL 17338 2474 2573.4 0.143 0.148 96.1
SDN 16048 2442 2566.9 0.152 0.160 95.1
KCA 16381 2448 2575.9 0.149 0.157 95.0
PIT 16877 2218 2472.8 0.131 0.147 89.7

The Yankees dominate this list, while I’m not surprised the Royals, a very poor defensive team, come up near the bottom. The Giants, who like to play veterans, tend toward the bottom as well. Next, the individual players, the regulars and semi regulars:

Individual second baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Orlando Hudson 12946 2221 2031.8 0.172 0.157 109.3
Robinson Cano 14648 2415 2237.8 0.165 0.153 107.9
Mark Ellis 11955 2018 1894.1 0.169 0.158 106.5
Ian Kinsler 12629 2103 1976.6 0.167 0.157 106.4
Clint Barmes 5285 855 805.1 0.162 0.152 106.2
Jamey Carroll 6028 1000 943.2 0.166 0.156 106.0
Jose Lopez 11573 1878 1772.7 0.162 0.153 105.9
Chase Utley 14380 2270 2152.4 0.158 0.150 105.5
Adam Kennedy 7914 1276 1224.0 0.161 0.155 104.2
Dustin Pedroia 10181 1645 1588.2 0.162 0.156 103.6
Kazuo Matsui 7745 1207 1167.3 0.156 0.151 103.4
Placido Polanco 10775 1723 1683.6 0.160 0.156 102.3
Aaron Hill 12243 1968 1928.3 0.161 0.158 102.1
Howie Kendrick 8266 1375 1355.0 0.166 0.164 101.5
Josh Barfield 5445 879 874.1 0.161 0.161 100.6
Dan Uggla 15092 2318 2318.3 0.154 0.154 100.0
Rickie Weeks 10082 1457 1459.3 0.145 0.145 99.8
Brandon Phillips 14595 2213 2217.3 0.152 0.152 99.8
Kelly Johnson 9706 1474 1481.7 0.152 0.153 99.5
Felipe Lopez 5964 898 904.0 0.151 0.152 99.3
Aaron Miles 5054 754 767.0 0.149 0.152 98.3
Tadahito Iguchi 6725 1004 1030.2 0.149 0.153 97.5
Mark Grudzielanek 7038 1052 1085.3 0.149 0.154 96.9
Ronnie Belliard 7049 1025 1057.5 0.145 0.150 96.9
Luis Castillo 10720 1608 1668.0 0.150 0.156 96.4
Brian Roberts 13706 1945 2038.1 0.142 0.149 95.4
Ray Durham 6439 925 978.5 0.144 0.152 94.5
Freddy Sanchez 10154 1398 1483.3 0.138 0.146 94.2
Jeff Kent 6600 970 1043.5 0.147 0.158 93.0

I like that the peripatetic Orlando Hudson comes up at the top of this list. He played for three teams over these five seasons. I sometimes wonder if the way the model is built makes fielders on strong offensive teams look good (like Robinson Cano). The Yankees send a number of good left-handed hitters to the plate, so when a model is built for Yankee Stadium, one might expect the ability for a second baseman to field a ball there is tougher than it may be in reality. Hudson, coming out high on the list, playing for fairly weak offensive teams argues against that.

At the other end is an old Jeff Kent. Right smack in the middle, however, is Dan Uggla, who owns a reputation as a poor fielder. In fact, there is a big discrepancy here between the objective model and FanGraphs range run ranking. The latter puts Cano, Uggla and Orlando Hudson all near the bottom of the list. This is fairly huge. Looking at Bill James Online, Hudson was near the top in +/- in three of the five years.

Here’s a closer look at Hudson and Cano:

Type of Batted Ball Hudson’s Index Cano’s Index
Ground 103.2 106.5*
Line 125.8 112.6
Pop 131.5* 112.6

*Highest

So Hudson caught a lot of line drives and is a pop up hog. Cano’s ranking looks pretty good, however. I don’t know if you can be lucky with line drives over a five year period. Maybe Orlando is really good at positioning himself.

February 21, 2011

Five Year PMR, Shortstops

The long term fielding study continues with a look at shortstops. (See yesterday’s post for background.) The following table shows how the teams performed as a whole at the position over the last five years:

Team shortstop PMR, 2006-2010, four parameter objective model built on visiting team data.
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
COL 17144 2884 2684.1 0.168 0.157 107.4
ATL 16331 2696 2544.0 0.165 0.156 106.0
MIN 16153 2803 2670.7 0.174 0.165 105.0
SLN 17324 2828 2721.8 0.163 0.157 103.9
TEX 16725 2713 2612.6 0.162 0.156 103.8
LAN 16314 2744 2643.8 0.168 0.162 103.8
MIL 16139 2622 2535.9 0.162 0.157 103.4
PIT 16925 2804 2746.7 0.166 0.162 102.1
TOR 16460 2617 2574.3 0.159 0.156 101.7
KCA 16492 2635 2595.6 0.160 0.157 101.5
PHI 17622 2606 2569.6 0.148 0.146 101.4
SDN 16243 2599 2564.5 0.160 0.158 101.3
ANA 15703 2510 2481.2 0.160 0.158 101.2
HOU 16079 2637 2612.2 0.164 0.162 101.0
CHA 16511 2707 2680.3 0.164 0.162 101.0
CHN 15291 2499 2473.8 0.163 0.162 101.0
BAL 16583 2642 2643.3 0.159 0.159 100.0
DET 16622 2566 2575.4 0.154 0.155 99.6
SEA 16618 2590 2605.7 0.156 0.157 99.4
CLE 16968 2764 2784.9 0.163 0.164 99.3
CIN 16487 2497 2521.5 0.151 0.153 99.0
ARI 17295 2524 2554.6 0.146 0.148 98.8
TBA 16778 2420 2451.4 0.144 0.146 98.7
WAS 16945 2623 2664.9 0.155 0.157 98.4
FLO 17005 2443 2486.5 0.144 0.146 98.2
BOS 15942 2448 2498.9 0.154 0.157 98.0
OAK 16251 2588 2646.0 0.159 0.163 97.8
SFN 15499 2509 2565.1 0.162 0.166 97.8
NYA 16663 2364 2496.1 0.142 0.150 94.7
NYN 16262 2491 2641.2 0.153 0.162 94.3

I’m surprised Texas does this well, as Michael Young was the shortstop for a good period of time here, and he’s not known for his defensive ability. Minnesota’s ranking makes some sense as the team is known for good defense. Boston’s rotating players at the position took a toll. It’s been a terrible half-decade in New York. Despite their high-profile shortstops, this study show they were poor fielders.

On to the individuals. The following table shows shortstops on the field for at least 5000 balls in play over the last five seasons. That gives us the regulars and semi-regulars.

Fielder In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
Brendan Ryan 5653 1004 885.8 0.178 0.157 113.3
Yunel Escobar 8574 1464 1332.7 0.171 0.155 109.9
Elvis Andrus 5577 951 878.4 0.171 0.158 108.3
Troy Tulowitzki 11389 1928 1779.9 0.169 0.156 108.3
Jack Wilson 10305 1759 1665.8 0.171 0.162 105.6
Jason Bartlett 12100 1939 1841.6 0.160 0.152 105.3
Adam Everett 7554 1263 1205.7 0.167 0.160 104.8
Rafael Furcal 11168 1901 1819.9 0.170 0.163 104.5
Alexei Ramirez 6102 1008 968.8 0.165 0.159 104.0
Cristian Guzman 6351 1012 976.9 0.159 0.154 103.6
Erick Aybar 7345 1197 1159.1 0.163 0.158 103.3
Cesar Izturis 9194 1514 1472.7 0.165 0.160 102.8
Ronny Cedeno 7367 1210 1182.9 0.164 0.161 102.3
Jimmy Rollins 14724 2203 2154.5 0.150 0.146 102.3
J.J. Hardy 10454 1688 1654.7 0.161 0.158 102.0
Marco Scutaro 8509 1352 1334.1 0.159 0.157 101.3
Alex Gonzalez 9437 1507 1489.8 0.160 0.158 101.2
Michael Young 9493 1498 1483.6 0.158 0.156 101.0
Ryan Theriot 7811 1263 1255.1 0.162 0.161 100.6
Jhonny Peralta 10774 1782 1776.7 0.165 0.165 100.3
John McDonald 5264 811 811.2 0.154 0.154 100.0
Juan Uribe 8144 1329 1337.5 0.163 0.164 99.4
Omar Vizquel 7598 1247 1255.2 0.164 0.165 99.3
Carlos Guillen 5410 854 861.0 0.158 0.159 99.2
Bobby Crosby 6977 1115 1128.6 0.160 0.162 98.8
Orlando Cabrera 14196 2243 2276.2 0.158 0.160 98.5
Hanley Ramirez 15208 2191 2225.0 0.144 0.146 98.5
Julio Lugo 6982 1074 1090.3 0.154 0.156 98.5
Khalil Greene 8178 1264 1284.9 0.155 0.157 98.4
Yuniesky Betancourt 15070 2318 2372.4 0.154 0.157 97.7
Miguel Tejada 12527 1963 2009.9 0.157 0.160 97.7
Edgar Renteria 11358 1746 1797.3 0.154 0.158 97.1
Stephen Drew 13184 1869 1946.1 0.142 0.148 96.0
David Eckstein 5814 880 921.7 0.151 0.159 95.5
Jose Reyes 12550 1930 2041.1 0.154 0.163 94.6
Felipe Lopez 5884 870 931.7 0.148 0.158 93.4
Derek Jeter 14916 2076 2241.1 0.139 0.150 92.6

You can see why Texas ranked so high. Michael Young played a little above league average, while Elvis Andrus came out near the top of the rankings. Of the longer term shortstops, Troy Tulowitzki brought the Rockies up to the number one team ranking.

Derek Jeter ranks last, despite his good 2009 season, which brings me back to the whole idea of presenting a more objective model, biases in scoring hit location make some players look better. This system shows Jeter turning 165 fewer balls into outs than expected. Looking at FanGraphs and Baseball Reference, both which use batted ball location data, this objective model is very much in line with their results. Jeter is a poor fielder compared to the average shortstop.

The big disagreement comes with Omar Vizquel. FanGraphs rates his range very high, and BR shows him positively contributing to the defense. Since Omar has such a great reputation as a fielder, maybe scorers are putting balls he doesn’t get to more out of reach than than they should be. It would make sense, at Omar’s age, that he would not be that great defensively anymore.

As always, I’m interested in your thoughts.

February 20, 2011

Five Year PMR

As a follow up to my last post on objective Probabilistic Model of Range (PMR), I down loaded the latest retrosheet data, complete through 2010. I thought I might actually extend the model with vectors, but few events have hit location attached.

On the other hand, retrosheet data is very consistent from year to year with classifying balls in play as grounders, flies, liners and pops. So I added that to the model. The model now contains three objective parameters, batter-handedness, pitcher handedness, and stadium, and well as the slightly subjective batted ball type.

I also decide to look at these models longer term, so I’ll do a series of five-year studies, covering fielding from 2006 to 2010. For each model for a season, I used five years of data, not including the year in question. This way, none of the data for the season in question was used to train the model. I used the 2005 through 2010 data. For example, the model for 2006 is built from 2005, 2007, 2008, 2009, 2010. The 2009 data is built from 2005-2008 and 2010. The only exception are the models for Target Field, which only existed in 2010. I also only built the model with data on the visiting fielders, so a great or terrible defender for a team would not influence the model that much.

Just to review, PMR determines the probability of a batted ball being turned into an out based on a set of parameters. Adding those probabilities up for each ball in play gives us the expected number of outs. Calculating an index by the formula (100*actual outs/predicted outs) allows a ranking, with number over 100 good and numbers under 100 poor. When I used BIS data, I included a direction for the ball, and a measure of distance. Those elements are subjective, however, so leaving them out removes the biases of the scorers. Since the number of balls in play an individual fielder can handle is small in any year, by looking at a longer term model we should get a better picture of who are the best glove men.

Compiling the data for five years, the Red Sox were the best defensive team in the majors.

Team PMR, 2006-2010
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
BOS 17014 11800 11524.4 0.694 0.677 102.4
NYA 17125 11878 11636.5 0.694 0.680 102.1
COL 17595 12096 11894.7 0.687 0.676 101.7
NYN 17490 12168 11973.4 0.696 0.685 101.6
TOR 17110 11874 11691.9 0.694 0.683 101.6
ANA 17487 12023 11880.8 0.688 0.679 101.2
TEX 17830 12260 12117.6 0.688 0.680 101.2
DET 17710 12214 12084.1 0.690 0.682 101.1
SEA 17881 12370 12253.6 0.692 0.685 101.0
SLN 17996 12452 12327.8 0.692 0.685 101.0
ATL 17229 11918 11816.6 0.692 0.686 100.9
SFN 16821 11752 11653.6 0.699 0.693 100.8
PHI 17548 12164 12077.2 0.693 0.688 100.7
TBA 17142 11856 11768.6 0.692 0.687 100.7
ARI 17339 11945 11874.9 0.689 0.685 100.6
CHN 16667 11615 11544.6 0.697 0.693 100.6
LAN 16889 11745 11690.8 0.695 0.692 100.5
CLE 18081 12369 12310.3 0.684 0.681 100.5
OAK 17441 12122 12075.0 0.695 0.692 100.4
MIN 17929 12336 12283.0 0.688 0.685 100.4
SDN 17237 12032 12019.3 0.698 0.697 100.1
KCA 17765 12096 12096.4 0.681 0.681 100.0
BAL 18048 12392 12430.5 0.687 0.689 99.7
WAS 18063 12432 12469.4 0.688 0.690 99.7
FLO 17406 11830 11860.0 0.680 0.681 99.7
CIN 17540 12098 12143.1 0.690 0.692 99.6
MIL 17408 11921 11968.0 0.685 0.687 99.6
PIT 18280 12382 12481.3 0.677 0.683 99.2
HOU 17443 11939 12083.1 0.684 0.693 98.8
CHA 17630 12044 12197.7 0.683 0.692 98.7

I was somewhat surprised to see the Yankees second. The team has brought in a few good defenders in recent years. The Rockies were the best team in the NL, and their speedy outfielders make a difference in the big park. The White Sox inhabit the bottom of the rankings, with the Astros bringing up the rear in the NL.

Over the next week studies will include each position, as well as defenses behind pitchers.

February 10, 2011

Objective PMR

In November I made note of this post, Jeter Uncertainty Principle, which linked to a Baseball Prospectus article that wondered if the batted ball fielding systems were overestimating the range of players.

the spread of observed performance in metrics like UZR and DRS is much, much smaller than that of metrics like nFRAA (or Tom Tango’s With Or Without You system, which is similarly down on Jeter’s fielding ability).

This got me thinking, but unfortunately my computer crashed and I lost my MSSql database with my fielding data. I recently downloaded the retrosheet data through 2009, and decided to do an experiment.

The Probabilistic Model of Range uses six parameters to determine the probability of a ball in play being turned into an out. Three of those, the direction, velocity, and batted ball type, are subjective measures. Three of them, the handedness of the batter, the handedness of the pitcher, and the park are objective. If I used just those three objective measures to construct the model, would I see a bigger spread in the data?

I last computed PMR in 2008, and here is the team listing from that year. I built the retrosheet model a bit differently. I only use visiting fielders so a great or terrible home fielder doesn’t over influence the model, but I used a different set of data to build the model. Instead of using the same year, I used the previous three seasons and the following season. So for 2008, I used the 2005, 2006, 2007 and 2009 seasons. Here are the results for the teams:

Objective PMR, 2008 Teams, model built with visiting team data from 2005, 2006, 2007, and 2009
Team In Play Actual Outs Predicted Outs Actual DER Predicted DER Index
BOS 4229 2954 2825.582 0.699 0.668 104.5
TBA 4265 3024 2908.787 0.709 0.682 104.0
TOR 4217 2962 2881.598 0.702 0.683 102.8
CHN 4164 2930 2857.973 0.704 0.686 102.5
ANA 4374 3024 2973.781 0.691 0.680 101.7
OAK 4292 2992 2944.710 0.697 0.686 101.6
MIL 4362 3048 3006.376 0.699 0.689 101.4
SLN 4604 3198 3159.799 0.695 0.686 101.2
FLO 4342 3005 2970.709 0.692 0.684 101.2
PHI 4399 3061 3023.389 0.696 0.687 101.2
ATL 4392 3040 3008.062 0.692 0.685 101.1
KCA 4416 3039 3008.497 0.688 0.681 101.0
NYN 4341 3030 3004.247 0.698 0.692 100.9
NYA 4351 2963 2941.234 0.681 0.676 100.7
COL 4535 3075 3056.998 0.678 0.674 100.6
CLE 4514 3094 3078.160 0.685 0.682 100.5
HOU 4298 2999 2988.713 0.698 0.695 100.3
BAL 4539 3120 3112.376 0.687 0.686 100.2
DET 4536 3107 3104.271 0.685 0.684 100.1
WAS 4420 3044 3044.363 0.689 0.689 100.0
LAN 4277 2951 2950.631 0.690 0.690 100.0
MIN 4584 3141 3143.573 0.685 0.686 99.9
PIT 4688 3166 3187.588 0.675 0.680 99.3
ARI 4236 2903 2927.417 0.685 0.691 99.2
SEA 4514 3069 3095.687 0.680 0.686 99.1
SDN 4426 3080 3109.792 0.696 0.703 99.0
SFN 4237 2904 2942.056 0.685 0.694 98.7
CHA 4395 3006 3058.851 0.684 0.696 98.3
TEX 4671 3126 3185.941 0.669 0.682 98.1
CIN 4313 2904 2990.839 0.673 0.693 97.1

The number of balls in play do not match up precisely between BIS and Retrosheet, and I’m exploring why (in the case of Baltimore, it was a foul ball error). The ordering is different, which isn’t surprising given the different model and the way the model was built. What I would like you to notice, however, is that the spread of the index is indeed wider than the original PMR model shows. So a probabilistic model that uses only objective parameters also shows a larger spread.

I’m going to try to get more comfortable with the data, making sure the differences are just foul pops. Then, I’ll build some positional models to see what happens there.

August 30, 2010

Counting Fielder Outs

Rob Neyer attended the PITCHf/x Summit, and talks about some ideas for measuring fielding with Hit Tracker:

As Greg pointed out, all of our current “new” defensive metrics are “zone-based”; that is, they begin by separating the field into distinct zones, noting in which zone a play has been made, and then apportioning credit (or not) when a fielder makes a play (or doesn’t) in that zone. I’m simplifying, of course, but essentially every reputable system now in use shares two significant defects: the “zones” don’t fit neatly into today’s highly variable outfield dimensions, and the systems don’t have any way to account for the fielder’s starting position.

What’s more, Greg argued that the great majority of plays made by outfielders are essentially irrelevant. If an outfielder doesn’t have to move to catch a line drive, that play still counts in his favor. If he can’t catch that same fly ball because he was pulled way in with the potential winning run on third base in the ninth inning, that play still counts against him.

What Greg proposes is stripping out the plays that any outfielder could make, and the plays that perhaps no outfielder could make, and look instead at only the plays that might be made. And this might be done given FIELDf/x’s ability to record an outfielder’s starting position, the hang time of the batted ball, and the baseball’s landing point.

No, nothing needs to be stripped out. The starting position of the fielder, the hang time and the landing position of the ball become part of the probabilistic model. It’s going to be very interesting to see how the two systems measure fielding.

Note that one isn’t necessarily better than the other. As I’ve noted before, my Probabilistic Model of Range doesn’t measure just range, but the ability of a fielder to position himself well. From the description above, True Range would measure the ability of a fielder to move to a batted ball, his actual range. If the fielder doesn’t position himself well, there should be a penalty for that also, and the traditional zone based system catches those.

If we see a player who does well in a zone system but poorly in a range system, we might conclude he (or his coaches) position him well.

March 30, 2009 March 30, 2009

Defensive Mention

Thanks to Geoff Baker for a mention of PMR in his article on defense:

Dewan introduced a well-known Plus/Minus system a few years back, then used it as the main component of his new DRS metric. But there are other top systems, like Mitchel Lichtman’s Ultimate Zone Rating (UZR) or David Pinto’s Probabilistic Model of Range (PMR), that intelligent baseball minds swear are just as good or superior.

“I like to look at all of them,” said Tony Blengino, a special assistant to Mariners general manager Jack Zduriencik, who oversees the team’s new stats branch. “It’s kind of an emerging frontier of statistical analysis, and I think they all bring something to the table. More often than not they point in similar directions. Every now and then, they don’t. But I think there’s progress being made in the fielding analysis. I think they all have something to say.”

December 9, 2008