Category Archives: Probabilistic Model of Range

November 22, 2016

Pitchers and Defense

Baseball Think Factory links to a developing debate between Joe Posnanski and Sean Forman about the defensive adjustments to pitcher WAR at Baseball Reference. As someone who dabbled in measuring defense, figuring out how much some defense helped or hurt a particular pitcher is a tough problem.

The Posnanski piece wonders why rWAR rates Justin Verlander so much higher than Rick Porcello. The difference comes down to a defensive adjustment. The Red Sox rate as an excellent defense, the Tigers as a poor defensive team. Posnanski’s argument is that the Tigers played great defense behind Verlander, as his BABIP was .256 versus the team BABIP of .302. Forman’s argument is that the team’s defense is the team’s defense, and there’s no reason to think they got good because Verlander was on the mound. There is a lot of random variation in baseball, because nearly every set of individual stats is a small sample size.

When I worked on the Probabilistic Model of Range, one thing I liked to do was look at the expected outs a pitcher produced versus the actual outs recorded. Here’s a post from the end of the 2011 season, when Verlander won the Cy Young Award:

Notice that Justin Verlander and Josh Beckett both produced high expected DERs, but the defense magnified that by turning so many more batted balls into outs. Verlander pitched very well, but his defense gave him that extra push into the Cy Young.

Maybe that happened again. Maybe Verlander is just so good at inducing poor contact that even a bad defensive team looks good. I would love to know the expectations for Verlander’s and Porcello’s balls in play.

May 25, 2015

Crediting Pitchers and Fielders

1 Reply

Tom Tango and Phil Birnbaum discuss crediting pitchers and fielder properly on outs and hits on balls in play.

When I worked on the Probabilistic Model of Range (PMR), I liked to list PMR behind pitchers. The display would show the probability of the pitcher’s balls in play being turned into outs, and the actual percentage of outs recorded by the fielders. I ran it as a way of seeing pitchers who were helped or hurt by their defense, but it also showed which pitchers were doing a good job of inducing easy or tough to field balls in play.

It strikes me that this could be used as a way to properly weight credit to the pitcher and the fielder. Pitchers who induce easier to field batted balls simply get more credit that those who make life tough for the fielders. I’m sure some clever person could work this out.

March 2, 2014

Thought Experiment

5 Replies

Given that new defensive data will be available, how do we use it to measure range and positioning? Range in this case represents the ability to make plays by moving to a ball, positioning represents the ability to be in the right place at the right time.

A concept that will be useful to positioning for grounders and low line drives is shortest distance to the ball. Imagine the fielder standing at a fixed point on the field when the ball is put into play (not necessarily true, but a good first approximation). With home plate as the origin (the intersection of the foul lines), we can draw a line from the origin through the position of the fielder. We can also draw a line that represents the path of the ball. The shortest distance to the ball will be along a line from the fielder that intersects the path of the ball, perpendicular to the path of the ball.

Players who position themselves well should have a high probability of a low distance to the ball. That’s not quite enough, however. The straight away positions for the players are based on the distribution of batted balls, set where there is the highest local probability of a ball being hit. By standing at a straight away position, a fielder is giving himself a good chance to being close to a batted ball in his zone of opportunity.

The closeness probabilities for a fielder should be compared to the closeness probabilities if the fielder always played straight away. In other words, can a fielder to better than the default.

For balls higher in the air, we simply want to measure where the ball landed compared to where the fielder started, but once again we want to compare the result to a straightaway fielder.

Distance to the ball also comes into the range calculation. For that we want to measure the probability of making a play given the distance to the ball. The assumption is that the further away from the ball, the lower the probability of making a play. If we calculate the rate of decline in probability as a function of distance, players with great range should have a rate that is less negative. If this relationship was linear, we would say that players with great range have a less negative slope.

We could then rate players in a given season on a two dimensional scale of range and positioning. The fielders who performed best at PMR or UZR would likely do well in both positioning and range. Those at the other end might be poor at both positioning and range. Players with good range but poor positioning might look good, since their range can cover the weakness of position. Players with good position and bad range might look average. They would get bonuses for the times they were positioned well, but lose points for easy plays they were not close enough to convert.

Positioning may be a team skill more than a player skill. It will be interesting to see if we can separate out the two. For example, we might look for players who position themselves better than the team as a whole. We might look for players who are consistently good or bad from team to team. Do more experienced players position themselves better than less experienced players?

I look forward to seeing the research that falls out of this data.

March 22, 2012

More on Shifting

2 Replies

Earlier today I linked to a Bill James piece on using the shift against David Ortiz.

Having thought about it some more, this struck me as interesting:

Sounds a little low, but let’s go with it. To show the unique importance of David Ortiz in this discussion: The Shift has been used against Ortiz, over the two years, 486 times, not counting the strikeouts and walks and such like, but the most times that any team has USED the shift over the two years is 437, by Tampa Bay, and no other team is even over 300. It’s a definition of dominating a category. Remember all those articles pointing out that Babe Ruth hit more home runs than any other TEAM in 1920? Same thing; the number for an individual is greater than the number for any team. We should also remember to give The Fielding Bible credit for giving us hard data as to the number of times in a season Tampa Bay uses the shift.

Now, James discusses specifically using the shift against Ortiz, but there’s nothing special, as far as I can tell, about Ortiz as a left-handed pull hitter. Is he all that different from others who see the shift, like Mark Teixeira or Jason Giambi? My answer at the moment is I don’t know.

I do find it interesting, however, that Tampa Bay uses the shift extensively and ranks near the top in defensive metrics. The objective probabilistic model of range (PMR) rates them the best defensive team in the majors in 2011, as does UZR. Shifting for the Rays doesn’t seem to be hurting them, since they’re making plays most teams don’t. I suspect the Rays believe it helps, or they wouldn’t do it so often. I’d like to see if they always shift against specific batters, or if the batter/pitcher combination makes a difference.

March 8, 2012

Objective PMR, Pitcher Support

1 Reply

One nice thing about the Probabilistic Model of Range (PMR) is that it can be used to explore more than fielding skill. Applied to the defense behind a pitcher, it can be a measure of how much that hurler was helped or hurt by his defense. The following chart shows objective PMR stats for defenses with a particular pitcher on the mound in 2011, minimum 300 balls in play:

Objective PMR, performance behind pitchers. Model built on data from 2005-2010, visiting teams only.
Pitcher	In Play	Actual Outs	Predicted Outs	DER	Predicted DER	Index
Josh Tomlin	525	387	356.4	0.737	0.679	108.6
Josh Beckett	511	382	356.2	0.748	0.697	107.3
Justin Verlander	635	477	445.5	0.751	0.702	107.1
Jeremy Hellickson	560	431	404.4	0.770	0.722	106.6
Randy Wolf	636	447	420.0	0.703	0.660	106.4
Doug Fister	669	478	453.7	0.714	0.678	105.4
Ricky Romero	620	457	434.8	0.737	0.701	105.1
Kyle Lohse	525	375	357.3	0.714	0.681	105.0
Kyle Kendrick	346	251	239.1	0.725	0.691	105.0
Jeff Niemann	407	292	278.4	0.717	0.684	104.9
Randy Wells	393	277	264.2	0.705	0.672	104.8
Chris Narveson	473	328	313.6	0.693	0.663	104.6
Blake Beavan	332	235	224.9	0.708	0.678	104.5
Joe Saunders	652	469	448.8	0.719	0.688	104.5
Ervin Santana	665	474	454.3	0.713	0.683	104.3
James Shields	655	478	458.4	0.730	0.700	104.3
Fausto Carmona	606	420	403.2	0.693	0.665	104.2
Gavin Floyd	569	405	389.0	0.712	0.684	104.1
Jered Weaver	649	481	462.0	0.741	0.712	104.1
Ian Kennedy	563	405	389.0	0.719	0.691	104.1
Jhoulys Chacin	502	363	349.3	0.723	0.696	103.9
Alexi Ogando	500	360	347.0	0.720	0.694	103.7
Tommy Hanson	316	225	217.0	0.712	0.687	103.7
Carlos Villanueva	339	242	233.6	0.714	0.689	103.6
Johnny Cueto	428	310	299.5	0.724	0.700	103.5
Alfredo Aceves	326	246	238.0	0.755	0.730	103.4
Javier Vazquez	549	388	375.1	0.707	0.683	103.4
Luke Hochevar	615	440	425.6	0.715	0.692	103.4
Philip Humber	499	357	345.8	0.715	0.693	103.2
Cole Hamels	542	391	378.9	0.721	0.699	103.2
Dan Haren	703	502	486.9	0.714	0.693	103.1
Ivan Nova	530	371	360.0	0.700	0.679	103.1
Jeremy Guthrie	629	445	431.5	0.707	0.686	103.1
Bronson Arroyo	650	464	450.2	0.714	0.693	103.1
Carlos Carrasco	368	256	248.6	0.696	0.675	103.0
Jair Jurrjens	453	327	317.6	0.722	0.701	103.0
Guillermo Moscoso	398	300	292.2	0.754	0.734	102.7
David Price	606	428	416.9	0.706	0.688	102.7
Tim Stauffer	524	372	362.2	0.710	0.691	102.7
Tim Hudson	612	434	423.6	0.709	0.692	102.5
Chris Carpenter	664	445	434.1	0.670	0.654	102.5
Wandy Rodriguez	505	350	342.1	0.693	0.677	102.3
Jordan Lyles	303	204	199.4	0.673	0.658	102.3
Freddy Garcia	466	323	316.0	0.693	0.678	102.2
Jason Hammel	531	371	363.5	0.699	0.685	102.1
Homer Bailey	386	270	264.4	0.699	0.685	102.1
Ted Lilly	516	375	367.2	0.727	0.712	102.1
Carlos Zambrano	437	304	297.7	0.696	0.681	102.1
John Lannan	568	385	377.9	0.678	0.665	101.9
Cliff Lee	594	417	409.3	0.702	0.689	101.9
Josh Collmenter	456	338	331.7	0.741	0.727	101.9
Vance Worley	355	250	245.4	0.704	0.691	101.9
J.A. Happ	411	279	274.0	0.679	0.667	101.8
Dillon Gee	465	330	324.5	0.710	0.698	101.7
Michael Pineda	446	324	319.5	0.726	0.716	101.4
Jon Lester	491	347	342.5	0.707	0.697	101.3
Jake Peavy	338	230	227.3	0.680	0.673	101.2
Mike Leake	454	319	315.1	0.703	0.694	101.2
Dustin Moseley	376	264	261.0	0.702	0.694	101.1
Kyle McClellan	457	323	319.5	0.707	0.699	101.1
Bud Norris	468	324	320.9	0.692	0.686	101.0
Shaun Marcum	532	381	377.6	0.716	0.710	100.9
C.J. Wilson	609	422	418.4	0.693	0.687	100.9
Bruce Chen	482	341	338.4	0.707	0.702	100.8
Scott Baker	375	261	258.9	0.696	0.691	100.8
Brandon McCarthy	531	365	362.5	0.687	0.683	100.7
Edwin Jackson	613	402	399.5	0.656	0.652	100.6
Livan Hernandez	563	382	379.7	0.679	0.674	100.6
Colby Lewis	571	414	411.4	0.725	0.720	100.6
Justin Masterson	643	436	433.7	0.678	0.675	100.5
A.J. Burnett	541	370	368.7	0.684	0.682	100.4
Brett Cecil	376	272	270.9	0.723	0.720	100.4
Tim Wakefield	482	343	341.6	0.712	0.709	100.4
Wade Davis	596	422	420.4	0.708	0.705	100.4
Brandon Morrow	471	320	318.6	0.679	0.677	100.4
Hiroki Kuroda	566	399	397.2	0.705	0.702	100.4
Carl Pavano	753	509	507.0	0.676	0.673	100.4
Anthony Swarzak	346	238	237.4	0.688	0.686	100.3
Chris Volstad	512	344	343.1	0.672	0.670	100.3
Erik Bedard	353	243	242.4	0.688	0.687	100.3
Jake Westbrook	574	387	385.9	0.674	0.672	100.3
Jason Marquis	405	274	274.4	0.677	0.677	99.9
Jordan Zimmermann	466	325	325.2	0.697	0.698	99.9
Jake Arrieta	330	234	234.3	0.709	0.710	99.9
Ryan Dempster	564	376	376.4	0.667	0.667	99.9
Clayton Kershaw	540	383	383.9	0.709	0.711	99.8
Gio Gonzalez	532	369	370.2	0.694	0.696	99.7
Matt Cain	303	217	217.6	0.716	0.718	99.7
Jason Vargas	640	445	446.6	0.695	0.698	99.6
Joel Pineiro	513	340	341.4	0.663	0.666	99.6
Tyler Chatwood	468	312	313.4	0.667	0.670	99.5
Zachary Britton	475	324	325.5	0.682	0.685	99.5
Chad Billingsley	513	350	352.2	0.682	0.687	99.4
Brett Myers	604	414	416.3	0.685	0.689	99.4
Derek Holland	586	394	396.7	0.672	0.677	99.3
Mark Buehrle	681	472	475.2	0.693	0.698	99.3
Felix Hernandez	649	440	443.8	0.678	0.684	99.1
R.A. Dickey	611	429	433.1	0.702	0.709	99.1
Francisco Liriano	383	267	269.8	0.697	0.704	99.0
Trevor Cahill	634	428	433.3	0.675	0.683	98.8
Roy Halladay	650	449	454.9	0.691	0.700	98.7
Matt Harrison	574	393	398.4	0.685	0.694	98.6
Nick Blackburn	497	324	328.9	0.652	0.662	98.5
Zack Greinke	425	279	283.4	0.656	0.667	98.4
Bartolo Colon	495	337	342.7	0.681	0.692	98.3
Ubaldo Jimenez	525	348	354.5	0.663	0.675	98.2
Yovani Gallardo	554	382	389.2	0.690	0.703	98.2
Brad Penny	623	420	428.0	0.674	0.687	98.1
Mike Pelfrey	624	425	433.7	0.681	0.695	98.0
Chris Capuano	517	346	353.4	0.669	0.684	97.9
CC Sabathia	670	443	454.1	0.661	0.678	97.6
Anibal Sanchez	500	332	340.3	0.664	0.681	97.6
Max Scherzer	551	373	382.5	0.677	0.694	97.5
Jaime Garcia	570	377	387.2	0.661	0.679	97.4
Brad Bergesen	332	225	231.4	0.678	0.697	97.2
John Danks	521	352	362.6	0.676	0.696	97.1
Madison Bumgarner	300	203	209.1	0.677	0.697	97.1
Daniel Hudson	595	402	415.0	0.676	0.697	96.9
Mat Latos	481	335	345.9	0.696	0.719	96.9
Clayton Richard	314	211	217.7	0.672	0.693	96.9
Jo-Jo Reyes	479	317	327.6	0.662	0.684	96.8
Matt Garza	523	347	358.3	0.663	0.685	96.8
Ricky Nolasco	634	414	427.5	0.653	0.674	96.8
Aaron Cook	343	223	230.3	0.650	0.671	96.8
Brian Duensing	502	330	341.8	0.657	0.681	96.6
Rick Porcello	587	388	401.7	0.661	0.684	96.6
John Lackey	540	351	363.9	0.650	0.674	96.5
Derek Lowe	607	397	411.2	0.654	0.677	96.5
Alfredo Simon	357	238	246.7	0.667	0.691	96.5
Aaron Harang	477	334	346.3	0.700	0.726	96.5
Brandon Beachy	342	231	239.7	0.675	0.701	96.4
Jeff Francis	649	433	449.6	0.667	0.693	96.3
Phil Coke	356	233	242.2	0.654	0.680	96.2
Jonathon Niese	494	320	333.1	0.648	0.674	96.1
Danny Duffy	316	209	218.7	0.661	0.692	95.6
Roy Oswalt	430	294	310.6	0.684	0.722	94.7
Felipe Paulino	380	248	263.1	0.653	0.692	94.3

Notice that Justin Verlander and Josh Beckett both producEd High expected DERs, but the defense magnified that by turning so many more batted balls into outs. Verlander pitched very well, but his defense gave him that extra push into the Cy Young.

At the other end of the spectrum, the Phillies defense hurt Roy Oswalt. His balls in play were relatively easy to field, but the Philadelphia defense let Roy down. A team with a good defense might do well to lure Oswalt to their mound.

Those predicted DER numbers should not be taken as an absolute, however. Parks play a big factor in those. We would expect them to be lower in Coors, for example, than in PETCO. So the index represents how much the defense helped or hurt the pitcher, but the predicted DERs should not be used by themselves to rate the hurler.

In objective PMR, where we don’t measure the direction or the speed of the ball off the bat, it could be that someone like Justin Verlander was helping his defense, instead of the other way around. When batters have trouble making contact against a pitcher, they probably have trouble making good contact as well. It’s possible that the balls in play against Justin were simply easier to field.

If you like the objective PMR series, please consider donating to the Baseball Musings Pledge Drive.

March 7, 2012

Objective PMR, Pitchers

2 Replies

The series on objective probabilistic model of range (PMR) continues by looking at pitchers. Like catchers, range is a small component of a pitcher’s defensive value. I’ll show teams as a whole at the position, plus individuals who were on the field for 300 balls in play. First the teams:

Objective PMR, team pitchers, 2011. Model built on data from 2005-2010, visiting teams only.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
SDN	3083	217	173.1	0.070	0.056	125.4
CLE	3389	200	166.5	0.059	0.049	120.1
LAN	2666	195	163.9	0.073	0.061	118.9
ARI	2960	183	158.3	0.062	0.053	115.6
COL	3053	213	187.4	0.070	0.061	113.7
NYN	3089	224	199.9	0.073	0.065	112.1
BAL	3180	189	169.7	0.059	0.053	111.4
ATL	2960	214	195.1	0.072	0.066	109.7
TOR	3106	173	160.9	0.056	0.052	107.5
MIL	2938	188	175.8	0.064	0.060	107.0
KCA	3241	192	181.6	0.059	0.056	105.7
OAK	3163	167	159.8	0.053	0.051	104.5
TBA	3010	153	146.6	0.051	0.049	104.3
SFN	1447	91	87.9	0.063	0.061	103.5
CHA	3292	190	184.2	0.058	0.056	103.2
SEA	3408	170	165.3	0.050	0.049	102.8
FLO	3114	179	178.4	0.057	0.057	100.3
ANA	3147	166	168.6	0.053	0.054	98.5
CHN	3002	167	172.2	0.056	0.057	97.0
SLN	3192	178	186.4	0.056	0.058	95.5
PIT	1587	90	94.6	0.057	0.060	95.2
BOS	2956	134	141.0	0.045	0.048	95.1
DET	3346	149	157.7	0.045	0.047	94.5
PHI	3053	169	179.0	0.055	0.059	94.4
TEX	3166	153	162.6	0.048	0.051	94.1
CIN	3046	160	180.7	0.053	0.059	88.5
HOU	2882	164	186.5	0.057	0.065	88.0
NYA	3251	163	186.4	0.050	0.057	87.5
MIN	2983	166	190.7	0.056	0.064	87.0
WAS	3085	172	200.5	0.056	0.065	85.8

It’s tough to get a ball through the Padres mound.

The individuals:

Objective PMR, individual pitchers, 2011. Model built on data from 2005-2010, visiting teams only. 300 balls in play, minimum.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
R.A. Dickey	452	52	29.5	0.115	0.065	176.1
Tim Stauffer	453	45	25.8	0.099	0.057	174.6
Jeremy Guthrie	446	35	21.6	0.078	0.048	162.2
Ryan Dempster	439	38	24.1	0.087	0.055	157.9
Josh Tomlin	382	26	17.0	0.068	0.045	152.6
Hiroki Kuroda	436	38	26.3	0.087	0.060	144.7
Doug Fister	566	37	25.9	0.065	0.046	142.8
Livan Hernandez	407	35	24.9	0.086	0.061	140.3
Jake Westbrook	476	41	29.2	0.086	0.061	140.2
Justin Verlander	495	30	21.4	0.061	0.043	140.0
Jair Jurrjens	300	27	19.4	0.090	0.065	139.3
Jhoulys Chacin	392	37	26.9	0.094	0.069	137.7
Dustin Moseley	313	24	17.5	0.077	0.056	137.0
Jeff Niemann	317	22	16.1	0.069	0.051	136.8
Clayton Kershaw	355	30	22.0	0.085	0.062	136.5
Mat Latos	375	26	19.3	0.069	0.052	134.5
Jonathon Niese	375	36	27.8	0.096	0.074	129.6
Bruce Chen	322	24	18.5	0.075	0.058	129.5
Josh Collmenter	313	18	14.2	0.058	0.045	126.5
Matt Harrison	418	31	25.0	0.074	0.060	124.0
Randy Wolf	456	30	24.3	0.066	0.053	123.3
Dan Haren	512	32	26.0	0.063	0.051	123.2
Bronson Arroyo	476	32	26.0	0.067	0.055	122.8
Joe Saunders	474	35	28.7	0.074	0.060	122.1
Ricky Romero	432	35	28.7	0.081	0.066	122.0
Michael Pineda	352	18	14.8	0.051	0.042	121.6
Shaun Marcum	378	25	20.8	0.066	0.055	120.4
Jason Hammel	408	28	23.6	0.069	0.058	118.6
Carl Pavano	534	37	31.3	0.069	0.059	118.4
Ted Lilly	317	22	18.7	0.069	0.059	117.6
Ian Kennedy	405	23	19.6	0.057	0.048	117.3
Javier Vazquez	387	22	19.0	0.057	0.049	115.9
Bud Norris	341	24	20.9	0.070	0.061	114.8
Justin Masterson	522	31	27.2	0.059	0.052	114.1
Yovani Gallardo	411	28	25.1	0.068	0.061	111.8
Josh Beckett	363	19	17.1	0.052	0.047	111.2
Edwin Jackson	492	27	24.3	0.055	0.049	111.1
Mark Buehrle	503	35	31.6	0.070	0.063	110.6
Anibal Sanchez	387	24	21.7	0.062	0.056	110.4
Ubaldo Jimenez	400	25	22.8	0.063	0.057	109.6
Freddy Garcia	364	19	17.4	0.052	0.048	109.4
Felix Hernandez	556	30	27.4	0.054	0.049	109.3
Kyle Lohse	380	22	20.3	0.058	0.053	108.5
James Shields	489	27	24.9	0.055	0.051	108.3
Brandon McCarthy	424	22	20.3	0.052	0.048	108.3
Tim Wakefield	330	17	15.7	0.052	0.048	108.0
David Price	425	23	21.6	0.054	0.051	106.6
Chris Narveson	348	22	20.7	0.063	0.059	106.3
Gio Gonzalez	392	22	21.0	0.056	0.053	104.9
Nick Blackburn	380	23	22.0	0.061	0.058	104.7
Zachary Britton	360	23	22.2	0.064	0.062	103.6
Jeff Francis	468	31	30.0	0.066	0.064	103.4
Tyler Chatwood	330	19	18.4	0.058	0.056	103.1
Brad Penny	519	25	24.4	0.048	0.047	102.5
Fausto Carmona	497	27	26.5	0.054	0.053	101.8
Chris Capuano	358	25	24.7	0.070	0.069	101.0
Jason Vargas	448	22	21.9	0.049	0.049	100.6
Zack Greinke	329	20	20.0	0.061	0.061	99.9
Derek Holland	438	25	25.1	0.057	0.057	99.6
Chad Billingsley	378	24	24.1	0.063	0.064	99.4
Dillon Gee	346	21	21.2	0.061	0.061	99.2
John Lackey	405	17	17.2	0.042	0.042	98.8
John Danks	394	23	23.4	0.058	0.059	98.5
Tim Hudson	485	34	34.9	0.070	0.072	97.3
Trevor Cahill	532	28	28.8	0.053	0.054	97.2
Brandon Morrow	342	14	14.4	0.041	0.042	97.1
Jered Weaver	422	18	18.7	0.043	0.044	96.1
Derek Lowe	500	33	34.3	0.066	0.069	96.1
Chris Carpenter	531	27	28.3	0.051	0.053	95.5
Roy Halladay	525	29	30.6	0.055	0.058	94.9
Daniel Hudson	434	21	22.1	0.048	0.051	94.9
Luke Hochevar	476	25	26.4	0.053	0.055	94.7
Cole Hamels	418	25	27.0	0.060	0.065	92.5
Bartolo Colon	390	19	20.6	0.049	0.053	92.3
Mike Pelfrey	469	26	28.2	0.055	0.060	92.2
Wade Davis	413	17	18.9	0.041	0.046	90.0
C.J. Wilson	446	23	25.9	0.052	0.058	88.8
Roy Oswalt	327	17	19.2	0.052	0.059	88.4
Jeremy Hellickson	395	15	17.0	0.038	0.043	88.4
Johnny Cueto	335	19	21.5	0.057	0.064	88.4
Jo-Jo Reyes	321	16	18.1	0.050	0.056	88.3
John Lannan	457	28	31.9	0.061	0.070	87.8
Gavin Floyd	437	20	22.9	0.046	0.052	87.5
Cliff Lee	440	23	26.7	0.052	0.061	86.2
Ricky Nolasco	476	24	27.9	0.050	0.059	85.9
Ervin Santana	462	21	24.6	0.045	0.053	85.4
Aaron Harang	374	16	19.1	0.043	0.051	83.9
Jordan Zimmermann	314	16	19.1	0.051	0.061	83.6
Ivan Nova	438	22	27.2	0.050	0.062	80.7
Jon Lester	352	16	20.0	0.045	0.057	80.1
Rick Porcello	501	20	25.0	0.040	0.050	80.0
Chris Volstad	412	20	25.1	0.049	0.061	79.8
Joel Pineiro	384	16	20.3	0.042	0.053	78.9
Mike Leake	346	17	21.9	0.049	0.063	77.8
Matt Garza	418	18	23.3	0.043	0.056	77.2
Kyle McClellan	343	16	20.8	0.047	0.061	77.0
Philip Humber	383	16	20.9	0.042	0.055	76.4
Wandy Rodriguez	356	18	25.2	0.051	0.071	71.4
Alexi Ogando	414	12	17.2	0.029	0.042	69.7
A.J. Burnett	449	18	26.6	0.040	0.059	67.7
Colby Lewis	423	12	18.3	0.028	0.043	65.5
Brett Myers	445	19	29.0	0.043	0.065	65.5
CC Sabathia	430	20	30.7	0.047	0.071	65.2
Brian Duensing	345	16	26.6	0.046	0.077	60.1
Carlos Zambrano	331	10	17.9	0.030	0.054	55.9
Jaime Garcia	452	16	30.6	0.035	0.068	52.3
Jason Marquis	318	11	21.3	0.035	0.067	51.5
Max Scherzer	448	9	19.1	0.020	0.043	47.0

Like catchers, the small number of balls that are fielded by pitchers means you should take these numbers with a big grain of salt. It is interesting to note that with all his other problems, A.J. Burnett did little with his glove to help himself. On the other hand, few on the Yankees staff shone on defense.

Their cross town rivals, the Mets, did very well. I wonder if R.A.Dickey’s knuckle ball induces a lot of easy comebackers?

March 6, 2012

Objective PMR, Catchers

Leave a reply

The series on objective probabilistic model of range (PMR) continues by looking at catchers. Compared to the players standing behind the pitcher, range is a small component of a catcher’s defensive value. I’ll show teams as a whole at the position, plus individuals who were on the field for 750 balls in play. First the teams:

Objective PMR, team catchers, 2011. Model built on data from 2005-2010, visiting teams only.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
BAL	2754	59	45.8	0.021	0.017	128.8
TBA	3195	53	44.2	0.017	0.014	120.0
SEA	3171	55	46.2	0.017	0.015	119.2
PHI	3274	68	57.9	0.021	0.018	117.4
ANA	2794	49	42.5	0.018	0.015	115.3
CLE	2935	49	42.9	0.017	0.015	114.1
TEX	2721	51	45.7	0.019	0.017	111.6
OAK	2981	55	49.8	0.018	0.017	110.4
SLN	3294	56	50.8	0.017	0.015	110.2
FLO	3326	59	54.6	0.018	0.016	108.1
TOR	2830	44	41.4	0.016	0.015	106.2
NYN	3445	62	59.4	0.018	0.017	104.3
CHN	3084	53	50.9	0.017	0.017	104.1
SFN	1589	28	27.0	0.018	0.017	103.8
KCA	2889	45	43.7	0.016	0.015	103.1
MIL	3374	52	50.7	0.015	0.015	102.5
ARI	3269	61	59.9	0.019	0.018	101.8
ATL	3342	49	48.2	0.015	0.014	101.6
LAN	3015	58	57.2	0.019	0.019	101.3
BOS	2847	44	43.9	0.015	0.015	100.3
NYA	2855	46	47.8	0.016	0.017	96.2
HOU	3148	50	53.0	0.016	0.017	94.4
SDN	2957	49	52.0	0.017	0.018	94.3
DET	2573	36	38.3	0.014	0.015	94.0
MIN	2709	43	47.2	0.016	0.017	91.1
PIT	1679	26	28.9	0.015	0.017	90.1
WAS	3303	48	53.4	0.015	0.016	89.9
CIN	3267	50	59.4	0.015	0.018	84.2
COL	3401	40	51.8	0.012	0.015	77.3
CHA	2829	34	47.3	0.012	0.017	72.0

Note that catchers are involved in a small number of plays, so in addition to range being a small part of their value, the number for them are going to be highly variable. That said, the Orioles combination of Matt Wieters and Craig Tatum did a nice job for the Orioles. The White Sox, on the other hand did not get much range from their catchers. A.J. Pierzynski and Ramon Castro were both rather old.

The individuals:

Objective PMR, individual catchers, 2011. Model built on data from 2005-2010, visiting teams only. 750 balls in play, minimum.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Rob Johnson	1025	26	18.7	0.025	0.018	139.2
Lou Marson	1365	28	20.2	0.021	0.015	138.3
Miguel Olivo	2331	46	33.3	0.020	0.014	138.2
Mike Napoli	945	21	15.4	0.022	0.016	136.8
Josh Thole	1853	44	32.4	0.024	0.017	135.9
J.P. Arencibia	2048	39	30.2	0.019	0.015	129.1
Hank Conger	845	16	12.8	0.019	0.015	125.2
Geovany Soto	2218	44	36.2	0.020	0.016	121.5
Matt Wieters	2128	41	34.5	0.019	0.016	118.9
Miguel Montero	2641	56	48.4	0.021	0.018	115.6
Carlos Ruiz	2339	47	42.3	0.020	0.018	111.2
Ramon Hernandez	1491	29	26.3	0.019	0.018	110.1
Russell Martin	2075	37	33.6	0.018	0.016	110.1
Rod Barajas	1522	32	29.2	0.021	0.019	109.6
Kurt Suzuki	2327	43	39.3	0.018	0.017	109.5
J.R. Towles	857	16	14.8	0.019	0.017	108.0
Yorvit Torrealba	1569	29	27.0	0.018	0.017	107.3
Kelly Shoppach	1384	22	20.6	0.016	0.015	106.7
Brian McCann	2450	37	34.7	0.015	0.014	106.5
John Jaso	1329	19	17.9	0.014	0.013	106.2
John Buck	2572	45	42.5	0.017	0.017	105.9
Dioner Navarro	912	18	17.2	0.020	0.019	104.9
Jeff Mathis	1341	22	21.3	0.016	0.016	103.3
Yadier Molina	2604	41	40.2	0.016	0.015	102.1
Jarrod Saltalamacchia	1734	27	26.6	0.016	0.015	101.5
Matt Treanor	1144	17	17.1	0.015	0.015	99.5
Alex Avila	2058	30	30.7	0.015	0.015	97.8
Humberto Quintero	1438	23	23.8	0.016	0.017	96.8
Jason Varitek	1069	16	16.8	0.015	0.016	95.4
David Ross	850	12	12.6	0.014	0.015	95.3
Carlos Santana	1570	21	22.7	0.013	0.014	92.5
Drew Butera	1341	21	23.3	0.016	0.017	90.2
Wilson Ramos	2216	31	35.2	0.014	0.016	88.0
Brayan Pena	1068	13	15.4	0.012	0.014	84.7
A.J. Pierzynski	1972	27	32.6	0.014	0.017	82.9
Jonathan Lucroy	2365	29	35.1	0.012	0.015	82.6
Carlos Corporan	815	11	13.8	0.013	0.017	80.0
Chris Iannetta	2208	26	33.2	0.012	0.015	78.3
Ronny Paulino	1239	15	20.3	0.012	0.016	73.9
Nick Hundley	1271	17	23.0	0.013	0.018	73.8
Koyie Hill	756	8	12.9	0.011	0.017	62.0
Ryan Hanigan	1530	16	28.3	0.010	0.018	56.6
Jose Molina	782	5	11.2	0.006	0.014	44.5

You can see that indeed, Wieters did a good job behind the plate, although Miguel Olivo comes out on top among starters. Mike Napoli’s good result shows how a small sample size can fool you, however. I don’t think most observers seem him as a great defensive catcher. Take this particular category with a grain of salt.

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at 20 Sunset Rd., Newark, DE 19711.

March 1, 2012

Objective PMR, First Basemen

1 Reply

The series on objective probabilistic model of range (PMR) continues by looking at first basemen. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team first basemen, 2011. Model built on data from 2005-2010, visiting teams only.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
NYN	3180	350	283.0	0.110	0.089	123.7
LAN	2739	289	251.6	0.106	0.092	114.9
OAK	3163	347	306.0	0.110	0.097	113.4
BOS	2977	341	302.4	0.115	0.102	112.8
ANA	3160	341	317.1	0.108	0.100	107.5
SLN	3192	300	281.5	0.094	0.088	106.6
HOU	2958	288	270.1	0.097	0.091	106.6
MIN	3124	308	289.3	0.099	0.093	106.5
CIN	3052	308	294.7	0.101	0.097	104.5
WAS	3166	307	297.1	0.097	0.094	103.3
SFN	1445	142	139.7	0.098	0.097	101.7
COL	3038	305	302.0	0.100	0.099	101.0
CLE	3338	326	325.2	0.098	0.097	100.2
FLO	3111	295	297.4	0.095	0.096	99.2
NYA	3169	297	301.4	0.094	0.095	98.5
SEA	3178	284	292.0	0.089	0.092	97.3
KCA	3210	291	300.5	0.091	0.094	96.8
TBA	2988	298	309.7	0.100	0.104	96.2
PHI	3079	288	305.0	0.094	0.099	94.4
MIL	2966	271	289.0	0.091	0.097	93.8
CHA	3266	278	298.1	0.085	0.091	93.3
CHN	3016	252	275.7	0.084	0.091	91.4
BAL	3161	279	306.4	0.088	0.097	91.1
ATL	2968	268	295.6	0.090	0.100	90.6
TOR	3194	268	297.7	0.084	0.093	90.0
SDN	2887	254	283.6	0.088	0.098	89.6
DET	3103	276	310.2	0.089	0.100	89.0
PIT	1612	128	147.0	0.079	0.091	87.1
ARI	2953	242	278.6	0.082	0.094	86.8
TEX	3038	234	273.3	0.077	0.090	85.6

We found a defense position where the Mets are good! Of course, it’s the least important position in terms of defense. Oakland’s great defense at the position hardly made up for their combined .219/.294/.316 batting line in 2011.

The individuals:

Objective PMR, individual first basemen, 2011. Model built on data from 2005-2010, visiting teams only. 1000 balls in play, minimum.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Daric Barton	1227	149	120.2	0.121	0.098	124.0
James Loney	2331	251	212.2	0.108	0.091	118.3
Adrian Gonzalez	2768	316	279.7	0.114	0.101	113.0
Mark Trumbo	2705	302	274.3	0.112	0.101	110.1
Albert Pujols	2732	263	240.0	0.096	0.088	109.6
Todd Helton	2140	226	211.7	0.106	0.099	106.7
Carlos Lee	1240	122	114.4	0.098	0.092	106.6
Joey Votto	2964	302	286.2	0.102	0.097	105.5
Brett Wallace	1634	157	149.2	0.096	0.091	105.3
Carlos Santana	1299	129	123.4	0.099	0.095	104.5
Mark Teixeira	2788	272	265.7	0.098	0.095	102.4
Gaby Sanchez	2899	275	277.3	0.095	0.096	99.2
Eric Hosmer	2482	227	234.3	0.091	0.094	96.9
Justin Smoak	1970	178	183.9	0.090	0.093	96.8
Matt LaPorta	1880	179	185.6	0.095	0.099	96.4
Mike Morse	1593	145	150.4	0.091	0.094	96.4
Ryan Howard	2724	262	274.6	0.096	0.101	95.4
Prince Fielder	2850	263	278.6	0.092	0.098	94.4
Paul Konerko	2221	185	201.0	0.083	0.090	92.1
Carlos Pena	2666	224	243.4	0.084	0.091	92.0
Derrek Lee	1789	166	180.4	0.093	0.101	92.0
Mitch Moreland	1715	142	156.4	0.083	0.091	90.8
Justin Morneau	1074	94	104.6	0.088	0.097	89.9
Casey Kotchman	2492	233	259.5	0.093	0.104	89.8
Adam Lind	2115	178	198.2	0.084	0.094	89.8
Freddie Freeman	2752	243	272.2	0.088	0.099	89.3
Lyle Overbay	1191	93	109.0	0.078	0.092	85.3
Miguel Cabrera	2864	242	286.4	0.084	0.100	84.5

It looks like Prince Fielder will be an improvement over Miguel Cabrera at first base for the Tigers, but no range at first may be terrible range at third base. Albert Pujols may have lost a step, as he is usually head and shoulders above the other first basemen defensively. He remains very good, however, especially given his bat. Adrian Gonzalez may have moved ahead of Albert in terms of total package, offense and defense, among first basemen.

The Baseball Musings Pledge Drive continues through March. Please make a donation!

February 29, 2012

Objective PMR, Leftfielders

7 Replies

The series on objective probabilistic model of range (PMR) continues by looking at leftfielders. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team leftfielders, 2011. Model built on data from 2005-2010, visiting teams only.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
NYA	2947	363	297.9	0.123	0.101	121.8
ARI	2870	335	304.0	0.117	0.106	110.2
TBA	3455	358	325.0	0.104	0.094	110.1
KCA	3740	353	323.7	0.094	0.087	109.1
LAN	2766	272	253.8	0.098	0.092	107.2
WAS	2546	305	288.1	0.120	0.113	105.9
SEA	3586	350	331.2	0.098	0.092	105.7
ATL	2983	285	270.5	0.096	0.091	105.4
DET	3106	349	332.2	0.112	0.107	105.0
SFN	1492	150	143.3	0.101	0.096	104.7
BOS	3106	313	302.9	0.101	0.098	103.3
HOU	3304	259	252.8	0.078	0.076	102.5
MIL	3285	263	257.8	0.080	0.078	102.0
ANA	3816	350	343.5	0.092	0.090	101.9
CIN	3424	290	285.0	0.085	0.083	101.8
TEX	3116	309	305.7	0.099	0.098	101.1
MIN	2807	330	329.1	0.118	0.117	100.3
BAL	3638	353	356.7	0.097	0.098	99.0
CHA	3520	310	317.1	0.088	0.090	97.8
CLE	3817	319	329.0	0.084	0.086	97.0
SDN	2880	273	282.5	0.095	0.098	96.6
PIT	1662	128	133.5	0.077	0.080	95.9
FLO	3273	282	294.7	0.086	0.090	95.7
NYN	2664	276	290.0	0.104	0.109	95.2
TOR	3572	297	313.1	0.083	0.088	94.9
CHN	2998	247	262.5	0.082	0.088	94.1
PHI	2507	242	257.1	0.097	0.103	94.1
OAK	3261	285	310.9	0.087	0.095	91.7
SLN	3294	262	287.6	0.080	0.087	91.1
COL	3336	220	263.5	0.066	0.079	83.5

I can’t say I’m surprised to see the Yankees and Rays near the top. Brett Gardner is known for his defense, and Sam Fuld made some amazing plays for the Rays before Desmond Jennings moved into the lineup.

The individuals:

Objective PMR, individual leftfielders, 2011. Model built on data from 2005-2010, visiting teams only. 1000 balls in play, minimum.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Brett Gardner	2387	294	237.4	0.123	0.099	123.9
Gerardo Parra	2051	244	216.6	0.119	0.106	112.7
Vernon Wells	2408	241	214.8	0.100	0.089	112.2
Sam Fuld	1505	162	146.7	0.108	0.097	110.4
Josh Hamilton	1472	153	140.7	0.104	0.096	108.7
Ryan Braun	2826	237	219.8	0.084	0.078	107.8
Alex Gordon	3419	313	292.1	0.092	0.085	107.2
Martin Prado	1656	165	158.4	0.100	0.096	104.2
Tony Gwynn	1165	110	107.4	0.094	0.092	102.5
Nolan Reimold	1524	146	143.4	0.096	0.094	101.8
Jonny Gomes	1269	112	110.8	0.088	0.087	101.1
Desmond Jennings	1000	100	98.9	0.100	0.099	101.1
Carl Crawford	2342	235	233.2	0.100	0.100	100.8
Carlos Lee	1509	113	114.6	0.075	0.076	98.6
Delmon Young	1886	208	214.2	0.110	0.114	97.1
Jason Bay	1922	204	210.3	0.106	0.109	97.0
David Murphy	1278	123	127.0	0.096	0.099	96.8
Logan Morrison	2295	197	207.9	0.086	0.091	94.7
Juan Pierre	3282	278	295.0	0.085	0.090	94.2
Michael Brantley	1396	114	121.5	0.082	0.087	93.8
Alfonso Soriano	2062	169	181.2	0.082	0.088	93.3
Raul Ibanez	1983	195	209.1	0.098	0.105	93.2
Eric Thames	1132	91	99.1	0.080	0.088	91.8
Josh Willingham	1862	155	170.5	0.083	0.092	90.9
Ryan Ludwick	1864	155	173.6	0.083	0.093	89.3
Matt Holliday	2310	182	204.0	0.079	0.088	89.2
Carlos Gonzalez	1094	76	87.4	0.069	0.080	86.9

Gardner and Fuld were indeed very good, and Desmond Jennings held his own. I’m impressed with Carlos Lee’s ranking. At this point, I would think he would be old and slow, but it looks like he gets to most of the balls that he should catch. Juan Pierre, Alfonso Soriano, Raul Ibanez, and Matt Holliday all lost a step.

February 29, 2012

Objective PMR, Rightfielders

Leave a reply

The series on objective probabilistic model of range (PMR) continues by looking at rightfielders. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team rightfielders, 2011. Model built on data from 2005-2010, visiting teams only.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
CLE	3675	356	298.5	0.097	0.081	119.2
CHN	3301	329	296.1	0.100	0.090	111.1
FLO	3130	321	289.3	0.103	0.092	110.9
OAK	2719	331	306.7	0.122	0.113	107.9
SDN	3411	315	293.1	0.092	0.086	107.5
TOR	3101	343	322.8	0.111	0.104	106.2
ARI	3522	322	303.1	0.091	0.086	106.2
MIN	3014	334	316.1	0.111	0.105	105.7
NYA	3035	332	319.4	0.109	0.105	104.0
SLN	3538	313	302.1	0.088	0.085	103.6
HOU	3433	307	297.6	0.089	0.087	103.2
BOS	3337	326	316.8	0.098	0.095	102.9
WAS	2956	329	320.6	0.111	0.108	102.6
CHA	3094	297	292.6	0.096	0.095	101.5
ATL	3449	308	303.6	0.089	0.088	101.4
KCA	3616	348	344.1	0.096	0.095	101.1
TEX	2773	327	329.0	0.118	0.119	99.4
MIL	3439	304	307.7	0.088	0.089	98.8
CIN	3639	306	313.2	0.084	0.086	97.7
DET	3089	304	314.8	0.098	0.102	96.6
ANA	3269	310	322.3	0.095	0.099	96.2
TBA	2890	312	324.2	0.108	0.112	96.2
COL	3521	276	288.3	0.078	0.082	95.7
LAN	3232	278	295.7	0.086	0.091	94.0
PIT	1774	144	155.4	0.081	0.088	92.7
PHI	3117	259	281.1	0.083	0.090	92.1
BAL	3662	312	339.9	0.085	0.093	91.8
SFN	1635	144	159.5	0.088	0.098	90.3
NYN	3354	280	323.1	0.083	0.096	86.7
SEA	3439	278	325.0	0.081	0.095	85.5

The Mets just keep coming up as a bad defensive team. At least that’s an area they likely can improve without a huge cost.

The individuals:

Objective PMR, individual rightfielders, 2011. Model built on data from 2005-2010, visiting teams only. 1000 balls in play, minimum.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Kosuke Fukudome	2564	263	221.9	0.103	0.087	118.5
Shin-Soo Choo	1895	172	153.6	0.091	0.081	112.0
Mike Stanton	2644	271	245.6	0.102	0.093	110.3
Chris Denorfia	1090	98	88.9	0.090	0.082	110.2
David DeJesus	1831	231	209.8	0.126	0.115	110.1
Jayson Werth	2411	277	259.4	0.115	0.108	106.8
Jason Heyward	2308	210	197.6	0.091	0.086	106.3
Justin Upton	3346	306	288.0	0.091	0.086	106.3
Nick Swisher	2516	273	259.8	0.109	0.103	105.1
Jose Bautista	2188	233	221.7	0.106	0.101	105.1
J.D. Drew	1387	142	136.0	0.102	0.098	104.4
Nelson Cruz	1667	216	207.0	0.130	0.124	104.4
Michael Cuddyer	1317	139	134.7	0.106	0.102	103.2
Will Venable	1512	133	129.7	0.088	0.086	102.5
Jeff Francoeur	3385	328	320.4	0.097	0.095	102.4
Carlos Quentin	1784	177	174.3	0.099	0.098	101.5
Corey Hart	2627	236	232.7	0.090	0.089	101.4
Ben Francisco	1007	86	86.0	0.085	0.085	100.0
Seth Smith	2037	166	167.5	0.081	0.082	99.1
Torii Hunter	2637	259	262.7	0.098	0.100	98.6
Jay Bruce	3399	285	290.3	0.084	0.085	98.2
Andre Ethier	2507	221	226.7	0.088	0.090	97.5
Hunter Pence	3083	259	271.8	0.084	0.088	95.3
Lance Berkman	2103	160	169.3	0.076	0.081	94.5
Matthew Joyce	2005	211	228.1	0.105	0.114	92.5
Nick Markakis	3544	301	328.9	0.085	0.093	91.5
Carlos Beltran	2039	169	193.4	0.083	0.095	87.4
Ichiro Suzuki	3225	263	306.8	0.082	0.095	85.7
Magglio Ordonez	1056	87	104.7	0.082	0.099	83.1

Asian players round out the extremes, with Shin-Soo Choo and Kosuke Fukudome at the top, and Ichiro at the bottom. This may be another example of Ichiro losing a step as he ages. The good number posted by Jayson Werth may mean he’s not so bad if and when the Nationals stick him in centerfield. Amazingly, Carlos Beltran comes in worse than Lance Berkman. Maybe the Cardinals are moving the wrong player to first base. 🙂

February 28, 2012

Objective PMR, Third Baseman

1 Reply

The series on objective probabilistic model of range (PMR) continues by looking at third basemen. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team third baseman, 2011. Model built on data from 2005-2010, visiting teams only.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
CLE	3325	454	383.9	0.137	0.115	118.3
PIT	1615	233	202.8	0.144	0.126	114.9
TOR	3190	468	407.2	0.147	0.128	114.9
DET	3105	406	354.6	0.131	0.114	114.5
TBA	3007	427	374.2	0.142	0.124	114.1
ANA	3171	408	364.9	0.129	0.115	111.8
TEX	3038	396	377.9	0.130	0.124	104.8
NYA	3174	379	368.5	0.119	0.116	102.8
CIN	3052	376	367.0	0.123	0.120	102.4
KCA	3202	393	385.4	0.123	0.120	102.0
ARI	2953	398	390.8	0.135	0.132	101.8
BOS	2976	391	385.4	0.131	0.129	101.5
SFN	1447	178	176.8	0.123	0.122	100.7
WAS	3205	383	380.4	0.120	0.119	100.7
SDN	2888	343	352.4	0.119	0.122	97.3
MIL	2959	339	348.9	0.115	0.118	97.2
PHI	3085	399	415.7	0.129	0.135	96.0
LAN	2733	323	337.4	0.118	0.123	95.7
SEA	3326	373	389.8	0.112	0.117	95.7
COL	3037	335	352.8	0.110	0.116	95.0
ATL	2974	343	362.3	0.115	0.122	94.7
SLN	3193	356	391.0	0.111	0.122	91.0
BAL	3162	324	363.2	0.102	0.115	89.2
CHA	3380	369	414.6	0.109	0.123	89.0
OAK	3169	341	387.6	0.108	0.122	88.0
MIN	2963	343	396.4	0.116	0.134	86.5
NYN	3214	354	410.8	0.110	0.128	86.2
FLO	3118	313	368.9	0.100	0.118	84.8
CHN	3021	297	358.8	0.098	0.119	82.8
HOU	2959	305	369.2	0.103	0.125	82.6

I was somewhat worried about Miami having poor left-side defense, but they were already poor last year. Also, Chone Figgins‘s return to third base did not appear to help the Mariner’s defense.

The individuals:

Objective PMR, individual third baseman, 2011. Model built on data from 2005-2010, visiting teams only. 1000 balls in play, minimum.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Jack Hannahan	1900	267	212.3	0.141	0.112	125.8
Brandon Inge	1562	219	179.8	0.140	0.115	121.8
Evan Longoria	2305	330	285.1	0.143	0.124	115.7
Alberto Callaspo	2474	327	286.8	0.132	0.116	114.0
Lonnie Chisenhall	1068	150	132.1	0.140	0.124	113.6
Adrian Beltre	2023	277	254.8	0.137	0.126	108.7
Alex Rodriguez	1659	206	192.9	0.124	0.116	106.8
Mike Moustakas	1664	211	203.0	0.127	0.122	103.9
Scott Rolen	1098	133	129.2	0.121	0.118	102.9
Ryan Roberts	1858	244	241.4	0.131	0.130	101.1
Ryan Zimmerman	2065	246	244.1	0.119	0.118	100.8
Wilson Betemit	1582	186	185.6	0.118	0.117	100.2
Placido Polanco	2187	294	295.9	0.134	0.135	99.4
Kevin Youkilis	1913	241	243.3	0.126	0.127	99.1
Casey McGehee	2570	296	300.1	0.115	0.117	98.6
Kevin Kouzmanoff	1213	143	151.9	0.118	0.125	94.2
Chase Headley	1877	213	232.7	0.113	0.124	91.5
David Freese	1497	164	180.7	0.110	0.121	90.8
Chone Figgins	1536	160	180.0	0.104	0.117	88.9
Brent Morel	2456	265	300.4	0.108	0.122	88.2
Daniel Descalso	1397	151	172.4	0.108	0.123	87.6
Mark Reynolds	2144	209	239.5	0.097	0.112	87.3
Ty Wigginton	1146	111	128.3	0.097	0.112	86.5
Danny Valencia	2667	303	355.7	0.114	0.133	85.2
Chris Johnson	1869	195	233.0	0.104	0.125	83.7
Scott Sizemore	1706	176	210.2	0.103	0.123	83.7
David Wright	2102	225	269.2	0.107	0.128	83.6
Chipper Jones	2016	208	249.3	0.103	0.124	83.4
Greg Dobbs	1602	159	195.0	0.099	0.122	81.5
Aramis Ramirez	2590	245	309.3	0.095	0.119	79.2

Jack Hannahan isn’t much of an offensive player, and when that’s the case you better be a wizard with the glove. He fit that bill in 2011. With Inge and Longoria ranked second and third, and I’m very happy with the results this produced. I am somewhat surprised Aramis Ramirez ranked so low, although he is getting up in age. Fangraphs ranks his fielding low, however, so maybe it’s not that surprising.

February 28, 2012

Objective PMR, Centerfielders

4 Replies

The series on objective probabilistic model of range (PMR) continues by looking at centerfielders. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
PIT	1477	209	184.0	0.142	0.125	113.6
CHA	3206	424	389.9	0.132	0.122	108.7
MIN	2968	459	422.9	0.155	0.142	108.5
MIL	2760	412	381.4	0.149	0.138	108.0
SLN	2792	391	361.9	0.140	0.130	108.0
HOU	2696	388	359.5	0.144	0.133	107.9
COL	2300	388	362.6	0.169	0.158	107.0
PHI	2610	366	343.3	0.140	0.132	106.6
CHN	2882	362	344.6	0.126	0.120	105.1
NYN	2677	389	376.3	0.145	0.141	103.4
SEA	2513	434	421.7	0.173	0.168	102.9
SDN	3021	390	383.4	0.129	0.127	101.7
CIN	2799	377	373.4	0.135	0.133	101.0
ATL	2926	369	367.3	0.126	0.126	100.5
OAK	2361	395	393.6	0.167	0.167	100.4
FLO	3232	384	384.7	0.119	0.119	99.8
DET	3376	422	425.6	0.125	0.126	99.2
ANA	3505	416	420.5	0.119	0.120	98.9
TBA	3204	420	426.9	0.131	0.133	98.4
SFN	1357	195	199.4	0.144	0.147	97.8
BOS	3338	407	420.0	0.122	0.126	96.9
WAS	2771	396	410.4	0.143	0.148	96.5
CLE	3701	390	404.4	0.105	0.109	96.4
TEX	2714	398	415.4	0.147	0.153	95.8
NYA	2624	387	411.3	0.147	0.157	94.1
LAN	2784	337	360.0	0.121	0.129	93.6
BAL	2996	421	452.3	0.141	0.151	93.1
TOR	3064	392	432.2	0.128	0.141	90.7
ARI	3067	382	423.4	0.125	0.138	90.2
KCA	3188	373	426.3	0.117	0.134	87.5

The individuals:

Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Andrew McCutchen	1325	198	167.0	0.149	0.126	118.6
Jon Jay	1033	153	130.9	0.148	0.127	116.9
Denard Span	1251	206	176.5	0.165	0.141	116.7
Jordan Schafer	1225	181	158.4	0.148	0.129	114.2
Carlos Gomez	1119	179	157.2	0.160	0.141	113.8
Dexter Fowler	1656	283	258.0	0.171	0.156	109.7
Nyjer Morgan	1354	202	186.3	0.149	0.138	108.5
Shane Victorino	1998	284	262.7	0.142	0.131	108.1
Marlon Byrd	1884	248	234.0	0.132	0.124	106.0
Ben Revere	1563	237	224.4	0.152	0.144	105.6
Alex Rios	2712	349	332.5	0.129	0.123	105.0
Cameron Maybin	2386	318	303.5	0.133	0.127	104.8
Coco Crisp	1800	311	300.1	0.173	0.167	103.6
Franklin Gutierrez	1310	237	228.9	0.181	0.175	103.5
Angel Pagan	1965	278	269.4	0.141	0.137	103.2
Drew Stubbs	2540	345	339.4	0.136	0.134	101.7
Michael Bourn	2677	354	349.3	0.132	0.130	101.3
Austin Jackson	2915	372	367.6	0.128	0.126	101.2
Rick Ankiel	1449	223	221.9	0.154	0.153	100.5
Chris Coghlan	1273	145	145.7	0.114	0.114	99.5
Colby Rasmus	2124	290	295.8	0.137	0.139	98.0
B.J. Upton	2911	382	391.2	0.131	0.134	97.6
Jacoby Ellsbury	3101	377	386.9	0.122	0.125	97.4
Ezequiel Carrera	1091	112	115.3	0.103	0.106	97.2
Michael Brantley	1108	118	122.0	0.106	0.110	96.7
Peter Bourjos	3054	351	363.3	0.115	0.119	96.6
Grady Sizemore	1138	119	125.0	0.105	0.110	95.2
Curtis Granderson	2457	354	383.5	0.144	0.156	92.3
Adam Jones	2693	377	408.7	0.140	0.152	92.3
Matt Kemp	2680	320	346.9	0.119	0.129	92.2
Chris Young	2939	367	405.8	0.125	0.138	90.4
Rajai Davis	1390	174	194.7	0.125	0.140	89.4
Melky Cabrera	2759	316	371.9	0.115	0.135	85.0

The thing that surprises me the most is Peter Bourjos’s ranking. He’s supposed to be really fast, but it could be his judgement hasn’t developed yet. FanGraphs does rate him much higher, however.

The Giants are going to be in big trouble if it turns out that Melky Cabrera’s hitting was a one-year wonder. They’ll end up with a centerfielder who can neither hit nor field.

February 27, 2012

Objective PMR, Second Basemen

5 Replies

The series on objective probabilistic model of range (PMR) continues by looking at second basemen. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team second baseman, 2011. Model built on data from 2005-2010, visiting teams only.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
FLO	3122	589	493.3	0.189	0.158	119.4
PHI	3099	528	471.3	0.170	0.152	112.0
NYA	3208	539	490.6	0.168	0.153	109.9
SLN	3208	510	466.2	0.159	0.145	109.4
WAS	3219	503	471.9	0.156	0.147	106.6
CHA	3282	546	521.5	0.166	0.159	104.7
TEX	3048	512	490.4	0.168	0.161	104.4
ARI	2955	449	429.9	0.152	0.145	104.4
CHN	3024	480	460.7	0.159	0.152	104.2
BOS	3018	496	483.3	0.164	0.160	102.6
SEA	3189	514	502.0	0.161	0.157	102.4
NYN	3223	503	492.8	0.156	0.153	102.1
TOR	3213	509	502.3	0.158	0.156	101.3
OAK	3168	519	512.8	0.164	0.162	101.2
HOU	2964	437	432.4	0.147	0.146	101.1
TBA	3018	494	490.7	0.164	0.163	100.7
MIL	2969	440	439.0	0.148	0.148	100.2
COL	3044	468	468.2	0.154	0.154	100.0
CIN	3064	463	464.1	0.151	0.151	99.8
MIN	3249	532	536.1	0.164	0.165	99.2
ATL	3035	463	467.7	0.153	0.154	99.0
LAN	2748	422	432.0	0.154	0.157	97.7
ANA	3196	527	540.1	0.165	0.169	97.6
KCA	3224	507	524.8	0.157	0.163	96.6
CLE	3366	519	538.9	0.154	0.160	96.3
DET	3132	486	510.9	0.155	0.163	95.1
BAL	3416	492	521.3	0.144	0.153	94.4
SDN	2890	438	467.2	0.152	0.162	93.8
SFN	1457	211	226.5	0.145	0.155	93.2
PIT	1622	209	238.3	0.129	0.147	87.7

While the Yankees and Marlins didn’t produce much range at shortstop, they made up for it with great range at second base. Philadelphia remains high despite Chase Utley‘s injury. Employing veteran second basemen didn’t help the Giants and Padres much.

The individuals:

Objective PMR, individual second basemen, 2011. Model built on data from 2005-2010, visiting teams only.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Omar Infante	2730	526	433.5	0.193	0.159	121.3
Chase Utley	1778	313	266.3	0.176	0.150	117.5
Skip Schumaker	1608	262	229.5	0.163	0.143	114.2
Robinson Cano	2954	495	450.5	0.168	0.153	109.9
Darwin Barney	2265	378	344.2	0.167	0.152	109.8
Ian Kinsler	2694	466	430.3	0.173	0.160	108.3
Danny Espinosa	3096	489	454.9	0.158	0.147	107.5
Gordon Beckham	2918	494	463.1	0.169	0.159	106.7
Mark Ellis	2313	395	371.5	0.171	0.161	106.3
Dustin Pedroia	2874	476	458.8	0.166	0.160	103.7
Jeff Keppinger	1055	165	159.7	0.156	0.151	103.3
Jemile Weeks	1857	303	294.3	0.163	0.158	103.0
Alexi Casilla	1003	176	172.1	0.175	0.172	102.2
Aaron Hill	2588	410	401.9	0.158	0.155	102.0
Justin Turner	1461	221	218.2	0.151	0.149	101.3
Orlando Cabrera	1582	260	257.4	0.164	0.163	101.0
Ben Zobrist	2206	359	358.2	0.163	0.162	100.2
Dan Uggla	2943	455	455.6	0.155	0.155	99.9
Chris Getz	1961	316	318.2	0.161	0.162	99.3
Dustin Ackley	1706	267	269.4	0.157	0.158	99.1
Brandon Phillips	2783	412	418.9	0.148	0.151	98.4
Howie Kendrick	1999	326	332.8	0.163	0.166	98.0
Orlando Hudson	1848	305	312.9	0.165	0.169	97.5
Rickie Weeks	2035	286	295.9	0.141	0.145	96.6
Kelly Johnson	2583	364	377.1	0.141	0.146	96.5
Robert Andino	1744	250	259.7	0.143	0.149	96.3
Jamey Carroll	1020	153	159.9	0.150	0.157	95.7
Neil Walker	1550	202	226.9	0.130	0.146	89.0

I’m very impressed that Chase held up so well despite his age and injuries. Orlando Hudson lost a lot of ground over the years. Looking at the list, there are quite a few second basemen who hit and field well, especially Utley, Cano, Kinsler, and Pedroia. We’ll see how far up the list Jemile Weeks and Dustin Ackley move playing a full season.

February 27, 2012

Objective PMR, Shortstops

3 Replies

The series on objective probabilistic model of range (PMR) continues by looking at shortstops. I’ll show teams as a whole at the position, plus individuals who were on the field for 1000 balls in play. First the teams:

Objective PMR, team shortstops, 2011. Model built on data from 2005-2010, visiting teams only.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
BAL	3200	585	505.6	0.183	0.158	115.7
CIN	3084	537	482.9	0.174	0.157	111.2
TEX	3057	539	491.0	0.176	0.161	109.8
ATL	3010	525	482.3	0.174	0.160	108.9
CHN	3045	519	477.8	0.170	0.157	108.6
SEA	3195	540	498.9	0.169	0.156	108.2
SFN	1479	249	230.7	0.168	0.156	107.9
ANA	3196	537	510.6	0.168	0.160	105.2
COL	3060	514	488.7	0.168	0.160	105.2
SDN	2917	484	461.5	0.166	0.158	104.9
DET	3128	501	478.3	0.160	0.153	104.7
PIT	1628	278	265.8	0.171	0.163	104.6
CHA	3288	526	506.3	0.160	0.154	103.9
HOU	2984	495	477.4	0.166	0.160	103.7
TBA	3245	486	476.2	0.150	0.147	102.1
KCA	3225	539	529.0	0.167	0.164	101.9
WAS	3218	518	508.3	0.161	0.158	101.9
PHI	3262	507	498.5	0.155	0.153	101.7
ARI	3211	494	489.9	0.154	0.153	100.8
TOR	3212	505	503.8	0.157	0.157	100.2
MIL	2992	461	462.3	0.154	0.155	99.7
LAN	2776	443	449.9	0.160	0.162	98.5
SLN	3238	489	505.4	0.151	0.156	96.8
BOS	3021	469	489.2	0.155	0.162	95.9
MIN	3261	505	538.7	0.155	0.165	93.7
OAK	3185	476	515.2	0.149	0.162	92.4
FLO	3381	440	481.2	0.130	0.142	91.4
CLE	3374	469	535.2	0.139	0.159	87.6
NYN	3271	486	558.0	0.149	0.171	87.1
NYA	3190	426	499.3	0.134	0.157	85.3

As you can see, it was a bad year to be a shortstop in New York. You can also see that in the individual numbers:

Objective PMR, individual shortstops, 2011. Model built on data from 2005-2010, visiting teams only.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Paul Janish	1528	272	238.3	0.178	0.156	114.2
Brendan Ryan	2346	408	364.6	0.174	0.155	111.9
Alex Gonzalez	2667	476	427.2	0.178	0.160	111.4
J.J. Hardy	2508	433	392.1	0.173	0.156	110.4
Rafael Furcal	1479	246	226.0	0.166	0.153	108.9
Starlin Castro	2975	505	466.8	0.170	0.157	108.2
Elvis Andrus	2672	467	432.8	0.175	0.162	107.9
Edgar Renteria	1368	232	216.1	0.170	0.158	107.4
Clint Barmes	2122	361	337.4	0.170	0.159	107.0
Ronny Cedeno	1243	218	204.5	0.175	0.164	106.6
Reid Brignac	1550	246	231.5	0.159	0.149	106.3
Troy Tulowitzki	2565	429	407.2	0.167	0.159	105.3
Alexei Ramirez	3119	503	479.7	0.161	0.154	104.9
Erick Aybar	2784	461	442.6	0.166	0.159	104.2
Jimmy Rollins	2681	426	414.4	0.159	0.155	102.8
Alcides Escobar	3090	518	506.6	0.168	0.164	102.3
Jason Bartlett	2449	395	386.7	0.161	0.158	102.2
Yuniesky Betancourt	2661	421	412.9	0.158	0.155	102.0
Jhonny Peralta	2708	422	415.8	0.156	0.154	101.5
Ian Desmond	2930	464	463.7	0.158	0.158	100.1
Stephen Drew	1650	248	247.7	0.150	0.150	100.1
Yunel Escobar	2474	378	388.1	0.153	0.157	97.4
Tsuyoshi Nishioka	1134	180	185.8	0.159	0.164	96.9
Ryan Theriot	1702	259	268.8	0.152	0.158	96.3
Marco Scutaro	1920	301	314.2	0.157	0.164	95.8
Emilio Bonifacio	1338	180	190.5	0.135	0.142	94.5
Willie Bloomquist	1074	157	166.8	0.146	0.155	94.1
Cliff Pennington	2800	412	452.7	0.147	0.162	91.0
Hanley Ramirez	1726	220	244.4	0.127	0.142	90.0
Jamey Carroll	1018	142	162.8	0.139	0.160	87.2
Asdrubal Cabrera	3085	423	488.3	0.137	0.158	86.6
Jose Reyes	2471	363	421.9	0.147	0.171	86.0
Derek Jeter	2294	303	358.7	0.132	0.156	84.5

Jim Leyland praised Jhonny Peralta’s defense the other day, and this chart shows he has a point. I don’t think there are that many surprises at the top of the list. Cliff Pennington has to be a disappointment. He’s not a great hitter, so he needs to be a great fielder to be valuable. It really is amazing that no matter how you measure Jeter’s defense, he just doesn’t get to that many balls.

The thing that really stands out, however, is the left-side of the Marlins infield. Jose Reyes and Hanley Ramirez were among the worst shortstops in the majors in 2011. Now they are going to be patrolling that whole side together. Mark Buehrle might be in for an unpleasant surprise this year.

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at 20 Sunset Rd., Newark, DE 19711.

February 26, 2012

PMR Explanation

3 Replies

For those of you new to the Probabilistic Model of Range (PMR), here’s a video explanation.

February 26, 2012

Objective PMR, 2011

6 Replies

I recently downloaded the MYSQL version of the retrosheet database, and am ready to look at the objective probabilistic model of range (PMR). It’s objective since it does not include the subjective vector and hard hit parameters that reporters enter in both the STATS and BIS data. In other words, the models for objective PMR just use batter handedness, pitcher handedness, batted ball type, and park to build the model. I’m using data from 2005-2011 for visiting teams only for model. Note that this means that 2011 fielding has no influence on the models.

As with the normal PMR model, I’m measuring the defensive efficiency record (DER) of the team against the expected DER. The following table shows the results for the 30 teams.

Objective DER, Teams, 2011. Model built on data from 2005-2010.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
TBA	4157	3007	2917.7	0.723	0.702	103.1
ANA	4413	3106	3030.1	0.704	0.687	102.5
CLE	4462	3083	3024.5	0.691	0.678	101.9
CIN	4053	2867	2820.5	0.707	0.696	101.7
BOS	4167	2926	2884.9	0.702	0.692	101.4
WAS	4259	2967	2930.7	0.697	0.688	101.2
ATL	4054	2825	2792.6	0.697	0.689	101.2
TEX	4161	2923	2891.1	0.702	0.695	101.1
ARI	4075	2867	2838.0	0.704	0.696	101.0
SEA	4298	3000	2972.1	0.698	0.692	100.9
SLN	4162	2855	2833.0	0.686	0.681	100.8
TOR	4324	3002	2981.6	0.694	0.690	100.7
PHI	4016	2828	2808.9	0.704	0.699	100.7
MIL	3955	2732	2712.7	0.691	0.686	100.7
LAN	3735	2619	2601.6	0.701	0.697	100.7
FLO	4140	2862	2842.5	0.691	0.687	100.7
SDN	3921	2764	2748.8	0.705	0.701	100.6
NYA	4277	2936	2922.6	0.686	0.683	100.5
DET	4232	2932	2922.7	0.693	0.691	100.3
CHA	4337	2978	2971.6	0.687	0.685	100.2
CHN	3985	2706	2699.4	0.679	0.677	100.2
SFN	1992	1389	1390.7	0.697	0.698	99.9
HOU	3966	2694	2698.4	0.679	0.680	99.8
PIT	2118	1447	1450.1	0.683	0.685	99.8
COL	4022	2760	2765.1	0.686	0.688	99.8
KCA	4426	3045	3059.1	0.688	0.691	99.5
MIN	4491	3042	3066.5	0.677	0.683	99.2
OAK	4224	2917	2942.4	0.691	0.697	99.1
BAL	4416	3018	3060.8	0.683	0.693	98.6
NYN	4286	2926	2993.4	0.683	0.698	97.7

If you look at the ranking of the teams using UZR/150, there is some agreement between the two systems. Four of the top five teams in objective PMR also ranke in the top five in UZR/150. There is a glaring difference, however, and that’s the Cleveland Indians. UZR ranks the Indians as the 2nd poorest fielding team in the majors, while objective PMR ranks them second. Where are the differences?

UZR/150 rankings versus objective PMR rankings for Indians positions, 2011
Position	UZR/150 Rank	Objective PMR Rank
Pitcher	N/A	2
Catcher	N/A	6
First base	28	13
Second base	30	25
Third base	5	1
Shortstop	28	28
Leftfield	21	20
Centerfield	27	23
Rightfield	12	1

As you can see, UZR does not rank ranges for pitchers and catchers, and PMR rated both those positions well for the Indians. Apart from that, most positions are in decent agreement. Both systems rank the shortstops poorly and the third basemen well. The two biggest discrepancies occur at first base and rightfield. I’m not arguing that one is right and one is wrong. It could be that a bias exists in that direction on the field. Maybe it’s the camera angle, so the balls to that side of the field look easier to field than they actually are. On the other hand, the vectors and batted ball velocities may add important information to the model that knocks the Indians down.

It could also be both those effects are in place, and the Indians real fielding value lies somewhere between the two.

The big differences make for the most interesting comparisons. I’d love to be able to dig deeper into this to see what the distribution of fly balls and ground balls is for the Indians, to see which model is doing a better job of creating a model.

I’ll continue this series looking at teams and fielders by position.

The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at 20 Sunset Rd., Newark, DE 19711.

May 23, 2011

Range and Positioning

3 Replies

I did a poor job of blogging Mitchel Litchman’s talk on Sunday, but there were some points about which I’d like to comment.

The first is that Mitchel stopped penalizing the other fielder when an out is made on a shared play. A shared play is when there is some probability that multiple fielders might catch a ball (think of a fly ball to right-center). Mitchel noted that he’s gone back and forth on this over the years, but finally decided against the penalty. His argument was that the batted ball locations we get from the human scoring systems are not precise, and the reason one fielder makes the play compared to the other is that he was closer to the ball to start.

The Probabilistic Model of Range (PMR) does give the fielder not making the out the penalty, as I believe there is information there. Back in the 1990s, STATS, Inc. published Zone Ratings, and Ken Griffey, Jr. did poorly. That surprised people, because they always saw him on ESPN making these fantastic catches at the wall. John Dewan asked me to look into this, and I found the problem was that Griffey was the opposite of a ball hog. He allowed his left and right fielders to take balls that he should have been catching. Now, it’s possible this was a good strategy; Griffey could play deep and make the catches at the wall while the LF and RF took the short flies. It made him look bad compared to other centerfielders, who tried to catch everything. Ken had the reputation when he was young of not trying on every play, however, so maybe it was Ken just didn’t want to go after balls and let his side fielders take up the slack. Since there weren’t an over abundance of balls dropping in for hits in front of him, I somewhat discount the playing deep theory.

~~PMR does not try to turn the numbers into runs, like UZR. In a system that assigns run values, I agree that the fielder not making the out should not be penalized~~. After a few minutes thinking about this, I would penalize the fielder. The run values are built into the model. So when a fielder doesn’t get to a ball, the run values for the outs are built into his model. They’re just not his outs. He doesn’t get charged with a hit, he gets charged with the run expectation for the batted ball, including outs made by other fielders. I think I was right the first time. To paraphrase Dennis Moore, this redistribution of runs in trickier than I thought.

My guess is that there’s not much difference, and that over a large enough sample all these things even out.

Update: See update at the end of the post.

(By the way, I showed there that shared plays account for a pretty small amount of balls in play.)

Mitchel’s point that who catches the ball tells us about the positioning is a fascinating argument, and he takes that a step further with errors. Most systems, like PMR, treat a ball in play as a binary outcome. Either an out was produced or not. It doesn’t matter if it was scored a hit, error, or failed fielder’s choice, the batter was either out or safe. Mitchell notes that if a fielder makes an error on a bin of parameters with a low probability for making an out, he must have been positioned near where the ball was hit. Again, the human marking of where a ball was hit likely has a high variance. If you pick a spot on the field, and look at where reporters mark the ball, you probably get a circle with that point as the center. Since errors tend to happen on balls that should be easy to field, it’s likely the ball was hit closer to the fielder than the probability model indicates. Therefore, Mitchel imposes a bigger penalty on errors.

In other words, the error is telling us that the model we’re using shifted. The problem, of course, is that it doesn’t capture hits that happen when the batter makes a similar shift and is not charged with an error. He gets charged with a low probability event, when he should be charged with a higher one. I might even argue that the fact that the fielder shifted and made the correct decision in the shift should lessen the penalty, since his process was right, even though the result was bad.

What I really take away from this, however, is that it might be possible to build a probabilistic model of positioning. For example, if you look at centerfielders against left and right handed batters, and you see him making more plays on one side of the field based on handedness or pull percentage of the fielders, you can infer that the fielder moves to one side of the field or the other. If you see the distributions look the same, however, he’s probably standing in one place and not moving much.

(PMR accounts for handedness of both the batter and the pitcher, so this should be built into the model.)

Mitchel is trying to tackle the big problem in defense, that range as we measure it is really a combination of the ability to move and the ability to position fielders. Until we get FIELDf/x, we have a very difficult time separating the two.

My one other comment on Mitchel’s talk was that With or Without You (WOWY) was showing much larger run difference between good and bad fielders than UZR. Mitchel threw out a couple of theories why, but I think it just may be a sample size problem. Since the start of the 1996 season, Jeter played 2323 games. That means he played about 94% of the Yankees games in that time. So the sample without Jeter is 150 games, and I don’t know if Mitchel’s data goes back that far. Most fielders who play enough to qualify as bad at the position must do something else good to stay in the lineup, so in general the without you component is small. Mitchel likely took that into account, but that was the first thing that crossed my mind.

Update: Here are my formal thoughts on the way runs should be charged under PMR. Let’s propose a play that looks like this to the team, a fly ball halfway between the default positions for the center and right fielder:

Out: 25%, -0.07 runs per ball in play
Single 50%, .235 runs per ball in play
Double 20%, .154 runs per ball in play
Triple 5%, .0545 runs per ball in play

So the expected runs per ball in play is .3735, or 37.35 runs per 100 balls in play. In other words, batter should really try to put the ball in play in that spot. We then add up the run value for the actual events on those balls. If the outfielders are good, the team should sum to less than 37.35 runs. If they are poor, more runs.

This is a shared vector, however, and of the 100 balls in play, 15 are caught by the centerfielder, 10 by the rightfielder. Since the centerfielder gets to 60% of the outs, I’ll assume he gets 60% of the run expectation, and rightfielder 40%. So when a ball falls in for a hit, the CF and RF split the run value for that event 60-40.

Now what happens when the ball is caught. The program could do that same thing, and split the run value of the out 60-40, or give all the run value to the fielder who caught the ball and a zero to the fielder who didn’t. If you split, you’re not penalizing the fielder for who didn’t catch the ball. If you go with the full value of the out to the fielder who catches it, you are penalizing the fielder who doesn’t catch the ball.

If the actual data is around a 60-40 split, it doesn’t matter which system you use, because the results will come out the same. It does matter, matter, however, if the CF is catching 80% of those balls. Mitchel would argue that the scoring of the location of the ball is off, or the outfielders are positioned differently, so don’t penalize the RF for something that isn’t his fault. There is the possibility, however, that the CF is making those plays because he’s a much better outfielder than the RF, in which case the penalty should be applied, and the outfielder making the catch should get full credit for the out. This is one of those known unknowns.

Maybe the right way to do this is to give the fielder making the catch a little more credit. I just feel there is information that we are throwing away if we split the credit on a catch proportionally.

March 2, 2011

Five-Year PMR, First Basemen

Leave a reply

The five year look at PMR using an objective probabilistic model of range continues with first basemen. Here is the data for the teams at first over the five year span:

Team first baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
SLN	17214	1796	1608.4	0.104	0.093	111.7
SFN	16122	1571	1421.7	0.097	0.088	110.5
CHN	15206	1539	1405.7	0.101	0.092	109.5
ANA	15603	1642	1515.5	0.105	0.097	108.3
HOU	15980	1618	1521.1	0.101	0.095	106.4
TOR	16348	1641	1554.2	0.100	0.095	105.6
SEA	16560	1537	1467.1	0.093	0.089	104.8
PIT	16826	1510	1443.8	0.090	0.086	104.6
SDN	16124	1620	1556.1	0.100	0.097	104.1
NYN	15938	1517	1463.6	0.095	0.092	103.6
CHA	16428	1455	1431.6	0.089	0.087	101.6
BOS	15700	1568	1544.1	0.100	0.098	101.5
NYA	15989	1488	1475.2	0.093	0.092	100.9
BAL	16368	1496	1482.9	0.091	0.091	100.9
FLO	15932	1492	1491.8	0.094	0.094	100.0
TBA	15733	1532	1551.2	0.097	0.099	98.8
TEX	16633	1471	1489.5	0.088	0.090	98.8
OAK	16196	1484	1510.8	0.092	0.093	98.2
KCA	16341	1447	1503.0	0.089	0.092	96.3
DET	16542	1483	1541.2	0.090	0.093	96.2
CIN	16315	1527	1587.8	0.094	0.097	96.2
LAN	16233	1422	1485.1	0.088	0.091	95.8
CLE	16856	1441	1519.1	0.085	0.090	94.9
MIN	15940	1391	1469.6	0.087	0.092	94.7
PHI	16534	1529	1618.2	0.092	0.098	94.5
MIL	16049	1434	1543.8	0.089	0.096	92.9
ARI	16358	1496	1616.5	0.091	0.099	92.5
WAS	16362	1491	1618.9	0.091	0.099	92.1
ATL	16077	1500	1630.7	0.093	0.101	92.0
COL	17060	1511	1644.1	0.089	0.096	91.9

I’m not at all surprised by St. Louis ranking number one. The years I computed the more advance PMR stats, Albert Pujols was head and shoulders above the other first basemen. The Giants have not been great offensively at the position during the time span, but they at least made up a bit for it with defense. The Yankees, with a balance between Jason Giambi and Mark Teixeira over the period come out league average. Next come the players, regulars and semi regulars:

Individual first baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Albert Pujols	15592	1659	1460.7	0.106	0.094	113.6
Kendry Morales	5075	553	502.5	0.109	0.099	110.0
Derrek Lee	11528	1169	1074.3	0.101	0.093	108.8
Lance Berkman	11643	1191	1102.7	0.102	0.095	108.0
Mark Teixeira	14352	1449	1360.2	0.101	0.095	106.5
Lyle Overbay	13371	1352	1273.3	0.101	0.095	106.2
Adrian Gonzalez	15367	1558	1483.0	0.101	0.097	105.1
Adam LaRoche	14185	1350	1283.9	0.095	0.091	105.1
Casey Kotchman	9441	947	903.3	0.100	0.096	104.8
Daric Barton	6616	632	605.3	0.096	0.091	104.4
Kevin Millar	7349	672	649.4	0.091	0.088	103.5
Kevin Youkilis	9955	1022	987.7	0.103	0.099	103.5
Nick Johnson	6023	616	597.9	0.102	0.099	103.0
Joey Votto	8267	837	814.3	0.101	0.099	102.8
Paul Konerko	12850	1134	1111.5	0.088	0.087	102.0
Richie Sexson	6934	625	614.2	0.090	0.089	101.8
Carlos Delgado	8863	835	831.7	0.094	0.094	100.4
Aubrey Huff	4968	458	464.3	0.092	0.093	98.6
Carlos Pena	10393	1022	1038.7	0.098	0.100	98.4
James Loney	11073	1008	1024.5	0.091	0.093	98.4
Ross Gload	5009	462	472.5	0.092	0.094	97.8
Justin Morneau	12564	1127	1175.0	0.090	0.094	95.9
Ryan Howard	15200	1413	1487.3	0.093	0.098	95.0
Miguel Cabrera	8644	777	823.5	0.090	0.095	94.4
Mike Jacobs	6755	565	599.1	0.084	0.089	94.3
Sean Casey	5168	431	458.2	0.083	0.089	94.1
Todd Helton	13067	1175	1271.6	0.090	0.097	92.4
Ryan Garko	7416	616	667.0	0.083	0.090	92.3
Billy Butler	6015	524	568.0	0.087	0.094	92.3
Prince Fielder	15328	1355	1473.4	0.088	0.096	92.0
Nomar Garciaparra	4041	324	362.5	0.080	0.090	89.4
Conor Jackson	5935	524	591.3	0.088	0.100	88.6
Scott Hatteberg	4560	378	433.0	0.083	0.095	87.3
Adam Dunn	4546	375	439.0	0.082	0.097	85.4
Jason Giambi	5062	380	448.5	0.075	0.089	84.7

I don’t have too much to say about this list. Albert Pujols comes out on top, which is what I expected. The people lower on the list tend to be bigger, heavier, older players. No one should be surprised to see Giambi and Dunn near the bottom. I thought Carlos Pena would rank higher and Lance Berkman lower.

I would like to point out something about Albert Pujols that shows his ranking is correct, even without a model. Albert was the first fielder to touch the ball on 1659 outs from 2006-2010. The closest an entire team came to that number was 1642 by all the Angels first basemen. When someone signs him for next season, they are getting the total package, great hitter, incredible fielder.

February 28, 2011

Five-Year PMR, Leftfielders

Leave a reply

The five year look at PMR using an objective probabilistic model of range continues with leftfielders. Here is the data for the teams in left over the five year span:

Team leftfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
CHN	15774	1536	1439.3	0.097	0.091	106.7
ATL	15636	1494	1401.5	0.096	0.090	106.6
PIT	18139	1594	1497.5	0.088	0.083	106.4
KCA	18757	1781	1679.8	0.095	0.090	106.0
NYA	15628	1586	1504.5	0.101	0.096	105.4
TBA	17771	1722	1640.0	0.097	0.092	105.0
MIL	17605	1541	1470.1	0.088	0.084	104.8
BAL	18601	1775	1703.9	0.095	0.092	104.2
ARI	15518	1619	1554.6	0.104	0.100	104.1
COL	17608	1462	1415.8	0.083	0.080	103.3
LAN	15747	1416	1379.9	0.090	0.088	102.6
SFN	15583	1566	1526.4	0.100	0.098	102.6
WAS	13948	1620	1601.8	0.116	0.115	101.1
BOS	15990	1512	1501.2	0.095	0.094	100.7
SEA	18929	1668	1659.2	0.088	0.088	100.5
DET	16254	1624	1618.3	0.100	0.100	100.4
SDN	15744	1574	1578.5	0.100	0.100	99.7
NYN	15106	1529	1544.6	0.101	0.102	99.0
ANA	18139	1617	1642.7	0.089	0.091	98.4
TEX	16842	1603	1630.1	0.095	0.097	98.3
OAK	17009	1614	1649.7	0.095	0.097	97.8
CLE	19103	1585	1622.7	0.083	0.085	97.7
TOR	17621	1372	1439.9	0.078	0.082	95.3
MIN	17584	1584	1665.9	0.090	0.095	95.1
SLN	16933	1455	1548.5	0.086	0.091	94.0
CHA	18013	1555	1663.2	0.086	0.092	93.5
CIN	18020	1512	1627.7	0.084	0.090	92.9
PHI	14144	1380	1485.7	0.098	0.105	92.9
FLO	16420	1423	1548.2	0.087	0.094	91.9
HOU	18038	1318	1489.6	0.073	0.083	88.5

From this one might expect Alfonso Soriano to be a pretty good leftfielder. He’s okay, but I guess everyone else who filled in was really good:

Individual leftfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Jay Payton	4591	492	420.2	0.107	0.092	117.1
Matt Diaz	4924	537	461.7	0.109	0.094	116.3
David DeJesus	5747	569	504.3	0.099	0.088	112.8
Johnny Damon	4957	494	456.4	0.100	0.092	108.2
Eric Byrnes	4067	445	411.5	0.109	0.101	108.1
Carl Crawford	14539	1447	1347.8	0.100	0.093	107.4
Ryan Braun	9738	859	819.1	0.088	0.084	104.9
Juan Pierre	5897	560	540.4	0.095	0.092	103.6
Jason Bay	13991	1275	1237.0	0.091	0.088	103.1
Luke Scott	4081	360	352.5	0.088	0.086	102.1
Shannon Stewart	4258	388	382.5	0.091	0.090	101.4
Alfonso Soriano	11797	1163	1148.8	0.099	0.097	101.2
David Murphy	4343	421	419.5	0.097	0.097	100.3
Matt Holliday	15908	1346	1353.0	0.085	0.085	99.5
Scott Podsednik	8245	763	768.9	0.093	0.093	99.2
Juan Rivera	6407	583	589.7	0.091	0.092	98.9
Emil Brown	4408	403	408.7	0.091	0.093	98.6
Fred Lewis	5194	478	492.6	0.092	0.095	97.0
Jason Michaels	5343	436	450.7	0.082	0.084	96.7
Garret Anderson	8106	701	734.6	0.086	0.091	95.4
Luis Gonzalez	6334	553	581.7	0.087	0.092	95.1
Chris Coghlan	4039	364	383.0	0.090	0.095	95.0
Adam Dunn	9522	833	890.6	0.087	0.094	93.5
Barry Bonds	4055	350	376.5	0.086	0.093	93.0
Raul Ibanez	14482	1253	1349.4	0.087	0.093	92.9
Manny Ramirez	8953	750	808.1	0.084	0.090	92.8
Josh Willingham	10019	911	983.5	0.091	0.098	92.6
Adam Lind	4535	356	388.7	0.079	0.086	91.6
Carlos Quentin	4582	390	428.8	0.085	0.094	91.0
Pat Burrell	7730	704	778.7	0.091	0.101	90.4
Craig Monroe	4170	351	392.7	0.084	0.094	89.4
Chris Duncan	4449	376	423.7	0.085	0.095	88.7
Delmon Young	8084	694	802.6	0.086	0.099	86.5
Carlos Lee	14907	1069	1238.1	0.072	0.083	86.3

At least the Royals are above average somewhere in the field. It’s not surprise that Carl Crawford is the best long term leftfielder and the slugging Carlos Lee is the worst. It also appears moving Ryan Braun off third base was the right strategy.

FanGraphs agrees on the top ranking for Carl Crawford, and also rates Soriano and Matt Murton highly.

February 26, 2011

Five-Year PMR, Rightfielders

Leave a reply

The five year look at PMR using an objective probabilistic model of range continues with rightfielders. Here is the data for the teams in right over the five year span:

Team rightfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
TEX	17063	1816	1664.4	0.106	0.098	109.1
ARI	18222	1596	1486.0	0.088	0.082	107.4
FLO	15005	1744	1628.4	0.116	0.109	107.1
CLE	16448	1718	1623.7	0.104	0.099	105.8
CIN	19014	1859	1768.9	0.098	0.093	105.1
SDN	18477	1642	1565.8	0.089	0.085	104.9
TOR	15576	1522	1462.0	0.098	0.094	104.1
SFN	16350	1756	1692.9	0.107	0.104	103.7
HOU	18435	1702	1646.5	0.092	0.089	103.4
WAS	15208	1819	1767.7	0.120	0.116	102.9
KCA	18252	1767	1739.5	0.097	0.095	101.6
MIN	17388	1639	1617.1	0.094	0.093	101.4
OAK	14344	1705	1684.4	0.119	0.117	101.2
PHI	16848	1667	1653.6	0.099	0.098	100.8
CHA	14957	1592	1583.7	0.106	0.106	100.5
BOS	17028	1569	1561.2	0.092	0.092	100.5
TBA	14795	1697	1694.1	0.115	0.115	100.2
NYA	15578	1657	1653.8	0.106	0.106	100.2
SLN	19101	1590	1607.5	0.083	0.084	98.9
LAN	17788	1538	1563.4	0.086	0.088	98.4
NYN	14262	1637	1689.5	0.115	0.118	96.9
MIL	18218	1684	1749.2	0.092	0.096	96.3
BAL	17931	1674	1740.7	0.093	0.097	96.2
ATL	17318	1538	1606.5	0.089	0.093	95.7
SEA	17452	1657	1745.2	0.095	0.100	94.9
ANA	15609	1604	1691.5	0.103	0.108	94.8
PIT	18249	1650	1747.4	0.090	0.096	94.4
CHN	16072	1632	1733.3	0.102	0.108	94.2
DET	15483	1533	1645.3	0.099	0.106	93.2
COL	18078	1404	1536.7	0.078	0.085	91.4

A comparison of FanGraphs range runs shows that four of the top eight teams are shared between the two lists. The top teams on the objective PMR list tend to have younger outfielders:

Individual rightfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Nelson Cruz	6996	809	706.4	0.116	0.101	114.5
Alexis Rios	8847	859	783.4	0.097	0.089	109.7
Jay Bruce	6701	686	629.4	0.102	0.094	109.0
Randy Winn	8227	940	867.5	0.114	0.105	108.4
Justin Upton	8906	799	738.7	0.090	0.083	108.2
Nick Swisher	5275	639	599.4	0.121	0.114	106.6
Hunter Pence	10667	1035	976.6	0.097	0.092	106.0
Jeremy Hermida	7305	843	797.4	0.115	0.109	105.7
Shin-Soo Choo	6720	702	664.2	0.104	0.099	105.7
Carlos Quentin	4297	417	397.1	0.097	0.092	105.0
Ryan Ludwick	8676	751	721.7	0.087	0.083	104.1
Jayson Werth	7953	828	798.4	0.104	0.100	103.7
Gabe Gross	4208	472	459.6	0.112	0.109	102.7
Austin Kearns	8814	1034	1008.4	0.117	0.114	102.5
Jermaine Dye	10036	1090	1067.6	0.109	0.106	102.1
J.D. Drew	12711	1157	1133.0	0.091	0.089	102.1
Mark Teahen	5593	553	544.8	0.099	0.097	101.5
Brian Giles	10894	900	908.1	0.083	0.083	99.1
Kosuke Fukudome	4981	515	521.4	0.103	0.105	98.8
Ichiro Suzuki	10925	1098	1116.5	0.101	0.102	98.3
Ken Griffey	5169	452	461.7	0.087	0.089	97.9
Corey Hart	11591	1081	1121.0	0.093	0.097	96.4
Milton Bradley	4013	437	455.1	0.109	0.113	96.0
Vladimir Guerrero	6460	670	697.8	0.104	0.108	96.0
Michael Cuddyer	10977	940	983.7	0.086	0.090	95.6
Geoff Jenkins	4295	388	406.2	0.090	0.095	95.5
Nick Markakis	16303	1503	1574.9	0.092	0.097	95.4
Shawn Green	4313	423	446.3	0.098	0.103	94.8
Jeff Francoeur	15257	1462	1541.6	0.096	0.101	94.8
Andre Ethier	9976	850	899.7	0.085	0.090	94.5
Bobby Abreu	13317	1294	1369.1	0.097	0.103	94.5
Jose Guillen	8014	747	791.7	0.093	0.099	94.4
Xavier Nady	6061	583	643.0	0.096	0.106	90.7
Juan Encarnacion	4275	344	380.6	0.080	0.089	90.4
Brad Hawpe	13340	1024	1144.4	0.077	0.086	89.5
Magglio Ordonez	10859	1022	1156.1	0.094	0.106	88.4

The bottom of the list does tend to be on the older side. I’m always surprised when Ichiro doesn’t do that well in PMR, especially in rightfield. However, he’s no longer a young man, so given average standing, it’s not bad at all. Range Runs likes Ichiro at lot better than objective PMR. Nelson Cruz, Randy Winn and Jay Bruce rate highly there, too, so there’s agreement as well.

February 24, 2011

Five-Year PMR, Third Basemen

Leave a reply

The five year look at PMR using an objective probabilistic model of range continues with third baseman. Here is the data for the teams at third over the five year span:

Team third baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
DET	16601	2278	1985.8	0.137	0.120	114.7
BOS	15756	2127	1963.4	0.135	0.125	108.3
BAL	16429	2070	1965.7	0.126	0.120	105.3
SEA	17219	2155	2056.5	0.125	0.119	104.8
WAS	16758	2079	1994.8	0.124	0.119	104.2
ANA	15651	1882	1827.8	0.120	0.117	103.0
TBA	15762	1991	1934.9	0.126	0.123	102.9
OAK	16234	2040	1985.6	0.126	0.122	102.7
SLN	17200	2120	2066.4	0.123	0.120	102.6
PIT	16839	2200	2153.5	0.131	0.128	102.2
LAN	16161	2005	1986.0	0.124	0.123	101.0
MIL	16036	1954	1936.2	0.122	0.121	100.9
SFN	15375	1926	1911.8	0.125	0.124	100.7
KCA	16406	1892	1878.3	0.115	0.114	100.7
PHI	16533	2100	2104.9	0.127	0.127	99.8
CIN	16375	1895	1900.1	0.116	0.116	99.7
CHN	15218	1844	1866.0	0.121	0.123	98.8
ATL	16062	1875	1899.5	0.117	0.118	98.7
MIN	15919	1984	2020.2	0.125	0.127	98.2
NYA	15941	1859	1893.5	0.117	0.119	98.2
NYN	15839	1936	1990.1	0.122	0.126	97.3
COL	16985	1885	1953.7	0.111	0.115	96.5
HOU	15943	1922	1997.4	0.121	0.125	96.2
CLE	16767	2048	2131.1	0.122	0.127	96.1
TOR	16319	1977	2060.7	0.121	0.126	95.9
CHA	17128	2136	2248.1	0.125	0.131	95.0
SDN	16032	1846	1960.5	0.115	0.122	94.2
ARI	16245	1851	1966.5	0.114	0.121	94.1
TEX	16725	1831	1966.0	0.109	0.118	93.1
FLO	16004	1811	1955.9	0.113	0.122	92.6

If a team used Adrian Beltre over the the last five seasons, they were pretty good at third base. Detroit leads the pack, thanks to Brandon Inge:

Individual third basemen PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Brandon Inge	12901	1772	1534.5	0.137	0.119	115.5
Evan Longoria	7774	1054	943.6	0.136	0.121	111.7
Scott Rolen	12222	1579	1461.0	0.129	0.120	108.1
Eric Chavez	4895	639	597.5	0.131	0.122	106.9
Melvin Mora	11353	1459	1369.8	0.129	0.121	106.5
Mike Lowell	9823	1298	1221.1	0.132	0.124	106.3
Nick Punto	4365	598	562.4	0.137	0.129	106.3
Jack Hannahan	4222	543	512.6	0.129	0.121	105.9
Adrian Beltre	14499	1840	1745.0	0.127	0.120	105.4
Chone Figgins	7441	905	859.0	0.122	0.115	105.3
Ryan Zimmerman	14647	1817	1745.1	0.124	0.119	104.1
Ian Stewart	5311	639	614.6	0.120	0.116	104.0
Pedro Feliz	12884	1672	1606.9	0.130	0.125	104.0
Andy LaRoche	5619	732	704.9	0.130	0.125	103.8
Alex Gordon	6409	753	726.3	0.117	0.113	103.7
Jhonny Peralta	4124	528	513.9	0.128	0.125	102.7
Joe Crede	7505	1032	1009.2	0.138	0.134	102.3
Mark Teahen	5416	646	640.9	0.119	0.118	100.8
Chipper Jones	10575	1243	1245.2	0.118	0.118	99.8
Alex Rodriguez	12766	1476	1508.3	0.116	0.118	97.9
Ty Wigginton	4642	546	557.9	0.118	0.120	97.9
Casey Blake	10279	1257	1289.0	0.122	0.125	97.5
Jose Bautista	6348	802	825.7	0.126	0.130	97.1
David Wright	14931	1808	1871.4	0.121	0.125	96.6
Aramis Ramirez	11546	1351	1405.1	0.117	0.122	96.2
Edwin Encarnacion	11086	1264	1318.5	0.114	0.119	95.9
Chad Tracy	4557	522	547.6	0.115	0.120	95.3
Pablo Sandoval	4718	559	591.1	0.118	0.125	94.6
Kevin Kouzmanoff	11031	1268	1346.5	0.115	0.122	94.2
Mark Reynolds	9924	1130	1205.9	0.114	0.122	93.7
Troy Glaus	8065	923	989.4	0.114	0.123	93.3
Garrett Atkins	9743	1045	1120.7	0.107	0.115	93.2
Hank Blalock	4054	457	490.9	0.113	0.121	93.1
Casey McGehee	4211	467	508.5	0.111	0.121	91.8
Miguel Cabrera	6241	715	789.2	0.115	0.126	90.6
Michael Young	5549	597	663.5	0.108	0.120	90.0
Jorge Cantu	4607	485	562.2	0.105	0.122	86.3

Of all the positions run so far, this one matches my expectations the best. The only person near the top of the list that strikes me as not belonging is Melvin Mora, and indeed FanGraphs ranks him low in terms of range runs.

You can also see how much Adrian Beltre will help the Rangers defensively. For every 100 outs Young would field, Beltre should pick up 116 to 117. Over 200 Young outs, Beltre will turn over a full game’s worth.

February 23, 2011

Five-Year PMR, Centerfielders

2 Replies

The five year look at fielding using an objective probabilistic model of range continues with centerfielders. As always, we start with the complete team data:

Team centerfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
NYN	14066	2194	1981.0	0.156	0.141	110.8
ANA	15635	2198	2109.0	0.141	0.135	104.2
MIN	15363	2219	2130.9	0.144	0.139	104.1
SEA	12671	2257	2171.5	0.178	0.171	103.9
HOU	14431	2113	2049.3	0.146	0.142	103.1
FLO	16302	2163	2101.4	0.133	0.129	102.9
DET	16457	2195	2141.6	0.133	0.130	102.5
CLE	16721	2108	2061.6	0.126	0.123	102.2
CIN	14823	2172	2132.6	0.147	0.144	101.8
CHN	14225	1975	1944.1	0.139	0.137	101.6
SDN	16476	2117	2090.9	0.128	0.127	101.2
ATL	15114	1949	1926.2	0.129	0.127	101.2
COL	12832	1949	1931.8	0.152	0.151	100.9
PHI	13987	2008	1995.2	0.144	0.143	100.6
WAS	14229	2228	2221.1	0.157	0.156	100.3
BOS	16397	2063	2059.8	0.126	0.126	100.2
SLN	14393	1942	1937.5	0.135	0.135	100.2
SFN	13812	2116	2116.5	0.153	0.153	100.0
NYA	12545	2093	2101.0	0.167	0.167	99.6
KCA	16621	2147	2160.6	0.129	0.130	99.4
BAL	14513	2229	2244.9	0.154	0.155	99.3
TBA	15993	2195	2210.7	0.137	0.138	99.3
CHA	15444	2045	2064.3	0.132	0.134	99.1
PIT	15893	2057	2092.6	0.129	0.132	98.3
MIL	14597	2089	2144.3	0.143	0.147	97.4
ARI	16596	2040	2097.5	0.123	0.126	97.3
LAN	15463	1869	1926.8	0.121	0.125	97.0
OAK	12127	2074	2146.9	0.171	0.177	96.6
TOR	14658	1880	1957.1	0.128	0.134	96.1
TEX	16113	2060	2165.3	0.128	0.134	95.1

The Mets are just off the chart in centerfield, thanks to Carlos Beltran.

996815061385.90.1510.139108.7715410991015.70.1540.142108.2854511901100.70.1390.129108.16158889828.40.1440.135107.35042913862.40.1810.171105.94643691660.60.1490.142104.67094940903.90.1330.127104.04290615596.00.1430.139103.21151116431592.80.1430.138103.14150727707.30.1750.170102.84150535522.10.1290.126102.51342518871843.30.1410.137102.45666943921.60.1660.163102.31175715131488.70.1290.127101.6799911751161.20.1470.145101.26094844835.00.1380.137101.1976413531338.70.1390.137101.1762211851174.90.1550.154100.96687913906.60.1370.136100.7712310381032.80.1460.145100.51044515451543.60.1480.148100.11147515401544.60.1340.13599.75741898900.40.1560.15799.7716510011004.50.1400.14099.64389565568.00.1290.12999.56447802809.60.1240.12699.17686951969.00.1240.12698.1820710561076.10.1290.13198.11166615021539.20.1290.13297.66051765791.30.1260.13196.74883643665.50.1320.13696.64874595616.60.1220.12796.55253758788.00.1440.15096.21251115721646.20.1260.13295.54990636675.90.1270.13594.14142602649.10.1450.15792.7

Individual centerfielder PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Carlos Beltran
Willy Taveras
Coco Crisp
Carlos Gomez
Franklin Gutierrez
Alexis Rios
Andruw Jones
Denard Span
Torii Hunter
Rajai Davis
Alfredo Amezaga
Curtis Granderson
Melky Cabrera
Grady Sizemore
Shane Victorino
Jim Edmonds
B.J. Upton
Adam Jones
Gary Matthews
Michael Bourn
Aaron Rowand
Mike Cameron
Corey Patterson
Marlon Byrd
Joey Gathright
Juan Pierre
Matt Kemp
Nate McLouth
Chris Young
Cody Ross
Josh Hamilton
Jacoby Ellsbury
David DeJesus
Vernon Wells
Andrew McCutchen
Mark Kotsay

The Twins should be proud that they produced fielding gems Torii Hunter and Denard Span. I’m somewhat surprised Jacoby Ellsbury does poorly, but maybe the Red Sox realized that when they moved him to left at the start of the 2010 season. Josh Hamilton clearly should not be the regular CF for the Rangers.

FanGraphs’ Range Runs doesn’t like Wells or Kotsay, but rates Beltran a lot lower. In general, however, it looks like there is more agreement here than among the second baseman.

February 22, 2011

Five Year PMR, Second Basemen

4 Replies

My five-year survey of fielders using objective PMR continues with second basemen. (You can find all Probabilistic Model of Range posts here.) We start with the composite team view:

Team second baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
NYA	16081	2674	2462.3	0.166	0.153	108.6
COL	17011	2796	2590.3	0.164	0.152	107.9
ARI	16276	2672	2498.8	0.164	0.154	106.9
PHI	16614	2596	2481.2	0.156	0.149	104.6
TEX	16691	2721	2611.7	0.163	0.156	104.2
OAK	16203	2653	2563.1	0.164	0.158	103.5
TOR	16418	2663	2590.4	0.162	0.158	102.8
CLE	16901	2687	2616.5	0.159	0.155	102.7
NYN	16069	2476	2428.2	0.154	0.151	102.0
HOU	15999	2467	2427.4	0.154	0.152	101.6
BOS	15779	2509	2478.1	0.159	0.157	101.2
SEA	16591	2562	2531.3	0.154	0.153	101.2
ATL	16587	2585	2554.0	0.156	0.154	101.2
MIL	16105	2373	2351.5	0.147	0.146	100.9
FLO	16105	2493	2474.4	0.155	0.154	100.8
LAN	16188	2573	2555.3	0.159	0.158	100.7
WAS	16839	2518	2517.7	0.150	0.150	100.0
ANA	15624	2549	2554.1	0.163	0.163	99.8
DET	16562	2593	2598.1	0.157	0.157	99.8
MIN	16082	2630	2635.0	0.164	0.164	99.8
CIN	16377	2478	2484.3	0.151	0.152	99.7
CHA	16459	2541	2552.9	0.154	0.155	99.5
CHN	15242	2340	2379.5	0.154	0.156	98.3
SFN	15403	2308	2360.4	0.150	0.153	97.8
SLN	17217	2555	2620.3	0.148	0.152	97.5
TBA	15843	2408	2472.2	0.152	0.156	97.4
BAL	17338	2474	2573.4	0.143	0.148	96.1
SDN	16048	2442	2566.9	0.152	0.160	95.1
KCA	16381	2448	2575.9	0.149	0.157	95.0
PIT	16877	2218	2472.8	0.131	0.147	89.7

The Yankees dominate this list, while I’m not surprised the Royals, a very poor defensive team, come up near the bottom. The Giants, who like to play veterans, tend toward the bottom as well. Next, the individual players, the regulars and semi regulars:

Individual second baseman PMR, 2006-2010, four parameter objective model built on visiting team data.
Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Orlando Hudson	12946	2221	2031.8	0.172	0.157	109.3
Robinson Cano	14648	2415	2237.8	0.165	0.153	107.9
Mark Ellis	11955	2018	1894.1	0.169	0.158	106.5
Ian Kinsler	12629	2103	1976.6	0.167	0.157	106.4
Clint Barmes	5285	855	805.1	0.162	0.152	106.2
Jamey Carroll	6028	1000	943.2	0.166	0.156	106.0
Jose Lopez	11573	1878	1772.7	0.162	0.153	105.9
Chase Utley	14380	2270	2152.4	0.158	0.150	105.5
Adam Kennedy	7914	1276	1224.0	0.161	0.155	104.2
Dustin Pedroia	10181	1645	1588.2	0.162	0.156	103.6
Kazuo Matsui	7745	1207	1167.3	0.156	0.151	103.4
Placido Polanco	10775	1723	1683.6	0.160	0.156	102.3
Aaron Hill	12243	1968	1928.3	0.161	0.158	102.1
Howie Kendrick	8266	1375	1355.0	0.166	0.164	101.5
Josh Barfield	5445	879	874.1	0.161	0.161	100.6
Dan Uggla	15092	2318	2318.3	0.154	0.154	100.0
Rickie Weeks	10082	1457	1459.3	0.145	0.145	99.8
Brandon Phillips	14595	2213	2217.3	0.152	0.152	99.8
Kelly Johnson	9706	1474	1481.7	0.152	0.153	99.5
Felipe Lopez	5964	898	904.0	0.151	0.152	99.3
Aaron Miles	5054	754	767.0	0.149	0.152	98.3
Tadahito Iguchi	6725	1004	1030.2	0.149	0.153	97.5
Mark Grudzielanek	7038	1052	1085.3	0.149	0.154	96.9
Ronnie Belliard	7049	1025	1057.5	0.145	0.150	96.9
Luis Castillo	10720	1608	1668.0	0.150	0.156	96.4
Brian Roberts	13706	1945	2038.1	0.142	0.149	95.4
Ray Durham	6439	925	978.5	0.144	0.152	94.5
Freddy Sanchez	10154	1398	1483.3	0.138	0.146	94.2
Jeff Kent	6600	970	1043.5	0.147	0.158	93.0

I like that the peripatetic Orlando Hudson comes up at the top of this list. He played for three teams over these five seasons. I sometimes wonder if the way the model is built makes fielders on strong offensive teams look good (like Robinson Cano). The Yankees send a number of good left-handed hitters to the plate, so when a model is built for Yankee Stadium, one might expect the ability for a second baseman to field a ball there is tougher than it may be in reality. Hudson, coming out high on the list, playing for fairly weak offensive teams argues against that.

At the other end is an old Jeff Kent. Right smack in the middle, however, is Dan Uggla, who owns a reputation as a poor fielder. In fact, there is a big discrepancy here between the objective model and FanGraphs range run ranking. The latter puts Cano, Uggla and Orlando Hudson all near the bottom of the list. This is fairly huge. Looking at Bill James Online, Hudson was near the top in +/- in three of the five years.

Here’s a closer look at Hudson and Cano:

Type of Batted Ball	Hudson’s Index	Cano’s Index
Ground	103.2	106.5*
Line	125.8	112.6
Pop	131.5*	112.6

*Highest

So Hudson caught a lot of line drives and is a pop up hog. Cano’s ranking looks pretty good, however. I don’t know if you can be lucky with line drives over a five year period. Maybe Orlando is really good at positioning himself.

February 21, 2011

Five Year PMR, Shortstops

5 Replies

The long term fielding study continues with a look at shortstops. (See yesterday’s post for background.) The following table shows how the teams performed as a whole at the position over the last five years:

Team shortstop PMR, 2006-2010, four parameter objective model built on visiting team data.
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
COL	17144	2884	2684.1	0.168	0.157	107.4
ATL	16331	2696	2544.0	0.165	0.156	106.0
MIN	16153	2803	2670.7	0.174	0.165	105.0
SLN	17324	2828	2721.8	0.163	0.157	103.9
TEX	16725	2713	2612.6	0.162	0.156	103.8
LAN	16314	2744	2643.8	0.168	0.162	103.8
MIL	16139	2622	2535.9	0.162	0.157	103.4
PIT	16925	2804	2746.7	0.166	0.162	102.1
TOR	16460	2617	2574.3	0.159	0.156	101.7
KCA	16492	2635	2595.6	0.160	0.157	101.5
PHI	17622	2606	2569.6	0.148	0.146	101.4
SDN	16243	2599	2564.5	0.160	0.158	101.3
ANA	15703	2510	2481.2	0.160	0.158	101.2
HOU	16079	2637	2612.2	0.164	0.162	101.0
CHA	16511	2707	2680.3	0.164	0.162	101.0
CHN	15291	2499	2473.8	0.163	0.162	101.0
BAL	16583	2642	2643.3	0.159	0.159	100.0
DET	16622	2566	2575.4	0.154	0.155	99.6
SEA	16618	2590	2605.7	0.156	0.157	99.4
CLE	16968	2764	2784.9	0.163	0.164	99.3
CIN	16487	2497	2521.5	0.151	0.153	99.0
ARI	17295	2524	2554.6	0.146	0.148	98.8
TBA	16778	2420	2451.4	0.144	0.146	98.7
WAS	16945	2623	2664.9	0.155	0.157	98.4
FLO	17005	2443	2486.5	0.144	0.146	98.2
BOS	15942	2448	2498.9	0.154	0.157	98.0
OAK	16251	2588	2646.0	0.159	0.163	97.8
SFN	15499	2509	2565.1	0.162	0.166	97.8
NYA	16663	2364	2496.1	0.142	0.150	94.7
NYN	16262	2491	2641.2	0.153	0.162	94.3

I’m surprised Texas does this well, as Michael Young was the shortstop for a good period of time here, and he’s not known for his defensive ability. Minnesota’s ranking makes some sense as the team is known for good defense. Boston’s rotating players at the position took a toll. It’s been a terrible half-decade in New York. Despite their high-profile shortstops, this study show they were poor fielders.

On to the individuals. The following table shows shortstops on the field for at least 5000 balls in play over the last five seasons. That gives us the regulars and semi-regulars.

Fielder	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
Brendan Ryan	5653	1004	885.8	0.178	0.157	113.3
Yunel Escobar	8574	1464	1332.7	0.171	0.155	109.9
Elvis Andrus	5577	951	878.4	0.171	0.158	108.3
Troy Tulowitzki	11389	1928	1779.9	0.169	0.156	108.3
Jack Wilson	10305	1759	1665.8	0.171	0.162	105.6
Jason Bartlett	12100	1939	1841.6	0.160	0.152	105.3
Adam Everett	7554	1263	1205.7	0.167	0.160	104.8
Rafael Furcal	11168	1901	1819.9	0.170	0.163	104.5
Alexei Ramirez	6102	1008	968.8	0.165	0.159	104.0
Cristian Guzman	6351	1012	976.9	0.159	0.154	103.6
Erick Aybar	7345	1197	1159.1	0.163	0.158	103.3
Cesar Izturis	9194	1514	1472.7	0.165	0.160	102.8
Ronny Cedeno	7367	1210	1182.9	0.164	0.161	102.3
Jimmy Rollins	14724	2203	2154.5	0.150	0.146	102.3
J.J. Hardy	10454	1688	1654.7	0.161	0.158	102.0
Marco Scutaro	8509	1352	1334.1	0.159	0.157	101.3
Alex Gonzalez	9437	1507	1489.8	0.160	0.158	101.2
Michael Young	9493	1498	1483.6	0.158	0.156	101.0
Ryan Theriot	7811	1263	1255.1	0.162	0.161	100.6
Jhonny Peralta	10774	1782	1776.7	0.165	0.165	100.3
John McDonald	5264	811	811.2	0.154	0.154	100.0
Juan Uribe	8144	1329	1337.5	0.163	0.164	99.4
Omar Vizquel	7598	1247	1255.2	0.164	0.165	99.3
Carlos Guillen	5410	854	861.0	0.158	0.159	99.2
Bobby Crosby	6977	1115	1128.6	0.160	0.162	98.8
Orlando Cabrera	14196	2243	2276.2	0.158	0.160	98.5
Hanley Ramirez	15208	2191	2225.0	0.144	0.146	98.5
Julio Lugo	6982	1074	1090.3	0.154	0.156	98.5
Khalil Greene	8178	1264	1284.9	0.155	0.157	98.4
Yuniesky Betancourt	15070	2318	2372.4	0.154	0.157	97.7
Miguel Tejada	12527	1963	2009.9	0.157	0.160	97.7
Edgar Renteria	11358	1746	1797.3	0.154	0.158	97.1
Stephen Drew	13184	1869	1946.1	0.142	0.148	96.0
David Eckstein	5814	880	921.7	0.151	0.159	95.5
Jose Reyes	12550	1930	2041.1	0.154	0.163	94.6
Felipe Lopez	5884	870	931.7	0.148	0.158	93.4
Derek Jeter	14916	2076	2241.1	0.139	0.150	92.6

You can see why Texas ranked so high. Michael Young played a little above league average, while Elvis Andrus came out near the top of the rankings. Of the longer term shortstops, Troy Tulowitzki brought the Rockies up to the number one team ranking.

Derek Jeter ranks last, despite his good 2009 season, which brings me back to the whole idea of presenting a more objective model, biases in scoring hit location make some players look better. This system shows Jeter turning 165 fewer balls into outs than expected. Looking at FanGraphs and Baseball Reference, both which use batted ball location data, this objective model is very much in line with their results. Jeter is a poor fielder compared to the average shortstop.

The big disagreement comes with Omar Vizquel. FanGraphs rates his range very high, and BR shows him positively contributing to the defense. Since Omar has such a great reputation as a fielder, maybe scorers are putting balls he doesn’t get to more out of reach than than they should be. It would make sense, at Omar’s age, that he would not be that great defensively anymore.

As always, I’m interested in your thoughts.

February 20, 2011

Five Year PMR

Leave a reply

As a follow up to my last post on objective Probabilistic Model of Range (PMR), I down loaded the latest retrosheet data, complete through 2010. I thought I might actually extend the model with vectors, but few events have hit location attached.

On the other hand, retrosheet data is very consistent from year to year with classifying balls in play as grounders, flies, liners and pops. So I added that to the model. The model now contains three objective parameters, batter-handedness, pitcher handedness, and stadium, and well as the slightly subjective batted ball type.

I also decide to look at these models longer term, so I’ll do a series of five-year studies, covering fielding from 2006 to 2010. For each model for a season, I used five years of data, not including the year in question. This way, none of the data for the season in question was used to train the model. I used the 2005 through 2010 data. For example, the model for 2006 is built from 2005, 2007, 2008, 2009, 2010. The 2009 data is built from 2005-2008 and 2010. The only exception are the models for Target Field, which only existed in 2010. I also only built the model with data on the visiting fielders, so a great or terrible defender for a team would not influence the model that much.

Just to review, PMR determines the probability of a batted ball being turned into an out based on a set of parameters. Adding those probabilities up for each ball in play gives us the expected number of outs. Calculating an index by the formula (100*actual outs/predicted outs) allows a ranking, with number over 100 good and numbers under 100 poor. When I used BIS data, I included a direction for the ball, and a measure of distance. Those elements are subjective, however, so leaving them out removes the biases of the scorers. Since the number of balls in play an individual fielder can handle is small in any year, by looking at a longer term model we should get a better picture of who are the best glove men.

Compiling the data for five years, the Red Sox were the best defensive team in the majors.

Team PMR, 2006-2010
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
BOS	17014	11800	11524.4	0.694	0.677	102.4
NYA	17125	11878	11636.5	0.694	0.680	102.1
COL	17595	12096	11894.7	0.687	0.676	101.7
NYN	17490	12168	11973.4	0.696	0.685	101.6
TOR	17110	11874	11691.9	0.694	0.683	101.6
ANA	17487	12023	11880.8	0.688	0.679	101.2
TEX	17830	12260	12117.6	0.688	0.680	101.2
DET	17710	12214	12084.1	0.690	0.682	101.1
SEA	17881	12370	12253.6	0.692	0.685	101.0
SLN	17996	12452	12327.8	0.692	0.685	101.0
ATL	17229	11918	11816.6	0.692	0.686	100.9
SFN	16821	11752	11653.6	0.699	0.693	100.8
PHI	17548	12164	12077.2	0.693	0.688	100.7
TBA	17142	11856	11768.6	0.692	0.687	100.7
ARI	17339	11945	11874.9	0.689	0.685	100.6
CHN	16667	11615	11544.6	0.697	0.693	100.6
LAN	16889	11745	11690.8	0.695	0.692	100.5
CLE	18081	12369	12310.3	0.684	0.681	100.5
OAK	17441	12122	12075.0	0.695	0.692	100.4
MIN	17929	12336	12283.0	0.688	0.685	100.4
SDN	17237	12032	12019.3	0.698	0.697	100.1
KCA	17765	12096	12096.4	0.681	0.681	100.0
BAL	18048	12392	12430.5	0.687	0.689	99.7
WAS	18063	12432	12469.4	0.688	0.690	99.7
FLO	17406	11830	11860.0	0.680	0.681	99.7
CIN	17540	12098	12143.1	0.690	0.692	99.6
MIL	17408	11921	11968.0	0.685	0.687	99.6
PIT	18280	12382	12481.3	0.677	0.683	99.2
HOU	17443	11939	12083.1	0.684	0.693	98.8
CHA	17630	12044	12197.7	0.683	0.692	98.7

I was somewhat surprised to see the Yankees second. The team has brought in a few good defenders in recent years. The Rockies were the best team in the NL, and their speedy outfielders make a difference in the big park. The White Sox inhabit the bottom of the rankings, with the Astros bringing up the rear in the NL.

Over the next week studies will include each position, as well as defenses behind pitchers.

February 10, 2011

Objective PMR

3 Replies

In November I made note of this post, Jeter Uncertainty Principle, which linked to a Baseball Prospectus article that wondered if the batted ball fielding systems were overestimating the range of players.

the spread of observed performance in metrics like UZR and DRS is much, much smaller than that of metrics like nFRAA (or Tom Tango’s With Or Without You system, which is similarly down on Jeter’s fielding ability).

This got me thinking, but unfortunately my computer crashed and I lost my MSSql database with my fielding data. I recently downloaded the retrosheet data through 2009, and decided to do an experiment.

The Probabilistic Model of Range uses six parameters to determine the probability of a ball in play being turned into an out. Three of those, the direction, velocity, and batted ball type, are subjective measures. Three of them, the handedness of the batter, the handedness of the pitcher, and the park are objective. If I used just those three objective measures to construct the model, would I see a bigger spread in the data?

I last computed PMR in 2008, and here is the team listing from that year. I built the retrosheet model a bit differently. I only use visiting fielders so a great or terrible home fielder doesn’t over influence the model, but I used a different set of data to build the model. Instead of using the same year, I used the previous three seasons and the following season. So for 2008, I used the 2005, 2006, 2007 and 2009 seasons. Here are the results for the teams:

Objective PMR, 2008 Teams, model built with visiting team data from 2005, 2006, 2007, and 2009
Team	In Play	Actual Outs	Predicted Outs	Actual DER	Predicted DER	Index
BOS	4229	2954	2825.582	0.699	0.668	104.5
TBA	4265	3024	2908.787	0.709	0.682	104.0
TOR	4217	2962	2881.598	0.702	0.683	102.8
CHN	4164	2930	2857.973	0.704	0.686	102.5
ANA	4374	3024	2973.781	0.691	0.680	101.7
OAK	4292	2992	2944.710	0.697	0.686	101.6
MIL	4362	3048	3006.376	0.699	0.689	101.4
SLN	4604	3198	3159.799	0.695	0.686	101.2
FLO	4342	3005	2970.709	0.692	0.684	101.2
PHI	4399	3061	3023.389	0.696	0.687	101.2
ATL	4392	3040	3008.062	0.692	0.685	101.1
KCA	4416	3039	3008.497	0.688	0.681	101.0
NYN	4341	3030	3004.247	0.698	0.692	100.9
NYA	4351	2963	2941.234	0.681	0.676	100.7
COL	4535	3075	3056.998	0.678	0.674	100.6
CLE	4514	3094	3078.160	0.685	0.682	100.5
HOU	4298	2999	2988.713	0.698	0.695	100.3
BAL	4539	3120	3112.376	0.687	0.686	100.2
DET	4536	3107	3104.271	0.685	0.684	100.1
WAS	4420	3044	3044.363	0.689	0.689	100.0
LAN	4277	2951	2950.631	0.690	0.690	100.0
MIN	4584	3141	3143.573	0.685	0.686	99.9
PIT	4688	3166	3187.588	0.675	0.680	99.3
ARI	4236	2903	2927.417	0.685	0.691	99.2
SEA	4514	3069	3095.687	0.680	0.686	99.1
SDN	4426	3080	3109.792	0.696	0.703	99.0
SFN	4237	2904	2942.056	0.685	0.694	98.7
CHA	4395	3006	3058.851	0.684	0.696	98.3
TEX	4671	3126	3185.941	0.669	0.682	98.1
CIN	4313	2904	2990.839	0.673	0.693	97.1

The number of balls in play do not match up precisely between BIS and Retrosheet, and I’m exploring why (in the case of Baltimore, it was a foul ball error). The ordering is different, which isn’t surprising given the different model and the way the model was built. What I would like you to notice, however, is that the spread of the index is indeed wider than the original PMR model shows. So a probabilistic model that uses only objective parameters also shows a larger spread.

I’m going to try to get more comfortable with the data, making sure the differences are just foul pops. Then, I’ll build some positional models to see what happens there.

August 30, 2010

Counting Fielder Outs

3 Replies

Rob Neyer attended the PITCHf/x Summit, and talks about some ideas for measuring fielding with Hit Tracker:

As Greg pointed out, all of our current “new” defensive metrics are “zone-based”; that is, they begin by separating the field into distinct zones, noting in which zone a play has been made, and then apportioning credit (or not) when a fielder makes a play (or doesn’t) in that zone. I’m simplifying, of course, but essentially every reputable system now in use shares two significant defects: the “zones” don’t fit neatly into today’s highly variable outfield dimensions, and the systems don’t have any way to account for the fielder’s starting position.

What’s more, Greg argued that the great majority of plays made by outfielders are essentially irrelevant. If an outfielder doesn’t have to move to catch a line drive, that play still counts in his favor. If he can’t catch that same fly ball because he was pulled way in with the potential winning run on third base in the ninth inning, that play still counts against him.

What Greg proposes is stripping out the plays that any outfielder could make, and the plays that perhaps no outfielder could make, and look instead at only the plays that might be made. And this might be done given FIELDf/x’s ability to record an outfielder’s starting position, the hang time of the batted ball, and the baseball’s landing point.

No, nothing needs to be stripped out. The starting position of the fielder, the hang time and the landing position of the ball become part of the probabilistic model. It’s going to be very interesting to see how the two systems measure fielding.

Note that one isn’t necessarily better than the other. As I’ve noted before, my Probabilistic Model of Range doesn’t measure just range, but the ability of a fielder to position himself well. From the description above, True Range would measure the ability of a fielder to move to a batted ball, his actual range. If the fielder doesn’t position himself well, there should be a penalty for that also, and the traditional zone based system catches those.

If we see a player who does well in a zone system but poorly in a range system, we might conclude he (or his coaches) position him well.

March 30, 2009

Charting Range

Leave a reply

Dan Turkenkopf comes up with a nice graphic presentation of the Probabilistic Model of Range for the Yankees outfield.

March 30, 2009

Defensive Mention

Leave a reply

Thanks to Geoff Baker for a mention of PMR in his article on defense:

Dewan introduced a well-known Plus/Minus system a few years back, then used it as the main component of his new DRS metric. But there are other top systems, like Mitchel Lichtman’s Ultimate Zone Rating (UZR) or David Pinto’s Probabilistic Model of Range (PMR), that intelligent baseball minds swear are just as good or superior.

“I like to look at all of them,” said Tony Blengino, a special assistant to Mariners general manager Jack Zduriencik, who oversees the team’s new stats branch. “It’s kind of an emerging frontier of statistical analysis, and I think they all bring something to the table. More often than not they point in similar directions. Every now and then, they don’t. But I think there’s progress being made in the fielding analysis. I think they all have something to say.”

December 9, 2008

Comparing Fielding Systems

Leave a reply

Tom Tango compares both UZR models with PMR at The Book Blog.