A's Grand Unified Projection: Batters
There are a few major projection systems out there that all claim to be the best. They all present plausible theories and evidence that they're more accurate than the others. PECOTA seems to have risen to the top, but it's still just one system.
I decided to stop trying to figure out which one to use, and just use four of them at once.
After importing all of the data into a spreadsheet, I unified it to produce one projection for a player. I figure it's better to have four experts reach a compromised consensus on how something will turn out, than to rely on just one or another. We'll see how it turns out at the end of October.
I tried to post the data in a table in here, but the "scoop" software AN runs doesn't like my table formatting at all... I swear, I'm not trying to lead you all away from AN, it just wouldn't format correctly.
So... here's a link to the table, in a nice pretty, readable setting.
http://thepastime.net/2007/01/16/the-grand-unified-projection-batters/
Now, I'm not claiming that I've created a system that's better than the sum of its parts. It may in fact be less than the sum of its parts. We'll have to see how it looks at the end of October.
If you'd like to see the data I'm working with, or are curious as to my process, let me know. I can share the spreadsheet on Google docs if there's some interest. I don't want to be too open about it, though, since I'm using BP's brand new PECOTA data, and they really don't like it being distributed gratis...
Anyway, what do you think? Even though I'm probably his biggest supporter, I'm surprised at how highly all four systems regard Dan Johnson. It also suggests not to get too excited about Mike Piazza and his 15 projected homers.
I'll post the pitcher projections either later today or tomorrow.
Update [2007-1-16 21:9:57 by Nebraska]: Weighted projections are now up.
0 recs |
46 comments
Comments
Interesting stuff!
--
by rungood on Jan 16, 2007 5:51 AM PST reply actions 0 recs
Great job! But the team can't hit...
by WaddellCanseco on Jan 16, 2007 7:42 AM PST reply actions 0 recs
Neat entry
by apilgrim on Jan 16, 2007 8:38 AM PST reply actions 0 recs
PECOTA doesn't
They make no assumptions on roster use. They ABs they are assigned are based on recent history, adjusted based on various characteristics of the player.
Pizza had about 300 ABs because he has recently averaged about 300 ABs and he is old.
by devo on Jan 16, 2007 1:40 PM PST up reply actions 0 recs
Thanks for sharing that work...
We know players like DJ, Durazo, Keilty, & Bocachica excel in platoons and I suspect Barton and Robnett are two more...
...Any evidence?
by Billy Ball 2005 on Jan 16, 2007 10:14 AM PST reply actions 0 recs
Ha
With an offense like that i smell sub .500 season too.
I am much less worried about the pitching than the hitting now.
Very interesting projections :D
by sublimeguyjohn on Jan 16, 2007 10:25 AM PST reply actions 0 recs
Great stuff Nebraska. Very interesting....
I am very worried about this O and have been all off-season. I think everything needs to break right for the O as is currently sits to be succesful. That's scary, cuz rarely does everything break right
by OaktownPower on Jan 16, 2007 10:42 AM PST reply actions 0 recs
undo, undo
those numbers just made me get under my desk and curl into a ball.
by Eric in Atlanta on Jan 16, 2007 10:56 AM PST reply actions 0 recs
the parts
perhaps the four could be weighed based on how accurate they have been in the past. i don't know if this information exists for previous years, but the 2006 results are here:
http://www.baseballthinkfactory.org/files/newsstand/discussion/2006_projection_results/
Hitters
PECOTA .736
BIS/James .685
ZiPS .684
Chone .677
Pitchers
ZiPS .459
PECOTA .451
BIS/James .445
Chone .424
here's something for 2003, but 4 of the 6 are different:
http://www.baseballprospectus.com/article.php?articleid=2515
by xbhaskarx on Jan 16, 2007 11:26 AM PST reply actions 0 recs
I've been considering weighting them
by Ryan Armbrust on Jan 16, 2007 11:57 AM PST up reply actions 0 recs
two more links
the diamond mind projection system was close to pecota:
http://www.diamond-mind.com/articles/tmpred06.htm
by xbhaskarx on Jan 16, 2007 12:10 PM PST up reply actions 0 recs
i don't know why everyone is so upset
by xbhaskarx on Jan 16, 2007 11:57 AM PST reply actions 0 recs
Not very...
by OaktownPower on Jan 16, 2007 12:03 PM PST up reply actions 0 recs
Last year's projections
My take on this year's projections is that once again the A's offense is dependent on a lot of players with iffy health. Projection systems are all necessarily "dumb" - they have the information of how well players hit in the past, but no sense of why they did so. Crosby, Chavez, and Bradley are all very good bets to beat their projections if they stay healthy, but, of course, they are also poor bets to stay healthy. So the numbers listed here make sense as averages, but there is a lot of unpredictable variance there, even beyond the luck/sample size issues detailed in one of the articles Bhaskar linked.
The main difference from last year's team is that we don't have any hitters like Frank Thomas, who was a much more extreme version of the same thing - very high risk (health) but very high reward (production) when he was able to stay in the lineup. Even in the best case, I think it's unlikely that Piazza, Chavez, or anyone else will match Thomas' production from last year. We do have Rich Harden, though, who's sort of the pitching equivalent.
by andeux on Jan 16, 2007 12:52 PM PST up reply actions 0 recs
Could you post entries
Thanks
by apilgrim on Jan 16, 2007 12:03 PM PST reply actions 0 recs
sure, give me a bit
by Ryan Armbrust on Jan 16, 2007 12:06 PM PST up reply actions 0 recs
and here they are
http://thepastime.net/2007/01/16/the-grand-unified-projection-additions/
by Ryan Armbrust on Jan 16, 2007 12:30 PM PST up reply actions 0 recs
i would have thought
by xbhaskarx on Jan 16, 2007 12:40 PM PST up reply actions 0 recs
thanks for the post
by apilgrim on Jan 16, 2007 6:52 PM PST up reply actions 0 recs
Projections
A Yankees Blog ran the CHONE projections through 100 Diamond Mind Baseball seasons and had the A's scoring 836 runs and allowing 722 (this assumes Crosby and Harden stay healthy): http://yankeefan.blogspot.com/2006/12/extremely-early-projections.html
by Danny on Jan 16, 2007 12:08 PM PST reply actions 0 recs
but chone was the least reliable of the four
by xbhaskarx on Jan 16, 2007 12:11 PM PST up reply actions 0 recs
Good stuff ...
by devo on Jan 16, 2007 1:43 PM PST reply actions 0 recs
done
by Ryan Armbrust on Jan 16, 2007 2:05 PM PST up reply actions 0 recs
What happened?
by Flash G on Jan 16, 2007 3:06 PM PST reply actions 0 recs
Last year's results
JAMES ZIPS PrOPS PECOTA Actual
Chavez 849 854 810 833 786
Thomas 873 804* 839 926
Bradley 820* 854 799* 802 818
Johnson 828 826 840 815 704
Crosby 800 801 777 799 636
Swisher 776 782 785 802 864
Ellis 804 771 744 777 704
Kotsay 781 786 762 746 719
Payton 783 739 714 753 743
Kendall 745 713 715 671 709
Ginter 775 738 758 721 --
Kielty 769 751 754 756 770
Melhuse 756 664 716 648
Perez 773 672* 722 389
Scutaro 738 718 720 703 747
Everyone missed low on Thomas (coming off two injury-plagued seasons) and Swisher (coming off a mediocre rookie year with lots of strikeouts), and to a much lesser extent Scutaro (career year). Everyone missed high on Chavez (injuries), Johnson (AAAA), Crosby (injuries), Ellis (coming off a career year), Kotsay (injuries), Perez (just sucks). They were pretty accurate on Bradley, Kendall, Payton, and Kielty.
On the whole, PECOTA was a little better than the others, but not spectacularly so. And it kind of shows the limits of these systems, as they really can't say anything useful about who will suddenly get hurt, or unexpectedly stay healthy. All they can do is give a rough talent level of the hitters.
This year's team might be even harder to predict than last year's. The biggest question mark is obviously health, again. In addition there is Piazza (once a great hitter, but PECOTA hates him) and Durazo (hasn't played in the majors in a year and a half, so who knows), along with some prospects (Barton, Goleski) who probably aren't ready yet, but could surprise us.
by andeux on Jan 16, 2007 7:35 PM PST reply actions 0 recs
about individuals
That's why I don't put much faith in the individual player's projection, but when you look at the whole team, it's more accurate. For every player that exceeds expectations, there's one that doesn't meet them. I think they're more useful for projecting team results than individual. That's not to say that they can't get close for one player, but don't go betting the farm on it.
by Ryan Armbrust on Jan 16, 2007 10:56 PM PST up reply actions 0 recs
And how did the pool game go?
What I don't understand is how PECOTA and everybody came up with those projections for Lastings Milledge. Let me rephrase that, I know how they came up with those numbers I'm just shocked that no one had the common sense to realize how screwed up their data was.
by grover on Jan 16, 2007 11:09 PM PST up reply actions 0 recs
actually
by Ryan Armbrust on Jan 16, 2007 11:58 PM PST up reply actions 0 recs
BP explained
When you consider that he was 21, and he put up a major league equivalent .489 (I think it was) slugging, it's not surprising that he would have a good projection. As the scouts are generally in agreement with the stats guys on him, I'm inclined to agree. If we do get him, he'll probably be our best player within a couple of years and he has MVP potential down the road.
by yarky on Jan 17, 2007 4:08 AM PST up reply actions 0 recs
The bulk of his success came in April
He hit 247/341/408 for the rest of his AAA stint.
He hit 241/310/380 in 166 big league at bats.
Look at those last two sets of numbers, consider that they represent a shade under 400 consecutive AB, and tell me how you can project 281/358/443 next year in the Show.
by grover on Jan 17, 2007 5:50 AM PST up reply actions 0 recs
This is how
You raise legitimate concerns about Milledge's stats from last year, his drop off could indicate that pitchers have figured out weaknesses to exploit. It could also mean that Milledge had a minor injury after April, or that his momma stopped loving him, or he failed to emotionally handle his callup to New York and demotion back to Norfolk. These concerns would need to be addressed by scouting, because PECOTA (or any other statistically based system) is not going to have the answers to those questions.
Don't be surprised if that PECOTA projection ends up being pretty accurate, grover, and if it is close to true Milledge will be untouchable come July. Once a young potential star has success at the major league level, they almost never get traded. If we are going to trade for Milledge, this offseason will likely be the last chance to do so. Having never seen Milledge play, I have no particular scouting opinion of him, but the stats guys and scouts both think that Milledge has stardom in his future. They may be wrong, but that's a pretty broad spectrum of experts to disagree with.
by BlameChannel53 on Jan 17, 2007 6:56 AM PST up reply actions 0 recs
pray for the health ...
by devo on Jan 17, 2007 10:06 AM PST up reply actions 0 recs
yes, but . . .
by BlameChannel53 on Jan 17, 2007 10:12 AM PST up reply actions 0 recs
I'm not sure his momma ever loved him
I think the estimates need to be a touch more conservative in this case. I'm not saying Milledge will never reach those numbers (or beyond) but I think it a strecth to expect them in 2007.
Maybe an email to BP is in order...
by grover on Jan 17, 2007 11:18 AM PST up reply actions 0 recs
I think you're missing something
But anyway, this isn't a guy the stats like, but scouts have doubts about. He's raved about by scouts and purely statistics-based systems project him to be minor star at 22. I don't think a disappointing cup of coffee or a bad month at triple A are enough to counter all the weight on the positive side of the ledger here.
by yarky on Jan 17, 2007 2:51 PM PST up reply actions 0 recs
There's a red flag here
Milledge's best month, by far, came in April. He accomplished a 357/500/524 line while debuting in a new, more advanced league than he had played before. Normally a player will struggle when debuting in a more advanced level as he has to adjust to better talent, Milledge's early success does nothing but validate all the scouting reports that praise his natural talent.
However, being new to the league is a double edged sword. The opposing pitchers didn't know how to attack Milledge, they didn't have a scouting report that pointed out his tendencies, strengths and weaknesses. Some pitchers paid the price early in the year to devlope those scouting reports, but they learned and adapted and Milledge's production dropped off dramatically. He lost over 100 points in every offensive category and saw his K's go up.
The league had adjusted to Milledge and he showed little if any ability to adjust himself. Some of that was undoubtedly due to bouncing between Norfolk and New York but the cold hard truth is the opposition wrote a book on how to attack Milledge at the plate and they had a lot of success with it from May - September of 2006.
Baseball is not just about talent. It's also about learning and adjusting and that is something Milledge has yet to learn, or at least display. In fairness, the Mets have rushed him through 4 levels in the last two years and he has never had the time to learn from his mistakes and correct them. But it should not be ignored that for all his talent, Lasting Milledge struggled for the bulk of 2006 and it is his ability to learn from those struggles that will ultimately determine how good he becomes.
by grover on Jan 17, 2007 5:30 PM PST up reply actions 0 recs
Whenver you're dealing with an extreme ...
That said, it cuts both ways.
by devo on Jan 17, 2007 5:46 PM PST up reply actions 0 recs
That was a little vague
by grover on Jan 17, 2007 5:49 PM PST up reply actions 0 recs
You're welcome ...
by devo on Jan 17, 2007 10:48 PM PST up reply actions 0 recs
OK, there are 4 ways
Well, it's not like Shea Stadium is a RH hitters Mecca! Why wouldn't park effect come into play on both sides of the equation?
by grover on Jan 18, 2007 5:43 AM PST up reply actions 0 recs
It does come into play ...
Among players with at least 200 ABs, he did lead the Tide in OPS.
by devo on Jan 18, 2007 9:59 AM PST up reply actions 0 recs
OK
by yarky on Jan 18, 2007 1:54 AM PST up reply actions 0 recs
Each player is unique
by grover on Jan 18, 2007 5:48 AM PST up reply actions 0 recs

by 




















