Skip to main content

The Predictive Power of PythagenPuck

Last season I made use of the PythagenPuck formula as outlined by Alan Ryder of HockeyAnalytics.com to make some (fortunately) successful predictions back in November that the Ottawa Senators weren't anywhere near as bad as their sub-.500 record at the time indicated, and that the Boston Bruins weren't exactly the playoff-caliber team they appeared to be according to the standings.  The basic idea was that by looking at the ratio between each team's Goals For and Goals Against totals, you get a good idea of where their actual Winning Percentage will end up.  The presumption there is that a team's overall ability to score and prevent goals is more constant than the bounces and breaks that can decide individual game results, and over the course of 82 games the "luck" will even out to a large extent.
This summer, I thought I'd revisit that analysis, to answer a few questions.
1.  Did I just get lucky, or is PythagenPuck actually a better indicator than actual Winning Percentage of where teams stand during the early part of the season?
2.  At what point does it make sense to do this kind of analysis?  After 10 games, 20, 30...?
3.  What kinds of events during the season helped make large differences in team performance and the Goals For/Against ratio?  In other words, if a GM decides that action needs to be taken, what works, and what doesn't?
First, a few ground rules to clarify my use of PythagenPuck.  Since it relates to Goals For & Against during regular action, I back out points from Shootout Wins from the Winning Percentage totals.  I then took a snapshot at each game in the regular season, and recorded each teams Actual Winning Percentage, as well as Expected Winning Percentage as given by the PythagenPuck formula based on total Goals For & Against up to that point.  For those interested in the nitty-gritty details, yes, I adjusted the exponent at each step along the way to account for the league-wide goals-per-game value as it changed during the season.
Finally, I ran correlation figures at each game along the 82-game journey between [Actual Win %(at Game X) & Final Actual Win %], and [Exp Win %(at Game X) & Final Actual Win %], and produced the following chart.
PythagenPuck correlation to actual winning percentage NHL 2006-7
What this demonstrates is that for most of the 2006-07 season, the blue line (Expected Win %) was actually more closely correlated with the final actual Winning Percentage than Actual Win % at that point in time.  Only around Game 63 do actual results take over, which makes sense because at Game 82, the values are equal to each other by definition.  So at least for last season, Question #1 would appear to be that yes indeed, PythagenPuck does indeed hold some value.  I would need to see this play out over a few more seasons before I proclaimed it to be a crystal ball, however.
As to Question #2, we have an even more interesting answer.  It looks like around the Game 15 mark, both lines reach a certain point that isn't improved upon for another 25 games or so.  In other words, absent any major disruptive issues like a key player injury or coaching change, the picture you have around the 15-20 game mark is most likely a realistic one for the course of the season.  If, as a GM, both Actual and Expected Winning Percentage aren't where you'd like them to be by mid-November, then something substantial would need to change, and the longer you wait, the less time you're left with to turn things around.  If you have a case of mixed signals, such as last November when the Senators Expected Win % was much higher than Actual results, judgment would have to come into play as to the direction of the team.
To answer Question #3, I'll follow up with a series of posts over the next week looking at some of the major moves up or down that took place through the course of the season, and see how they relate to events such as injuries, coaching changes, and trades.
There are of course, a couple refinements I'd like to make to this analysis, which I may pursue before the start of the regular season.  First, I would like to make some adjustment for the quality of opposition and it's impact on the numbers used above, and secondly, I'd like to exclude Empty Net goals from the analysis.  Empty Net goals only serve to pad the Goals For of teams that have already pretty much locked a game up, and conversely inflate the Goals Against of teams that are about to lose.  When I ran summary-level analysis and excluded Empty Netters, I did get a slightly more accurate result between Expected Win % and Actual, but I haven't had time yet to make that adjustment on my game-by-game figures that drove the analysis above.

Popular posts from this blog

Cheer up, it's the holidays...

Why is it that various media outlets continue trying to put their own spin on the "what's wrong with the NHL" story? Our latest example comes from The Hockey News , in a piece by Jay Greenburg entitled, "Excitement Level On The Decline." Take the opening sentence: Attendance is down and yet still up from before the lockout, leaving it arguable whether buildings in New Jersey and Florida are half-full or half empty. It's no surprise that attendance is down from last season, particularly if you compare the first half of 2005-06 to the first half of this year. Coming out of the lockout, there were legions of fans starved to see the on-ice product, particularly in light of the massive rule changes. This year is more indicative of business as usual, so the fact that the league is above pre-lockout levels is a positive. Toss in the projection that overall revenues are increasing despite a 1% decrease in attendance, and I'd say that paying fans have come back

How I'm Trying To Make Money Sports Blogging

To kick off this series of articles general sports-blogging articles here at OTF Classic, I think it's best to start with a comment that Brad left here last week, after I shared my goals for 2012 , which include specific revenue targets: I considered diving into the world of internet marketing myself, but I felt that my friends would hate me for bugging them about stuff. I mean, it's pretty low-risk high-reward, so it's tempting. I wouldn't mind reading about tips on how to maximize impact of blogging in general to make it a legitimate income source. Trying to make money at sports blogging can be a very touchy subject - for the vast majority of us, this is an activity we pursue to both exercise our creativity and share our love of the game, whether it's hockey, football, badminton, whatever, with fellow fans. Mixing that personal conversation with a commercial message can turn people off, especially if it becomes too intrusive for the reader. It's not unrea

Celebrating a milestone month

I've been remiss in providing regular updates on my quest to turn this whole sports-blogging hobby into at least something of a significant side income, if not a career, but good news has a way of prompting action. That, and I've been heads-down busy working on a few different fronts to push things forward...