Shop Fantasy Travel Free e-mail About Us Statitudes

 

CNN/SI Home Statitudes Front Archive Fantasy Baseball Pro Footballs College Football Pro Basketball College Basketball Hockey Golf Soccer Tennis Motor Sports Women's Sports

EVENTS
 Sportsman of the Year
 Heisman Trophy
 Swimsuit 2001

CENTERS
 Fantasy Central
 Inside Game
 Video Plus
 Statitudes
 Your Turn
 Message Boards
 Email Newsletters
 Golf Guide
 Cities
 

CNNSI.com GROUP
 Sports Illustrated
 Life of Reilly
 SI Women
 SI for Kids
 Press Room
 TBS/TNT Sports
 CNN Languages

COMMERCE
 SI Customer Service
 SI Media Kits
 Get into College
 Sports Memorabilia
 TeamStore

Eighth-grade math helps hockey

Click here for more on this story
Posted: Monday October 15, 2001 8:23 PM
Updated: Tuesday October 16, 2001 12:55 AM

 

By Marc Foster and Chris Apple, special to CNNSI.com

One of the things we hope to accomplish through this column is to elevate the level of statistical analysis in hockey to the level seen in baseball. In baseball, the science of stats is called sabermetrics, after SABR, the Society for American Baseball Research. While hockey has a similar organization called the Society for International Hockey Research (SIHR), this is yet to be called "sihrmetrics" or "sirmetrics." "Hockeymetrics" has been bandied about but it hasn't caught on yet. We're open to suggestions.

Anyway, back to the stats.

Bill James, the Dean of Sabermetrics, has a simple tool to measure if a team over- or under-achieved for a given season based on the total runs scored and allowed. Ideally, a team that scores 800 runs and allows 800 runs should finish with a winning percentage at exactly .500. But what about teams who score 850 and allow 700? James came up with the following equation to handle this:

Pythagorean Winning Percentage

RF^2
-------------
RF^2 + RA^2

  • RF = runs scored; RA = runs allowed.

    Conveniently, the same equation works just as well for hockey. We can substitute runs for goals to nearly the same effect, giving us:

    Pythagorean Winning Percentage

    GF^2
    -------------
    GF^2 + GA^2

  • GF = goals scored; GA = goals allowed.

    There is one small difference, however. Depending upon the era, adjustments need to be made to the exponential factor. In the early days of hockey (prior to 1942), when scoring was at a premium and the best goalies recorded shutouts nearly every other game, the factor is 1.63. In the Original Six era (1942-67), the final factor used is 1.92. In the Era of Expansion (from 1967 to present), 2.03 minimizes the total error, but is close enough to 2.0 to use the latter for everyday use.

    Once the Pythagorean Winning Percentage (Pyth%) has been calculated, it can be multiplied by the number of possible points earned in a season to easily evaluate how the team performed in comparison to theory. This also allows a statistician to derive the average error -- how the equation performed in relation to reality.

    The average error for the first era was 3.01 points, second era 3.33 points, and for the current era 3.61 points. It may appear that the equation is losing its effectiveness over time, but one must also remember that more games are played today than in the past. The modern error for an 82-game season is about 2.2 percent, while the early period error for seasons of 48 games or less is at least 3.76 percent.

    One problem with the current use of the equation exists, though; regulation ties. The equation is based on the concept that all games are created equal. In the past, they were since exactly two points were on the line every night, creating a hockey equivalent to the First Law of Thermodynamics -- matter and energy cannot be created nor destroyed. Regulation ties screwed this up. Without a diatribe about what an abomination the new scoring system is, simply stated: it makes working with this equation more difficult. This resulted in last season's chart examining stats in both the original format and one in which all regulation ties are counted as losses.

    Looking at last season, the top and bottom of the rankings are of some interest. The table is sorted by the real system difference, but removing the regulation ties reveals some sharper splits in the data. Carolina, Detroit, Boston, Colorado and Phoenix all outperformed the equation by two wins or more, while New Jersey, Toronto, Montreal and Florida all underperformed by three to four wins each. It is interesting to note that although Montreal finished well out of the playoffs, had the standings reflected this equation, the Canadiens would actually have been less than five points out. Perhaps they’re already making up for it early this season.

    It may never be quantitatively determined why one team swings up whereas another goes down. While a correlation for team average age came up empty, there was a strong correlation, however, between team age and overall performance. On the subjective side, it could be anything from coaching to an abundance of lucky (or unlucky) breaks. James has some ideas on multiple season trends for teams, but many of those come from a time when rosters were much more static than today.

    Our Web site compiles a nearly complete history of Pythagorean records dating back to the start of the NHL broken up by both season and franchise. We’ll also be returning to this topic around the middle of the season to see where teams stand.

    2000-01 Rankings, new system
    Team  Pts.  GF  GA  Real %  Pyth%  Old Pt.  Pyth Pt.  Diff_R  Diff_O 
    1 Boston  88  227  249  .537  .454  80  74.4  13.6  5.6 
    2 Carolina  88  212  225  .537  .470  85  77.1  10.9  7.9 
    3 Detroit  111  253  202  .677  .611  107  100.2  10.8  6.8 
    4 Colorado  118  270  192  .720  .664  114  108.9  9.1  5.1 
    5 Vancouver  90  239  238  .549  .502  83  82.3  7.7  0.7 
    6 Phoenix  90  214  212  .549  .505  87  82.8  7.2  4.2 
    7 Pittsburgh  96  281  256  .585  .546  93  89.6  6.4  3.4 
    8 Philadelphia  100  240  207  .610  .573  97  94.0  6.0  3.0 
    9 Washington  96  233  211  .585  .549  92  90.1  5.9  1.9 
    10 Calgary  73  197  236  .445  .411  69  67.3  5.7  1.7 
    11 Columbus  71  190  233  .433  .399  65  65.5  5.5  -0.5 
    12 Anaheim  66  188  245  .402  .371  61  60.8  5.2  0.2 
    13 Minnesota  68  168  210  .415  .390  63  64.0  4.0  -1.0 
    14 Nashville  80  186  200  .488  .464  77  76.1  3.9  0.9 
    15 Ottawa  109  274  205  .665  .641  105  105.1  3.9  -0.1 
    16 Dallas  106  241  187  .646  .624  104  102.4  3.6  1.6 
    17 Edmonton  93  243  222  .567  .545  90  89.4  3.6  0.6 
    18 Tampa Bay  59  201  280  .360  .340  54  55.8  3.2  -1.8 
    19 San Jose  95  217  192  .579  .561  92  92.0  3.0  0.0 
    20 Atlanta  60  211  289  .366  .348  58  57.0  3.0  1.0 
    21 Buffalo  98  218  184  .598  .584  97  95.8  2.2  1.2 
    22 NY Rangers  72  250  290  .439  .426  71  69.9  2.1  1.1 
    23 Chicago  71  210  246  .433  .422  66  69.1  1.9  -3.1 
    24 Los Angeles  92  252  228  .561  .550  89  90.2  1.8  -1.2 
    25 St. Louis  103  252  228  .628  .620  98  101.7  1.3  -3.7 
    26 Florida  66  200  246  .402  .398  57  65.3  0.7  -8.3 
    27 NY Islanders  52  185  268  .317  .323  49  52.9  -0.9  -3.9 
    28 Toronto  90  232  207  .549  .557  85  91.3  -1.3  -6.3 
    29 Montreal  70  206  232  .427  .441  64  72.3  -2.3  -8.3 
    30 New Jersey  111  295  195  .677  .696  108  114.1  -3.1  -6.1 
    Pts. -- actual points earned by the team (2 for a win, 1 for a tie or RT)
    GF -- goals scored by the team
    GA -- goals scored by the opposition
    Real% -- The real winning percentage, based on pts/164
    Pyth% -- The Pythagorean winning percentage, based on the equation
    OldPt. -- Points team would earn in old system, where an OT loss earns no points
    PythPt. -- Points earned based on Pyth%, where PythPt.= Pyth% x 164
    Diff_R -- The difference between Pts and PythPt.
    Diff_O -- The difference between OldPt and PythPt.
     

    Top overachieving teams of the past 50 years, by percentage
    Season  Team  Pts.  GF  GA  Win%  Pyth%  %Diff  PythPts.  PtsDiff. 
    1979-80  Philadelphia  116  327  254  0.725  0.625  0.099  100.08  15.92 
    1985-86  Washington  107  315  272  0.669  0.574  0.095  91.83  15.17 
    1985-86  Edmonton  119  426  310  0.744  0.656  0.088  104.95  14.05 
    1957-58  Detroit  70  176  207  0.500  0.423  0.077  59.19  10.81 
    1993-94  Pittsburgh  101  299  285  0.601  0.524  0.077  88.09  12.91 
    1983-84  Edmonton  119  446  314  0.744  0.671  0.073  107.35  11.65 
    1956-57  NY Rangers  66  184  227  0.471  0.401  0.071  56.08  9.92 
    1987-88  Buffalo  85  283  305  0.531  0.462  0.069  73.93  11.07 
    Pts. -- actual points earned by team during season
    GF -- goals scored by team
    GA -- goals allowed by team
    Win% -- actual winning percentage, or Pts/(2 x GP)
    Pyth% -- winning percentage derived using Pythagorean equation
    %Diff -- the difference in Win% and Pyth%
    PythPts. -- theoretical points earned using Pythagorean equation
    PtsDiff. -- the difference between Pts and PythPts
     

    Top underachieving teams of the past 50 years, by percentage
    Season  Team  Pts.  GF  GA  Win%  Pyth%  %Diff  PythPts.  PtsDiff. 
    1994-95  Chicago  53  156  115  0.552  0.650  -0.098  62.40  -9.40 
    1952-53  Detroit  90  222  133  0.643  0.728  -0.085  101.90  -11.90 
    1975-76  NY Islanders  101  297  190  0.631  0.712  -0.081  113.98  -12.98 
    1974-75  Boston  94  345  245  0.588  0.667  -0.079  106.73  -12.73 
    1967-68  Toronto  76  209  176  0.514  0.586  -0.073  86.78  -10.78 
    1953-54  Montreal  81  195  141  0.579  0.651  -0.072  91.11  -10.11 
    Pts. -- actual points earned by team during season
    GF -- goals scored by team
    GA -- goals allowed by team
    Win% -- actual winning percentage, or Pts/(2 x GP)
    Pyth% -- winning percentage derived using Pythagorean equation
    %Diff -- the difference in Win% and Pyth%
    PythPts. -- theoretical points earned using Pythagorean equation
    PtsDiff. -- the difference between Pts and PythPts
     

    Marc Foster is a research analyst in Fort Worth, Texas. Chris Apple is a database analyst/Internet specialist in Lincoln, Neb. Together, they operate HockeyResearch.com, and hope to one day elevate statistical research in hockey to the level seen in other sports.


     
    Related information
    Stories
    Just the Stats: Players finding it harder to lie about stats
    Multimedia
    Visit Video Plus for the latest audio and video
    Search our site Watch CNN/SI 24 hours a day
    Sports Illustrated and CNN have combined to form a 24 hour sports news and information channel. To receive CNN/SI at your home call your cable operator or DirecTV.


    CNNSI Copyright © 2001
    CNN/Sports Illustrated
    An AOL Time Warner Company.
    All Rights Reserved.

    Terms under which this service is provided to you.
    Read our privacy guidelines.