A Reassessment of the Purchasing Managers’ Index

 

By Rolando F. Peláez

 

 

The Report on Business for manufacturing by the Institute for Supply Management (ISM), formerly the National Association of Purchasing Management (NAPM), is a highly significant indicator of economic activity (Niemira and Zukowski, 1998). The centerpiece of the report is the Purchasing Managers’ Index (PMI), which has a history of inducing large swings in bond and equity prices upon its release at 10:00 AM EST on the first business day of each month. Out of twenty-four regularly scheduled announcements, Fleming and Remolona (1997) rank the ISM survey seventh in order of importance for trading activity in the Treasury bond market. Ederington and Lee (1993, 1996) also report a statistically significant effect of the ISM survey on interest rates in both the U.S. Treasury and Eurodollar markets. Other studies have analyzed various aspects of the relationship between the PMI and economic activity (see, e.g., Kauffman 1999, Harris 1991, Bretz 1990, Klein and Moore 1988, and Dasgupta and Lahiri 1992). To some extent, financial market reactions to unanticipated changes in the PMI stem from expectations that the Federal Reserve may change its policy stance. According to the “policy anticipations hypothesis,” the Federal Reserve may change its target for the fed funds rate in response to macroeconomic news. The hypothesis argues that as the Fed’s objectives change, so does the market’s reaction to the arrival of unanticipated information, see, for example, Poole (1988) and Santomero (1991).

Given the PMI’s importance, it is worth considering if a better index can be constructed using ISM diffusion indices. This paper develops such an index. The new index outperforms the PMI as a predictor of the growth rate of real GDP. The plan of the work is as follows. The next section describes the PMI. This is followed by a comparison of the power of the PMI and of two alternative models to forecast the growth rate of real GDP. The discussion then turns to a new composite index of ISM indices. The final section concludes.

The Purchasing Managers’ Index

In 1931 the NAPM began to survey its members and to compute diffusion indices of various measures of economic activity for the manufacturing sector. The respondents answer questions that compare activity in the current month to the previous month. Responses are limited to better, worse, or unchanged. Adding the percentage reporting better and one-half of the percentage reporting unchanged, yields a diffusion index ranging from zero to 100 percent. Each index is seasonally adjusted individually, and the adjusted components then form the PMI. An increase in the PMI, say, from 60 to 70 indicates that an expansion is more widespread, but does not imply a proportionate acceleration in GDP growth.

In 1980, Theodore Torda, a senior economist with the U.S. Department of Commerce, introduced the PMI as an equally weighted index of five seasonally adjusted diffusion indices: new orders, production, employment, supplier deliveries, and inventories. In 1982, the Department of Commerce and the NAPM introduced a re-weighted PMI (Bretz, 1990). Torda selected the weights so as to maximize the correlation of the PMI with GDP growth (Torda, 1985). Unlike other indices that are re-weighted periodically to enhance their usefulness, the components’ weights have remained fixed since 1982. Torda’s PMI is,

 

 

       
   


where, NOI = new orders, PI = production, EMI = employment, SD = supplier deliveries, and INV = inventories, with all variables being expressed as diffusion indices.

The PMI provides the earliest indication of the strength of economic activity. Other monthly indicators released subsequently are subject to revisions that cloud their initial informational value. For example, the PMI precedes the advance release of GDP by one month and the final release by three months. It also precedes by three weeks the release of the Conference Board’s composite coincident index, which initially reflects only three of four components: industrial production, nonfarm employment, and personal income less transfer payments. The fourth component, manufacturing and trade sales, is unavailable for another two weeks. Moreover, all of the components undergo revisions in the subsequent two months. In contrast, the PMI itself is not revised, but the U.S. Department of Commerce revises the seasonal adjustment factors in January of each year.

Considering the scale of structural change in the economy over the last two decades, the fixed weight PMI has exhibited remarkable longevity. Economists have long known that sporadic breaks in parameters give rise to subsets of a sample with different parameter values (Keynes, 1921). A large body of recent literature confirms that the dynamics of many economic time-series shift unexpectedly. For example, Stock and Watson (1996) report that structural instability characterizes the behavior of seventy-six macroeconomic time-series. As a result, innovative econometric methods have been developed for dealing with structural breaks, for example, Hamilton’s (1989) Markov switching model. Given that many of the empirical regularities depicting the complex of relationships in economic models have changed over the last two decades, it is worth considering if an improvement to the PMI can be found, in the light of an additional two decades of data.

   
   


Alternative Models

This section compares the predictive value of the PMI to that of an unconstrained version of the PMI and a parsimonious model of three ISM indices. All diffusion indices are from the ISM’s Report on Business for manufacturing. The seasonally adjusted monthly indices were compacted into quarterly observations by averaging GDP, seasonally adjusted, in chained (1996) dollars is from the U.S. Department of Commerce, Bureau of Economic Analysis. The models are,

Equation 2, regresses the quarterly growth rate of real GDP (Gt) on the PMI. Equation 3 includes the five components of the PMI, but without a priori restrictions on their weights, that is, the model allows for time-varying coefficients and weights. This model outperforms Equation 2, confirming that the PMI is inefficiently weighted. Equation 4 consists of only the new orders, employment, and supplier deliveries indices.

Table 1 shows estimation results and tests of model adequacy for the period 1950:1—2003:1. Equation 2 is inferior to its rivals judging by the adjusted R-squared. The Schwarz criterion (SC) is a procedure for selecting undominated parsimonious models. Since a low value is preferred, both equations 3 and 4 are preferred to equation 2. It is worth noting that only two components of the PMI (PI and EMI) enter equation 3 significantly. This is not surprising given the high degree of collinearity among the PMI’s components. The pair-wise correlation coefficients in Table 2 show that some components are more strongly correlated with each other than with the growth rate of GDP. In fact, PI and NOI are almost perfect substitutes for each other. For all models, tests of the residuals reveal the absence of serial correlation, heteroscedasticity, and autoregressive conditional heteroscedasticity (ARCH). However, consistent with the presence of outliers, the Doornik-Hansen (1994) chi-squared test rejects the null of normality.

   



   


Out-of-Sample Forecasts of Real GDP Growth Rates

Ex ante one-quarter-ahead forecasts were obtained by recursively sampling a moving window of seventy quarters. Each model was first estimated sampling 1948:3— 1965:4, and forecasts for 1966:1 were obtained. The models were then re-estimated sampling 1948:4—1966:1 in order to obtain out-of-sample forecasts for 1966:2. At each step, the sample adds one more recent observation while discarding the most distant observation. This flexible procedure is motivated by the desire to allow the parameters to vary in response to structural changes and is also an attempt to confront a potential problem concerning supplier deliveries.

Supplier deliveries (SD), or vendor performance, is the sum of the percentage of respondents reporting slower deliveries and one-half of the percentage reporting unchanged. An increase in this index implies rising backlogs due to a more rapidly growing economy. As reported by the ISM, this series stretches back to January 1948, yet it consists of two series from different sources spliced together. The first segment from 1948 through 1970 is from the Purchasing Management Association of Chicago, while the segment since 1971 is from the ISM (Kauffman, 1999). Equation 1 shows that SD enters the PMI with a fixed weight of 0.15. However, in an unconstrained recursive regression framework, as in Equation 3, its coefficient is highly variable and ranges over positive and negative values. To some extent, parameter instability may be due to the splicing of two series; one with a regional focus and the other with a national focus. Attempting to mitigate this problem, the estimation used a seventy-quarter moving window of data, thus gradually phasing-out the earlier data set.

Forecast Accuracy

The three most commonly used measures of forecast accuracy are mean squared forecast error (MSFE), mean absolute error (MAE), and Theil’s inequality coefficient. Low values are preferred; indeed, all three measures are zero if forecasts are perfect. Formally, let F2, F3, and F4 denote the one-step-ahead forecasts obtained with equations 2, 3 and 4, respectively. Table 3 shows that F3 and F4 outperform F2 according to all three measures. However, it is important to test if the difference in MSFE is statistically significant. Granger and Newbold (1977) point out that if the forecast errors from one model are correlated with those from another, the usual variance ratio or F test is inappropriate for this task. They develop a test of one-step-ahead forecasts that is valid given unbiased forecasts and absence of autocorrelated forecast errors. Their test is similar to a test proposed by Morgan (1939).

   


   

MFE and MAFE are the mean forecast error and the mean absolute forecast error, respectively. MSFE is the mean-squared forecast error, and RMSFE is the square root of the mean squared forecast error. Theil’s U statistic, or inequality coefficient, assesses predictive accuracy relative to a naïve no-change model. It is unitary when the MSFE equals the mean square error of naïve no-change forecasts, and it is less than 1.0 if predictions are more accurate than no-change extrapolations. P-values are the significance levels of test statistics (i.e., the probability of getting a number at least as large under the zero null). The Jarque-Bera (1980) statistic tests for normality of the forecast errors.

The first step is to determine if there is systematic bias in a forecast, that is if the MFE differs significantly from zero. This is carried out by regressing the forecast error on a constant and using the t-test to determine if the estimated constant (i.e., the MFE), differs significantly from zero. The test results (P-values for forecast bias) shown in Table 3 indicate that all three forecasts are unbiased. Now, let two competing models produce forecast errors e1t and e2t. The Granger-Newbold test of equal forecast accuracy is based on the following orthogonalization,

The two forecast error variances, and hence the two expected squared forecast errors, will be equal only if the covariance between Xt and Zt is zero,

Harvey, Leybourne, and Newbold (1997) set up the Granger-Newbold test in the form of the following regression,


Under the null of equal MSFE, Xt and Zt are uncorrelated, i.e., the estimated ß equals zero. The test results shown in Table 4 reject the null of equal MSFE for F2 and F3, and for F2 and F4, but not for F3 and F4. Therefore, while the PMI-based F2 is demonstrably inferior to its rivals, the difference in MSFE between F3 and F4 is not
significant.

   
   


Forecast Encompassing

Forecast encompassing tests are important for evaluating competing models. Forecast encompassing is equivalent to the concept of conditional efficiency of Granger and Newbold (1973). A forecast is conditionally efficient if combining it with a rival forecast does not yield a smaller forecast error variance than that of the original forecast alone. Denote the respective forecast errors e2, e3, and e4,



If the difference between F3 and F2 explains a portion of e2, then equation 2 can be improved by incorporating some features from equation 3. Equivalently, combining F2 and F3 will yield a better forecast than either F2 or F3 individually. These tests are part of a progressive research strategy designed to find a model that dominates its rivals (Clements and Hendry, 1998). Under the null, F3 does not encompass F2. The null is rejected, that is, F3 encompasses F2 if in equation (12) a differs significantly from zero,

   
   
The results of forecast encompassing tests in Table 5 show that the PMI-based forecast, F2, does not encompass F3 or F4. However, F3 encompasses F2 leading to the inescapable conclusion that the PMI is inefficiently weighted, since the only difference between the information sets in equations 2 and 3 is in the components’ weights. F4 also encompasses F2; therefore, both equations 3 and 4 dominate equation 2. Finally, F3 (forecasts from the unconstrained model) do not encompass F4. Therefore, the inventories and production indices have no informational value in the presence of new orders, employment, and supplier deliveries, again underscoring the inefficiency of the PMI.
   
   


A New Index

This section proposes a new index as an alternative to the PMI. The index consists of new orders, employment, and supplier deliveries and is designated as NES. Unlike the PMI, it is characterized by variable weights. Coefficients were obtained with rolling regressions estimated over a seventy-quarter moving window. The coefficients reflect only information available up to each point t. The sum of the recursive coefficients is St,

where the α's are the regression coefficients of equation (4). Unlike the fixed-weighted PMI, the components’ weights now evolve over time with the evolving recursive coefficients,

In the proposed index the components’ weights are lagged one quarter, so that at each point they reflect only past information,

   
   

 

Figure 1 plots the NES and the PMI (dotted line). For the sample period 1966:1—2003:1, the NES leads the PMI by one quarter or more at nearly every turning point. Equation 17 is characterized by variable weights, but a simpler version with constant weights also outperforms the PMI. Constant weights may be obtained from regression coefficients, or simply by averaging the variable weights obtained for the period 1966:1—2003:1,

The regressions in Table 6 show that a constant-weighted version of the NES also outperforms the PMI as a measure of economic activity over the extended sample period 1948:1—2003:1, as well as over the more recent 1966:1—2003:1.

   
   


Conclusions

Theodore Torda’s ingenious creation, the PMI, has enjoyed remarkable longevity as an indicator of economic activity, notwithstanding the declining importance of the manufacturing sector. While several studies have documented the effects on financial markets of unexpected changes in the PMI, the literature has ignored whether the PMI is optimally weighted. The evidence in this paper shows that it is not. Specifically, in the presence of the new orders, employment and supplier deliveries indices, the production and inventories indices contribute nothing to the explanation of GDP growth. The new composite index proposed in this paper has greater informational value than the PMI. Given the importance of the PMI for market participants, an index that is as timely as the PMI and that provides a more accurate picture of the economy should be well received.

A C K N OW L E D G E M E N T S

The author acknowledges useful comments by Ralph Kauffman and by two anonymous referees.

   
   

 

R E F E R E N C E S
Bretz, Robert, J. 1990. “Behind the Economic Indicators of the NAPM Report on Business.” Business Economics. Vol. 25 No. 3. Pp. 42-50.

Breusch, T.S. 1978. “Testing for Autocorrelation in Dynamic Linear Models.” Australian Economic Papers. Vol. 17. Pp. 334-
355.

Chow, G.C. 1980. “Tests of Equality Between Sets of Coefficients
in Two Linear Regressions.” Econometrica. Vol. 28. Pp. 591-605.

Clements, Michael P., and David F. Hendry. 1998. Forecasting
Economic Time Series. Cambridge: Cambridge University Press.

Cook, Timothy and Korn, Stephen. 1991. “The Reaction of
Interest Rates to the Employment Report: The Role of Policy
Anticipations.” Economic Review. Federal Reserve Bank of Richmond. September-October. Pp. 3-12.

Dasgupta, Susmita and Lahiri, Kajal. 1992. “A Comparative
Study of Alternative Methods of Quantifying Qualitative Survey
Responses Using NAPM Data.” Journal of Business and Economic
Statistics. Vol. 10. No. 4. Pp. 391-400. October.

Doornik, J.A., and H. Hansen. 1994. “A Practical Test for
Univariate and Multivariate Normality.” Discussion Paper. Nuffield
College.

Ederington, Louis H. and Lee, Jae H. 1996. “The Creation and
Resolution of Market Uncertainty: The Impact of Information Releases on Implied Volatility.” The Journal of Financial and Quantitative Analysis. Vol. 31. No. 4. Pp. 513-39.

Ederington, Louis H. and Lee, Jae H. 1993. “How Markets
Process Information: News Releases and Volatility.” The Journal of
Finance. Vol. 48. No. 4. Pp.1161-91.

Engle, R.F. 1982. “Autoregressive Conditional Heteroscedasticity, With Estimates of the Variance of United Kingdom Inflations.” Econometrica. Vol.50. Pp. 987-1007.

Fleming, Michael, J., and Eli, M. Remolona. 1997. “What Moves
the Bond Market?” Economic Policy Review. Federal Reserve Bank of New York. Pp. 31-50. December.

Godfrey, L.G. 1978 “Testing for Higher Order Serial Correlation
in Regression Equations when the Regressors Include Lagged
Dependent Variables.” Econometrica. Vol. 46. Pp.1303-1313.

Granger, C.W.J., and Paul Newbold. 1973. “Some Comments on
the Evaluation of Economic Forecasts.” Applied Economics. Vol. 5. Pp. 35-47.

Granger, C.W.J., and Paul Newbold.1977. Forecasting Economic
Time Series. New York: Academic Press.

Hamilton, James D. 1989. “A New Approach to the Economic
Analysis of Nonstationary Time Series and the Business Cycle.”
Econometrica. Vol. 57. Pp. 357-384.

Harris, Ethan S. 1991. “Tracking the Economy with the Purchasing Managers’ Index.” Quarterly Review. Federal Reserve
Bank of New York. Pp. 61-69. Fall.

Harvey, D.I., S.J. Leybourne, and P. Newbold. 1997. “Testing the
Equality of Prediction Mean Squared Errors.” International Journal of
Forecasting. Vol. 13. Pp. 281-291.

Kauffman, Ralph G. 1999. “Indicator Qualities of the NAPM
Report on Business.” The Journal of Supply Chain Management. Pp.
29-37. Spring.

Keynes, John Maynard. 1921. A Treatise on Probability. London:
McMillan.

Klein, Philip A., and Geoffrey H. Moore. 1988. “N.A.P.M.
Business Survey Data: Their Value as Leading Indicators.” Journal of
Purchasing and Materials Management. Pp. 32-40. Winter

Morgan, W.A. 1939. “A Test for Significance of the Difference
Between Two Variances in a Sample.” Biometrika. Vol. 31. Pp. 13-19.

Niemira, Michael P., and Gerald F. Zukowski. 1998. Trading the
Fundamentals. New York: Academic Press.

Poole, William. 1988. “Monetary Policy Lessons of Recent
Inflation and Disinflation.” Journal of Economic Perspectives. Vol. 2.
Pp.73-100.

Santomero, Anthony M. 1991. “Money Supply Announcements: A
Retrospective.” Journal of Economics and Business. Vol. 43. Pp. 1-23.

Stock, J.H., and M.W. Watson. 1996. “Evidence on Structural
Instability in Macroeconomic Time-Series.” Journal of Business and
Economic Statistics. Vol. 14. Pp. 11-30.

Torda, Theodore S. 1985. “Purchasing Management Index
Provides Early Clue on Turning Points.” Business America. U.S.
Department of Commerce. Pp 11-13. June 24.

White, H. 1980. “A Heteroscedastic-Consistent covariance
Matrix Estimator and a Direct Test for Heteroscedasticity.”
Econometrica. Vol. 48. Pp. 817-838.