Are More Data Always Better for Factor Analysis?

Jean Boivin; Serena Ng

doi:10.3386/w9829

Are More Data Always Better for Factor Analysis?

Jean Boivin & Serena Ng

Working Paper 9829

DOI 10.3386/w9829

Issue Date July 2003

Factors estimated from large macroeconomic panels are being used in an increasing number of applications. However, little is known about how the size and the composition of the data affect the factor estimates. In this paper, we question whether it is possible to use more series to extract the factors, and yet the resulting factors are less useful for forecasting, and the answer is yes. Such a problem tends to arise when the idiosyncratic errors are cross-correlated. It can also arise if forecasting power is provided by a factor that is dominant in a small dataset but is a dominated factor in a larger dataset. In a real time forecasting exercise, we find that factors extracted from as few as 40 pre-screened series often yield satisfactory or even better results than using all 147 series. Weighting the data by their properties when constructing the factors also lead to improved forecasts. Our simulation analysis is unique in that special attention is paid to cross-correlated idiosyncratic errors, and we also allow the factors to have stronger loadings on some groups of series than others. It thus allows us to better understand the properties of the principal components estimator in empirical applications.

Copy Citation

Jean Boivin and Serena Ng, "Are More Data Always Better for Factor Analysis?," NBER Working Paper 9829 (2003), https://doi.org/10.3386/w9829.

Download Citation

MARC RIS BibTeΧ

Are More Data Always Better for Factor Analysis?

Published Versions

Related

Topics

Programs

More from the NBER