NATIONAL BUREAU OF ECONOMIC RESEARCH
NATIONAL BUREAU OF ECONOMIC RESEARCH

Imputation in U.S. Manufacturing Data and Its Implications for Productivity Dispersion

T. Kirk White, Jerome P. Reiter, Amil Petrin

NBER Working Paper No. 22569
Issued in August 2016
NBER Program(s):IO, PR

In the U.S. Census Bureau's 2002 and 2007 Censuses of Manufactures 79% and 73% of observations respectively have imputed data for at least one variable used to compute total factor productivity. The Bureau primarily imputes for missing values using mean-imputation methods which can reduce the true underlying variance of the imputed variables. For every variable entering TFP in 2002 and 2007 we show the dispersion is significantly smaller in the Census mean-imputed versus the Census non-imputed data. As an alternative to mean imputation we show how to use classification and regression trees (CART) to allow for a distribution of multiple possible impute values based on other plants that are CART-algorithmically determined to be similar based on other observed variables. For 90% of the 473 industries in 2002 and the 84% of the 471 industries in 2007 we find that TFP dispersion increases as we move from Census mean-imputed data to Census non-imputed data to the CART-imputed data.

You may purchase this paper on-line in .pdf format from SSRN.com ($5) for electronic delivery.

Access to NBER Papers

You are eligible for a free download if you are a subscriber, a corporate associate of the NBER, a journalist, an employee of the U.S. federal government with a ".GOV" domain name, or a resident of nearly any developing country or transition economy.

If you usually get free papers at work/university but do not at home, you can either connect to your work VPN or proxy (if any) or elect to have a link to the paper emailed to your work email address below. The email address must be connected to a subscribing college, university, or other subscribing institution. Gmail and other free email addresses will not have access.

E-mail:

Machine-readable bibliographic record - MARC, RIS, BibTeX

Document Object Identifier (DOI): 10.3386/w22569

Users who downloaded this paper also downloaded* these:
Cravino and Levchenko w22498 Multinational Firms and International Business Cycle Transmission
Kane and Staiger w14607 Estimating Teacher Impacts on Student Achievement: An Experimental Evaluation
Cohen, Hahn, Hall, Levitt, and Metcalfe w22627 Using Big Data to Estimate Consumer Surplus: The Case of Uber
White, Reiter, and Petrin w17816 Plant-level Productivity and Imputation of Missing Data in U.S. Census Manufacturing Data
Li and Hall w22473 Depreciation of Business R&D Capital
 
Publications
Activities
Meetings
NBER Videos
Themes
Data
People
About

National Bureau of Economic Research, 1050 Massachusetts Ave., Cambridge, MA 02138; 617-868-3900; email: info@nber.org

Contact Us