A Semiparametric Approach for Analyzing Nonignorable Missing Data

Hui Xie, Yi Qian, Leming Qu

NBER Working Paper No. 16270
Issued in August 2010
NBER Program(s): Productivity, Innovation, and Entrepreneurship

In missing data analysis, there is often a need to assess the sensitivity of key inferences to departures from untestable assumptions regarding the missing data process. Such sensitivity analysis often requires specifying a missing data model, which commonly assumes parametric functional forms for the predictors of missingness. In this paper, we relax the parametric assumption and investigate the use of a generalized additive missing data model. We also allow for a non-linear relationship between missingness and the potentially missing outcome, whereas the existing literature commonly assumes a more restrictive linear relationship. To avoid computational complexity, we adopt an index approach for local sensitivity and derive explicit formulas for the resulting semiparametric sensitivity index. The computation of the index is simple and completely avoids the need to repeatedly fit the semiparametric nonignorable model: it requires only the estimates from a standard software analysis plus a moderate amount of additional computation. Thus, the semiparametric index provides a fast and robust method to adjust the standard estimates for nonignorable missingness. We conduct an extensive simulation study to evaluate the effects of misspecifying the missing data model and to compare the performance of the proposed approach with that of commonly used parametric approaches. The simulation study shows that the proposed method helps reduce the bias that can arise from misspecifying the functional forms of predictors in the missing data model. We illustrate the method using a wage offer dataset.
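To make the notion of nonignorable missingness concrete, the following is a minimal simulation sketch. It is not the paper's sensitivity index or its simulation design. The data-generating model, the quadratic term in x, the parameter name gamma, and the function name are all assumptions chosen for illustration. It shows only that when the probability of observing the outcome depends on the outcome itself (gamma different from zero), the complete-case mean drifts away from the true mean.

```python
import math
import random


def simulate_complete_case_mean(gamma, n=20000, seed=0):
    """Mean of the observed outcomes under a hypothetical missingness model.

    gamma controls how strongly the chance of observing y depends on y itself;
    gamma = 0 corresponds to missingness that does not depend on the outcome.
    """
    rng = random.Random(seed)
    observed = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        # Assumed outcome model: the true mean of y is 1.0.
        y = 1.0 + 0.5 * x + rng.gauss(0.0, 1.0)
        # Assumed missingness model: non-linear (quadratic) in x, linear in y.
        eta = 1.0 - 0.5 * x * x + gamma * y
        p_observed = 1.0 / (1.0 + math.exp(-eta))
        if rng.random() < p_observed:
            observed.append(y)
    return sum(observed) / len(observed)


if __name__ == "__main__":
    for gamma in (0.0, 0.5, 1.0):
        mean_observed = simulate_complete_case_mean(gamma)
        print(f"gamma={gamma}: complete-case mean = {mean_observed:.3f}")
```

In this sketch the quadratic term in x is symmetric around zero, so with gamma = 0 the complete-case mean stays close to the true value of 1.0, while positive gamma pushes it upward because larger outcomes are more likely to be observed.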


Document Object Identifier (DOI): 10.3386/w16270

Published: Xie, Hui, Yi Qian and Leming Qu. 2011. A Semiparametric Approach for Analyzing Nonignorable Missing Data. Statistica Sinica. 21: 1881-1899.

