Estimation of Treatment Effects from Combined Data: Identification versus Data Security

Tatiana Komarova, Denis Nekipelov, Evgeny Yakovlev

This chapter is a preliminary draft unless otherwise noted. It may not have been subjected to the formal review process of the NBER. This page will be updated as the chapter is revised.

Chapter in forthcoming NBER book Economic Analysis of the Digital Economy, Avi Goldfarb, Shane Greenstein, and Catherine Tucker, editors
Conference held June 6-7, 2013
Forthcoming from University of Chicago Press

The security of sensitive individual data is a subject of indisputable importance. One of the major threats to sensitive data arises when one can link sensitive information and publicly available data. In this paper the authors demonstrate that even if the sensitive data are never publicly released, the point estimates from the empirical model estimated from the combined public and sensitive data may lead to a disclosure of individual information. Their theory builds on the work in Komarova, Nekipelov and Yakovlev (2011) where they analyze the individual disclosure that arises from the releases of marginal empirical distributions of individual data. The disclosure threat in that case is posed by the possibility of a linkage between the released marginal distributions. In this chapter, they analyze a different type of disclosure. Namely, they use the notion of the risk of statistical partial disclosure to measure the threat from the inference on sensitive individual attributes from the released empirical model that uses the data combined from the public and private sources. As the main example the authors consider a treatment effect model in which the treatment status of an individual constitutes sensitive information.

download in pdf format
   (201 K)

email paper

This paper is available as PDF (201 K) or via email.

This paper was revised on May 14, 2014

Machine-readable bibliographic record - MARC, RIS, BibTeX

Users who downloaded this chapter also downloaded these:
Wu and Brynjolfsson The Future of Prediction: How Google Searches Foreshadow Housing Prices and Sales
Danaher, Dhanasobhon, Smith, and Telang Understanding Media Markets in the Digital Age: Economics and Methodology
Gans and Halaburda Some Economics of Private Digital Currency
Mann Information Lost: Will the "Paradise" that Information Promises, to both Consumer and Firm, be "Lost" on Account of Data Breaches? The Epic is Playing Out
Agrawal, Horton, Lacetera, and Lyons Digitization and the Contract Labor Market: A Research Agenda
NBER Videos

National Bureau of Economic Research, 1050 Massachusetts Ave., Cambridge, MA 02138; 617-868-3900; email:

Contact Us