Using Split Samples to Improve Inference about Causal Effects
NBER Working Paper No. 21842
We discuss a method aimed at reducing the risk that spurious results are published. Researchers send their datasets to an independent third party who randomly generates training and testing samples. Researchers perform their analysis on the former and once the paper is accepted for publication the method is applied to the latter and it is those results that are published. Simulations indicate that, under empirically relevant settings, the proposed method significantly reduces type I error and delivers adequate power. The method – that can be combined with pre-analysis plans – reduces the risk that relevant hypotheses are left untested.
You may purchase this paper on-line in .pdf format from SSRN.com ($5) for electronic delivery.
Supplementary materials for this paper:
Document Object Identifier (DOI): 10.3386/w21842