We investigate the extent to which tests of financial asset pricing models may be biased
by using properties of the data to construct the test statistics. Specifically, we focus on
tests using returns to portfolios of common stock where portfolios are constructed by
sorting on some empirically motivated characteristic of the securities such as market
value of equity. We present both analytical calculations and Monte Carlo simulations
that show the effects of this type of data-snooping to be substantial. Even when the
sorting characteristic is only marginally correlated with individual security statistics, 5
percent tests based on sorted portfolio returns may reject with probability one under
the null hypothesis. This bias is shown to worsen as the number of securities increases
given a fixed number of portfolios, and as the number of portfolios decreases given
a fixed number of securities. We provide an empirical example that illustrates the
practical relevance of these biases.
*Published:
The Review of Financial Studies, Vol. 3, No. 3, pp. 431-467, (1990).
You may purchase this paper on-line in .pdf format
from SSRN.com ($5) for electronic delivery.
Machine-readable bibliographic record -
MARC,
RIS,
BibTeX