Taylor & Francis Group
Browse
1/1
3 files

Inference in Additively Separable Models With a High-Dimensional Set of Conditioning Variables

Version 2 2020-05-14, 13:43
Version 1 2020-04-09, 12:28
dataset
posted on 2020-05-14, 13:43 authored by Damian Kozbur

This article studies nonparametric series estimation and inference for the effect of a single variable of interest x on an outcome y in the presence of potentially high-dimensional conditioning variables z. The context is an additively separable modelE[y|x,z]=g0(x)+h0(z). The model is high-dimensional in the sense that the series of approximating functions forh0(z)can have more terms than the sample size, thereby allowing z potentially to have very many measured characteristics. The model is required to be approximately sparse:h0(z)can be approximated using only a small subset of series terms whose identities are unknown. This article proposes an estimation and inference method forg0(x)called Post-Nonparametric Double Selection, which is a generalization of Post-Double Selection. Rates of convergence and asymptotic normality for the estimator are derived and hold over a large class of sparse data-generating processes. A simulation study illustrates finite sample estimation properties of the proposed estimator and coverage properties of the corresponding confidence intervals. Finally, an empirical application to college admissions policy demonstrates the practical implementation of the proposed method.

Funding

I gratefully acknowledge financial support from the ETH Postdoctoral Fellowship.

History