Inference in Additively Separable Models With a High-Dimensional Set of Conditioning Variables
This article studies nonparametric series estimation and inference for the effect of a single variable of interest x on an outcome y in the presence of potentially high-dimensional conditioning variables z. The context is an additively separable model. The model is high-dimensional in the sense that the series of approximating functions forcan have more terms than the sample size, thereby allowing z potentially to have very many measured characteristics. The model is required to be approximately sparse:can be approximated using only a small subset of series terms whose identities are unknown. This article proposes an estimation and inference method forcalled Post-Nonparametric Double Selection, which is a generalization of Post-Double Selection. Rates of convergence and asymptotic normality for the estimator are derived and hold over a large class of sparse data-generating processes. A simulation study illustrates finite sample estimation properties of the proposed estimator and coverage properties of the corresponding confidence intervals. Finally, an empirical application to college admissions policy demonstrates the practical implementation of the proposed method.