Taylor & Francis Group
Browse
1/1
3 files

Individualized Multidirectional Variable Selection

Version 2 2020-01-23, 18:48
Version 1 2020-01-04, 03:08
dataset
posted on 2020-01-23, 18:48 authored by Xiwei Tang, Fei Xue, Annie Qu

In this article, we propose a heterogeneous modeling framework which achieves individual-wise feature selection and heterogeneous covariates’ effects subgrouping simultaneously. In contrast to conventional model selection approaches, the new approach constructs a separation penalty with multidirectional shrinkages, which facilitates individualized modeling to distinguish strong signals from noisy ones and selects different relevant variables for different individuals. Meanwhile, the proposed model identifies subgroups among which individuals share similar covariates’ effects, and thus improves individualized estimation efficiency and feature selection accuracy. Moreover, the proposed model also incorporates within-individual correlation for longitudinal data to gain extra efficiency. We provide a general theoretical foundation under a double-divergence modeling framework where the number of individuals and the number of individual-wise measurements can both diverge, which enables inference on both an individual level and a population level. In particular, we establish a strong oracle property for the individualized estimator to ensure its optimal large sample property under various conditions. An efficient ADMM algorithm is developed for computational scalability. Simulation studies and applications to post-trauma mental disorder analysis with genetic variation and an HIV longitudinal treatment study are illustrated to compare the new approach to existing methods. Supplementary materials for this article are available online.

Funding

The authors would like to acknowledge support for this project from the National Science Foundation grants DMS-1613190, DMS-1821198, the National Institute of Health grant 7R01MD011728-03, and Dr. Monica Uddin to provide the PTSD data.

History