Taylor & Francis Group
Browse

Estimation and Inference for High-Dimensional Generalized Linear Models with Knowledge Transfer

Download (473.46 kB)
journal contribution
posted on 2023-02-28, 19:00 authored by Sai Li, Linjun Zhang, T. Tony Cai, Hongzhe Li

Transfer learning provides a powerful tool for incorporating data from related studies into a target study of interest. In epidemiology and medical studies, the classification of a target disease could borrow information across other related diseases and populations. In this work, we consider transfer learning for high-dimensional Generalized Linear Models (GLMs). A novel algorithm, TransHDGLM, that integrates data from the target study and the source studies is proposed. Minimax rate of convergence for estimation is established and the proposed estimator is shown to be rate-optimal. Statistical inference for the target regression coefficients is also studied. Asymptotic normality for a debiased estimator is established, which can be used for constructing coordinate-wise confidence intervals of the regression coefficients. Numerical studies show significant improvement in estimation and inference accuracy over GLMs that only use the target data. The proposed methods are applied to a real data study concerning the classification of colorectal cancer using gut microbiomes, and are shown to enhance the classification accuracy in comparison to methods that only use the target data. Supplementary materials for this article are available online.

Funding

This research was supported by NIH grants R01GM123056 and R01GM129781. Sai Li’s research was also supported by NSFC(grant no. 12201630), the Fundamental Research Funds for the Central Universities, and the Research Funds of Renmin University of China. Linjun Zhang’s research was also supported in part by NSF grant DMS-2015378. Tony Cai’s research was also supported in part by NSF grant DMS-2015259.

History

Usage metrics

    Journal of the American Statistical Association

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC