Taylor & Francis Group
Browse
uasa_a_1671200_sm4781.pdf (77.83 kB)

Optimal Designs of Two-Phase Studies

Download (77.83 kB)
Version 2 2019-10-29, 21:58
Version 1 2019-10-14, 14:03
journal contribution
posted on 2019-10-29, 21:58 authored by Ran Tao, Donglin Zeng, Dan-Yu Lin

The two-phase design is a cost-effective sampling strategy to evaluate the effects of covariates on an outcome when certain covariates are too expensive to be measured on all study subjects. Under such a design, the outcome and inexpensive covariates are measured on all subjects in the first phase and the first-phase information is used to select subjects for measurements of expensive covariates in the second phase. Previous research on two-phase studies has focused largely on the inference procedures rather than the design aspects. We investigate the design efficiency of the two-phase study, as measured by the semiparametric efficiency bound for estimating the regression coefficients of expensive covariates. We consider general two-phase studies, where the outcome variable can be continuous, discrete, or censored, and the second-phase sampling can depend on the first-phase data in any manner. We develop optimal or approximately optimal two-phase designs, which can be substantially more efficient than the existing designs. We demonstrate the improvements of the new designs over the existing ones through extensive simulation studies and two large medical studies. Supplementary materials for this article are available online.

Funding

This research was supported by the National Institute of Health grants R01HG009974, R01GM047845, P01CA142538, R01HL094786, U01HG009086, and R01AI029168.

History