Abstract
Data with high-dimensional covariates are now commonly encountered. Compared to other types of responses, research on high-dimensional data with censored survival responses is still relatively limited, and most of the existing studies have been focused on estimation and variable selection. In this study, we consider data with a censored survival response, a set of low-dimensional covariates of main interest, and a set of high-dimensional covariates that may also affect survival. The accelerated failure time model is adopted to describe survival. The goal is to conduct inference for the effects of low-dimensional covariates, while properly accounting for the high-dimensional covariates. A penalization-based procedure is developed, and its validity is established under mild and widely adopted conditions. Simulation suggests satisfactory performance of the proposed procedure, and the analysis of two cancer genetic datasets demonstrates its practical applicability.
Original language | English |
---|---|
Pages (from-to) | 877-894 |
Number of pages | 18 |
Journal | Statistica Sinica |
Volume | 29 |
Issue number | 2 |
DOIs | |
Publication status | Published - 1 Jan 2019 |
Keywords
- AFT model
- Censored survival data
- High-dimensional inference
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty