Abstract
We study the properties of the Lasso in the high-dimensional partially linear model where the number of variables in the linear part can be greater than the sample size. We use truncated series expansion based on polynomial splines to approximate the nonparametric component in this model. Under a sparsity assumption on the regression coefficients of the linear component and some regularity conditions, we derive the oracle inequalities for the prediction risk and the estimation error. We also provide sufficient conditions under which the Lasso estimator is selection consistent for the variables in the linear part of the model. In addition, we derive the rate of convergence of the estimator of the nonparametric function. We conduct simulation studies to evaluate the finite sample performance of variable selection and nonparametric function estimation.
Original language | English |
---|---|
Pages (from-to) | 769-788 |
Number of pages | 20 |
Journal | Science China Mathematics |
Volume | 59 |
Issue number | 4 |
DOIs | |
Publication status | Published - 1 Apr 2016 |
Externally published | Yes |
Keywords
- irrepresentable condition
- Lasso
- restricted eigenvalue
- semiparametric models
- sparsity
ASJC Scopus subject areas
- General Mathematics