Abstract
We consider a high-dimensional linear regression problem, where the covariates (features) are ordered in some meaningful way, and the number of covariates p can be much larger than the sample size n. The fused lasso of Tibshirani et al. is designed especially to tackle this type of problems; it yields sparse coefficients and selects grouped variables, and encourages local constant coefficient profile within each group. However, in some applications, the effects of different features within a group might be different and change smoothly. In this article, we propose a new spline-lasso or more generally, spline-MCP to better capture the different effects within the group. The newly proposed method is very easy to implement since it can be easily turned into a lasso or MCP problem. Simulations show that the method works very effectively both in feature selection and prediction accuracy. A real application is also given to illustrate the benefits of the method. Supplementary materials for this article are available online.
| Original language | English |
|---|---|
| Pages (from-to) | 288-297 |
| Number of pages | 10 |
| Journal | Journal of the American Statistical Association |
| Volume | 111 |
| Issue number | 513 |
| DOIs | |
| Publication status | Published - 5 May 2016 |
Bibliographical note
Publisher Copyright:© 2016 American Statistical Association.
Keywords
- Oracle inequality
- Spline-MCP
- Spline-lasso
- Variable selection
Fingerprint
Dive into the research topics of 'Spline-Lasso in High-Dimensional Linear Regression'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver