Doubly penalized estimation in additive regression with high-dimensional data

Hiqiang Tan, Cun Hui Zhang

Research output: Contribution to journalArticle

1 Scopus citations

Abstract

Additive regression provides an extension of linear regression by modeling the signal of a response as a sum of functions of covariates of relatively low complexity. We study penalized estimation in high-dimensional nonparametric additive regression where functional semi-norms are used to induce smoothness of component functions and the empirical L2 norm is used to induce sparsity. The functional semi-norms can be of Sobolev or bounded variation types and are allowed to be different amongst individual component functions. We establish oracle inequalities for the predictive performance of such methods under three simple technical conditions: a sub-Gaussian condition on the noise, a compatibility condition on the design and the functional classes under consideration and an entropy condition on the functional classes. For random designs, the sample compatibility condition can be replaced by its population version under an additional condition to ensure suitable convergence of empirical norms. In homogeneous settings where the complexities of the component functions are of the same order, our results provide a spectrum of minimax convergence rates, from the so-called slow rate without requiring the compatibility condition to the fast rate under the hard sparsity or certain Lq sparsity to allow many small components in the true regression function. These results significantly broaden and sharpen existing ones in the literature.

Original languageEnglish (US)
Pages (from-to)2567-2600
Number of pages34
JournalAnnals of Statistics
Volume47
Issue number5
DOIs
StatePublished - Jan 1 2019

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Keywords

  • ANOVA model
  • Additive model
  • Bounded variation space
  • Highdimensional data
  • Metric entropy
  • Penalized estimation
  • Sobolev space
  • Teproducing kernel Hilbert space
  • Total variation
  • Trend filtering

Fingerprint Dive into the research topics of 'Doubly penalized estimation in additive regression with high-dimensional data'. Together they form a unique fingerprint.

Cite this