Hierarchical Total Variations and Doubly Penalized ANOVA Modeling for Multivariate Nonparametric Regression

Ting Yang, Zhiqiang Tan

Research output: Contribution to journalArticlepeer-review

Abstract

For multivariate nonparametric regression, functional analysis of variance (ANOVA) modeling aims to capture the relationship between a response and covariates by decomposing the unknown function into various components, representing main effects, two-way interactions, etc. Such an approach has been pursued explicitly in smoothing spline ANOVA modeling and implicitly in various greedy methods such as MARS. We develop a new method for functional ANOVA modeling, based on doubly penalized estimation using total-variation and empirical-norm penalties, to achieve sparse selection of component functions and their basis functions. For this purpose, we formulate a new class of hierarchical total variations, which measures total variations at different levels including main effects and multi-way interactions, possibly after some order of differentiation. Furthermore, we derive suitable basis functions for multivariate splines such that the hierarchical total variation can be represented as a regular Lasso penalty, and hence we extend a previous backfitting algorithm to handle doubly penalized estimation for ANOVA modeling. We present extensive numerical experiments on simulations and real data to compare our method with existing methods including MARS, tree boosting, and random forest. The results are very encouraging and demonstrate notable gains from our method in prediction or classification accuracy and simplicity of the fitted functions. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)848-862
Number of pages15
JournalJournal of Computational and Graphical Statistics
Volume30
Issue number4
DOIs
StatePublished - 2021

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty
  • Discrete Mathematics and Combinatorics

Keywords

  • ANOVA model
  • Additive model
  • Boosting
  • Nonparametric regression
  • Penalized estimation
  • Total variation

Fingerprint

Dive into the research topics of 'Hierarchical Total Variations and Doubly Penalized ANOVA Modeling for Multivariate Nonparametric Regression'. Together they form a unique fingerprint.

Cite this