DOUBLY DEBIASED LASSO: HIGH-DIMENSIONAL INFERENCE UNDER HIDDEN CONFOUNDING

Zijian Guo, Domagoj Ćevid, Peter Bühlmann

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Inferring causal relationships or related associations from observational data can be invalidated by the existence of hidden confounding. We focus on a high-dimensional linear regression setting, where the measured covariates are affected by hidden confounding and propose the doubly debiased lasso estimator for individual components of the regression coefficient vector. Our advocated method simultaneously corrects both the bias due to estimation of high-dimensional parameters as well as the bias caused by the hidden confounding. We establish its asymptotic normality and also prove that it is efficient in the Gauss–Markov sense. The validity of our methodology relies on a dense confounding assumption, that is, that every confounding variable affects many covariates. The finite sample performance is illustrated with an extensive simulation study and a genomic application. The method is implemented by the DDL package available from CRAN.

Original languageEnglish (US)
Pages (from-to)1320-1347
Number of pages28
JournalAnnals of Statistics
Volume50
Issue number3
DOIs
StatePublished - Jun 2022

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Keywords

  • Causal inference
  • dense confounding
  • linear model
  • spectral deconfounding
  • structural equation model

Fingerprint

Dive into the research topics of 'DOUBLY DEBIASED LASSO: HIGH-DIMENSIONAL INFERENCE UNDER HIDDEN CONFOUNDING'. Together they form a unique fingerprint.

Cite this