Detecting hidden confounding in observational data using multiple environments

Preprint (2023)
Author(s)

R.K.A. Karlsson (TU Delft - Electrical Engineering, Mathematics and Computer Science)

J.H. Krijthe (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Pattern Recognition and Bioinformatics
URL related publication
https://arxiv.org/abs/2205.13935 Accepted author manuscript
More Info
expand_more
Publication Year
2023
Language
English
Research Group
Pattern Recognition and Bioinformatics
Downloads counter
108

Abstract

A common assumption in causal inference from observational data is that there is no hidden confounding. Yet it is, in general, impossible to verify this assumption from a single dataset. Under the assumption of independent causal mechanisms underlying the data-generating process, we demonstrate a way to detect unobserved confounders when having multiple observational datasets coming from different environments. We present a theory for testable conditional independencies that are only absent when there is hidden confounding and examine cases where we violate its assumptions: degenerate & dependent mechanisms, and faithfulness violations. Additionally, we propose a procedure to test these independencies and study its empirical finite-sample behavior using simulation studies and semi-synthetic data based on a real-world dataset. In most cases, the proposed procedure correctly predicts the presence of hidden confounding, particularly when the confounding bias is large.