Possibility of using overrule to evaluate overlap in causal inference
What is the performance of overrule in identifying overlap for different types of datasets?
S. Cheng (TU Delft - Electrical Engineering, Mathematics and Computer Science)
J.H. Krijthe – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)
R.K.A. Karlsson – Coach (TU Delft - Electrical Engineering, Mathematics and Computer Science)
More Info
expand_more
source code of this project
https://github.com/ShukunCheng/Rule-Based-Overlap-EstimatorOther than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.
Abstract
Causal inference is a widely recognized concept in various domains, including medicine, for estimating the effect of a medication on a certain disease. During this estimation, overlap is commonly used to eliminate the error caused by other features. However, finding the real overlap region in practice is challenging due to the limited sample size and unknown data distribution. Therefore, some machine-learning methods have been proposed to estimate the overlap region. One such method is Overrule, a Python package proposed by Oberst et al. Overrule is based on rule-based lassification and estimates the overlap region by interpreting it as several rules across the features. However, it is still unclear how Overrule performs under different circumstances. Thus, the primary bjective of this project is to test the performance of Overrule with different datasets. To accomplish this, a series of tests are built and executed to evaluate the performance of Overrule in diverse scenarios.