The Forest or the Trees? Tackling Simpson's Paradox with Classification Trees

Cited by: 16
Authors
Shmueli, Galit [1]
Yahav, Inbal [2]
Affiliations
[1] Natl Tsing Hua Univ, Coll Technol Management, Inst Serv Sci, Hsinchu 30013, Taiwan
[2] Bar Ilan Univ, Grad Sch Business, IL-52900 Ramat Gan, Israel
Keywords
decision making; data aggregation; Simpson's paradox; causal effect; classification and regression trees; TRADE-OFF; PRINCIPLE; MODEL; BIG
DOI
10.1111/poms.12819
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
Studying causal effects is central to research in operations management in manufacturing and services, from evaluating prevention procedures, to effects of policies and new operational technologies and practices. The growing availability of micro-level data creates challenges for researchers and decision makers in terms of choosing the right level of data aggregation for inference and decisions. Simpson's paradox describes the case where the direction of a causal effect is reversed in the aggregated data compared to the disaggregated data. Detecting whether Simpson's paradox occurs in a dataset used for decision making is therefore critical. This study introduces the use of Classification and Regression Trees for automated detection of potential Simpson's paradoxes in data with few or many potential confounding variables, and even with large samples (big data). Our approach relies on the tree structure and the location of the cause vs. the confounders in the tree. We discuss theoretical and computational aspects of the approach and illustrate it using several real applications in e-governance and healthcare.
Pages: 696-716
Number of pages: 21
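
The abstract defines Simpson's paradox as a reversal in the direction of an effect between aggregated and disaggregated data. The sketch below checks that definition directly on a small table of toy counts. It is not the tree-based detection method the paper proposes. The counts, the names `counts`, `rate`, `effect_sign`, and `detect_reversal`, and the single confounder with two strata are all assumptions made for illustration.

```python
# Toy counts, assumed for illustration only (not taken from the paper):
# (treatment, stratum) -> (successes, total)
counts = {
    ("A", "small"): (81, 87),
    ("A", "large"): (192, 263),
    ("B", "small"): (234, 270),
    ("B", "large"): (55, 80),
}


def rate(pairs):
    """Pooled success rate over a list of (successes, total) pairs."""
    successes = sum(s for s, _ in pairs)
    total = sum(n for _, n in pairs)
    return successes / total


def effect_sign(rate_a, rate_b):
    """+1 if A does better than B, -1 if worse, 0 if equal."""
    return (rate_a > rate_b) - (rate_a < rate_b)


def detect_reversal(counts, treat_a="A", treat_b="B"):
    """Compare the aggregated effect direction with each stratum's direction.

    Returns (aggregate_sign, per_stratum_signs, reversed_flag), where
    reversed_flag is True only if every stratum points the opposite way
    from the aggregated comparison.
    """
    strata = sorted({stratum for _, stratum in counts})
    aggregate = effect_sign(
        rate([v for (t, _), v in counts.items() if t == treat_a]),
        rate([v for (t, _), v in counts.items() if t == treat_b]),
    )
    per_stratum = {
        s: effect_sign(rate([counts[(treat_a, s)]]), rate([counts[(treat_b, s)]]))
        for s in strata
    }
    reversed_flag = aggregate != 0 and all(
        sign == -aggregate for sign in per_stratum.values()
    )
    return aggregate, per_stratum, reversed_flag


aggregate, per_stratum, reversed_flag = detect_reversal(counts)
print("aggregate sign:", aggregate)
print("per-stratum signs:", per_stratum)
print("reversal detected:", reversed_flag)
```

With these toy counts, treatment A has the higher success rate within each stratum but the lower rate once the strata are pooled, so the check flags a reversal. A check of this kind requires the confounder to be named in advance; the approach described in the abstract instead relies on the tree structure to locate the cause relative to potential confounders.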