Improving Causal Bayesian Networks Using Expertise in Authoritative Medical Ontologies

被引:1
|
作者
Hu, Hengyi [1 ]
Kerschberg, Larry [1 ]
机构
[1] George Mason Univ, Fairfax, VA 22030 USA
来源
关键词
Patient data; data mining; data management; Bayesian networks; causal inference; causal networks; causality; healthcare data; healthcare information technology; ontology; ontology evolution; GENERALIZED ANXIETY DISORDER; PANIC DISORDER; DYSTHYMIC DISORDER; DEPRESSION; IRRITABILITY; INSOMNIA; SEQUENCE; PREVALENCE; EVOLUTION; DISCOVERY;
D O I
10.1145/3604561
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Discovering causal relationships among symptoms is a topical issue in the analysis of observational patient datasets. A Causal Bayesian Network (CBN) is a popular analytical framework for causal inference. While there are many methods and algorithms capable of learning a Bayesian network, they are reliant on the complexity and thoroughness of the algorithm and do not consider prior expertise from authoritative sources. This article proposes a novel method of extracting prior causal knowledge contained in Authoritative Medical Ontologies (AMOs) and using this prior knowledge to orient arcs in a CBN learned from observational patient data. Since AMOs are robust biomedical ontologies containing the collective knowledge of the experts who created them, utilizing the ordering information contained within them produces improved CBNs that provide additional insight into the disease domain. To demonstrate our method, we obtained prior causal ordering information among symptoms from three AMOs: (1) the Medical Dictionary for Regulatory Activities Terminology (MedDRA), (2) the International Classification of Diseases Version 10 Clinical Modification (ICD-10-CM), and (3) Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT). The prior ontological knowledge from these three AMOs is then used to orient arcs in a series of CBNs learned from the National Institutes of Mental Health study on Sequenced Treatment Alternatives to Relieve Depression (STAR*D) patient dataset using the Max-Min Hill-Climbing (MMHC) algorithm. Six distinct CBNs are generated using MMHC: an unmodified baseline model using only the algorithm, three CBNs oriented with ordered-variable pairs from MedDRA, ICD-10-CM, and SNOMED CT, and two more with ordered pairs from a combination of these AMOs. The resulting CBNs modified using ordered-variable pairs significantly change the structure of the network. The agreement between the Modified networks and the Baseline ranges from 50% to 90%. A modified network using ordering information from all ontologies obtained an agreement of 50% (10 out of 20 arcs exist in both the Baseline and Modified models) while maintaining comparable predictive accuracy. This indicates that the Modified CBN reflects the causal claims in the AMOs and agrees with both the AMOs and the observational STAR*D dataset. Furthermore, the Modified models discovered new potentially causal relationships among symptoms in the model, while eliminating weaker edges in a qualitative analysis of the significance of these relationships in existing epidemiological research.
引用
收藏
页数:32
相关论文
共 50 条
  • [21] Modelling expertise for structure elucidation in organic chemistry using Bayesian networks
    Hohenner, M
    Wachsmuth, S
    Sagerer, G
    KNOWLEDGE-BASED SYSTEMS, 2005, 18 (4-5) : 207 - 215
  • [22] Modelling expertise for structure elucidation in organic chemistry using Bayesian networks
    Hohenner, M
    Wachsmuth, S
    Sagerer, G
    APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XII, PROCEEDINGS, 2005, : 251 - 264
  • [23] Causal independence for probability assessment and inference using Bayesian networks
    Heckerman, D
    Breese, JS
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 1996, 26 (06): : 826 - 831
  • [24] Improving lipid mapping in Genome Scale Metabolic Networks using ontologies
    Nathalie Poupin
    Florence Vinson
    Arthur Moreau
    Aurélie Batut
    Maxime Chazalviel
    Benoit Colsch
    Laetitia Fouillen
    Sarah Guez
    Spiro Khoury
    Jessica Dalloux-Chioccioli
    Anthony Tournadre
    Pauline Le Faouder
    Corinne Pouyet
    Pierre Van Delft
    Fanny Viars
    Justine Bertrand-Michel
    Fabien Jourdan
    Metabolomics, 2020, 16
  • [25] Improving lipid mapping in Genome Scale Metabolic Networks using ontologies
    Poupin, Nathalie
    Vinson, Florence
    Moreau, Arthur
    Batut, Aurelie
    Chazalviel, Maxime
    Colsch, Benoit
    Fouillen, Laetitia
    Guez, Sarah
    Khoury, Spiro
    Dalloux-Chioccioli, Jessica
    Tournadre, Anthony
    Le Faouder, Pauline
    Pouyet, Corinne
    Van Delft, Pierre
    Viars, Fanny
    Bertrand-Michel, Justine
    Jourdan, Fabien
    METABOLOMICS, 2020, 16 (04)
  • [26] Using Bayesian networks to analyze medical data
    Kim, IC
    Jung, YG
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2003, 2734 : 317 - 327
  • [27] Development of a graphical model of causal gene regulatory networks using medical big data and Bayesian machine learning
    Park, Sung Bae
    Yoo, Changwon
    JOURNAL OF THE KOREAN MEDICAL ASSOCIATION, 2022, 65 (03): : 167 - 172
  • [28] Encoding Dependence in Bayesian Causal Networks
    Sulik, John J.
    Newlands, Nathaniel K.
    Long, Dan S.
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2017, 4
  • [29] Robust Bayesian causal estimation for causal inference in medical diagnosis
    Basu, Tathagata
    Troffaes, Matthias C. M.
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2025, 177
  • [30] A Causal Bayesian Networks Viewpoint on Fairness
    Chiappa, Silvia
    Isaac, William S.
    PRIVACY AND IDENTITY MANAGEMENT: FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY IN THE AGE OF BIG DATA, 2019, 547 : 3 - 20