Genetic data visualization using literature text-based neural networks: Examples associated with myocardial infarction

被引:0
|
作者
Moon, Jihye [1 ,2 ]
Posada-Quintero, Hugo F. [1 ]
Chon, Ki H. [1 ]
机构
[1] Univ Connecticut, Dept Biomed Engn, Storrs, CT 06269 USA
[2] Univ Connecticut Storrs, Biomed Engn Dept, Engn & Sci Bldg ESB,Room 407, Storrs, CT 06269 USA
关键词
Explainable Artificial Intelligence; Natural language processing; Unsupervised learning; Cross -modal representation; Data visualization; Cardiovascular Disease risk prediction; DIMENSIONALITY REDUCTION; METAANALYSIS; MODEL;
D O I
10.1016/j.neunet.2023.05.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data visualization is critical to unraveling hidden information from complex and high-dimensional data. Interpretable visualization methods are critical, especially in the biology and medical fields, however, there are limited effective visualization methods for large genetic data. Current visualization methods are limited to lower-dimensional data and their performance suffers if there is missing data. In this study, we propose a literature-based visualization method to reduce high-dimensional data without compromising the dynamics of the single nucleotide polymorphisms (SNP) and textual interpretability. Our method is innovative because it is shown to (1) preserves both global and local structures of SNP while reducing the dimension of the data using literature text representations, and (2) enables interpretable visualizations using textual information. For performance evaluations, we examined the proposed approach to classify various classification categories including race, myocardial infarction event age groups, and sex using several machine learning models on the literature-derived SNP data. We used visualization approaches to examine clustering of data as well as quantitative performance metrics for the classification of the risk factors examined above. Our method outperformed all popular dimensionality reduction and visualization methods for both classification and visualization, and it is robust against missing and higher-dimensional data. Moreover, we found it feasible to incorporate both genetic and other risk information obtained from literature with our method.& COPY; 2023 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:562 / 595
页数:34
相关论文
共 50 条
  • [31] Risk assessment for acute myocardial infarction patients using Artificial Neural Networks
    Sepúlveda, J
    Soria, E
    Camps, G
    Sanz, G
    Marrugat, J
    Gómez, L
    COMPUTERS IN CARDIOLOGY 2001, VOL 28, 2001, 28 : 573 - 575
  • [32] On comparison of tree-based methods and neural networks using genetic data.
    Chen, CH
    Chen, CL
    Chang, CJ
    Fann, CSJ
    AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (05) : 603 - 603
  • [33] cDNA microarray data based classification of cancers using neural networks and genetic algorithms
    Cho, HS
    Kim, TS
    Wee, JW
    Jeon, SM
    Lee, CH
    NANOTECH 2003, VOL 1, 2003, : 28 - 31
  • [34] Single Document Extractive Text Summarization Using Neural Networks and Genetic Algorithm
    Chatterjee, Niladri
    Jain, Gautam
    Bajwa, Gurkirat Singh
    INTELLIGENT COMPUTING, VOL 1, 2019, 858 : 338 - 358
  • [35] Neural network-based visualization using clustered data
    Ivanikovas, Sergejus
    Dzemyda, Gintautas
    Medvedev, Viktor
    20TH INTERNATIONAL CONFERENCE, EURO MINI CONFERENCE CONTINUOUS OPTIMIZATION AND KNOWLEDGE-BASED TECHNOLOGIES, EUROPT'2008, 2008, : 335 - 341
  • [36] Treatment of missing data using neural networks and genetic algorithms
    Abdella, M
    Marwala, T
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 598 - 603
  • [37] Protein Function Prediction using Text-based Features extracted from the Biomedical Literature: The CAFA Challenge
    Wong, Andrew
    Shatkay, Hagit
    BMC BIOINFORMATICS, 2013, 14
  • [38] Enhancing Academic Literature Review through Relevance Recommendation Using Bibliometric and Text-based Features for Classification
    Rubio, Thiago R. P. M.
    Gulo, Carlos A. S. J.
    2016 11TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2016,
  • [39] A deep learning approach to text-based personality prediction using multiple data sources mapping
    Joshua Johnson Sirasapalli
    Ramakrishna Murty Malla
    Neural Computing and Applications, 2023, 35 : 20619 - 20630
  • [40] Using text-based synchronous chat to offer therapeutic support to students: A systematic review of the research literature
    Ersahin, Zehra
    Hanley, Terry
    HEALTH EDUCATION JOURNAL, 2017, 76 (05) : 531 - 543