Formal Concept Analysis Applications in Bioinformatics

被引:13
作者
Roscoe, Sarah [1 ]
Khatri, Minal [1 ]
Voshall, Adam [1 ]
Batra, Surinder [2 ]
Kaur, Sukhwinder [2 ]
Deogun, Jitender [1 ]
机构
[1] Univ Nebraska, Sch Comp, Lincoln, NE 68588 USA
[2] Univ Nebraska, Med Ctr, Dept Biochem & Mol Biol, Omaha, NE 68198 USA
关键词
Biomarkers discovery; biomedical ontologies; cancer classification; disease classification; drug design and development; formal concept analysis; gene expression data; healthcare informatics; next-generation sequencing data analysis; phylogeny; protein-protein interactions; GENE-EXPRESSION; FEATURE-SELECTION; CONCEPT LATTICES; SIMILARITY; REDUCTION; ALGORITHM; IDENTIFICATION; ATTRIBUTES; ANALYTICS; SEQUENCE;
D O I
10.1145/3554728
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The bioinformatics discipline seeks to solve problems in biology with computational theories and methods. Formal concept analysis (FCA) is one such theoretical model, based on partial orders. FCA allows the user to examine the structural properties of data based on which subsets of the dataset depend on each other. This article surveys the current literature related to the use of FCA for bioinformatics. The survey begins with a discussion of FCA, its hierarchical advantages, several advanced models of FCA, and lattice management strategies. It then examines how FCA has been used in bioinformatics applications, followed by future prospects of FCA in those areas. The applications addressed include gene data analysis (with next-generation sequencing), biomarkers discovery, protein-protein interaction, disease analysis (including COVID-19, cancer, and others), drug design and development, healthcare informatics, biomedical ontologies, and phylogeny. Some of the most promising prospects of FCA are identifying influential nodes in a network representing protein-protein interactions, determining critical concepts to discover biomarkers, integrating machine learning and deep learning for cancer classification, and pattern matching for next-generation sequencing.
引用
收藏
页数:40
相关论文
共 174 条
[1]  
Ahmed Z., 2021, 2021 IEEE INT C BIOI, P1241
[2]   Support vector machines combined with feature selection for breast cancer diagnosis [J].
Akay, Mehmet Fatih .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :3240-3247
[3]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[4]  
Amin II, 2015, STUD BIG DATA, V9, P375, DOI 10.1007/978-3-319-11056-1_13
[5]  
Amin II, 2013, 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), P521, DOI 10.1109/ICACCI.2013.6637226
[6]  
Amin II, 2012, INT CONF INTELL SYST, P764, DOI 10.1109/ISDA.2012.6416633
[7]   A Visual Analytics Technique for Exploring Gene Expression in the Developing Mouse Embryo [J].
Andrews, Simon ;
McLeod, Kenneth .
GRAPH-BASED REPRESENTATION AND REASONING (ICCS 2018), 2018, 10872 :137-151
[8]   Creating corroborated crisis reports from social media data through formal concept analysis [J].
Andrews, Simon ;
Gibson, Helen ;
Domdouzis, Konstantinos ;
Akhgar, Babak .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 47 (02) :287-312
[9]   Gene Co-Expression in Mouse Embryo Tissues [J].
Andrews, Simon ;
McLeod, Kenneth .
INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2013, 9 (04) :55-68
[10]  
Andrews S, 2011, LECT NOTES ARTIF INT, V6828, P50, DOI 10.1007/978-3-642-22688-5_4