A gene pathway enrichment method based on improved TF-IDF algorithm

被引:2
作者
Xu, Shutan [1 ,2 ]
Leng, Yinhui [1 ]
Feng, Guofu [1 ]
Zhang, Chenjing [1 ]
Chen, Ming [1 ,2 ]
机构
[1] Shanghai Ocean Univ, Coll Informat Technol, Shanghai 201306, Peoples R China
[2] Minist Agr, Key Lab Fisheries Informat, Shanghai 201306, Peoples R China
关键词
Pathway enrichment; TF-IDF; Gene interaction; Gene set enrichment analysis; EXPRESSION;
D O I
10.1016/j.bbrep.2023.101421
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene pathway enrichment analysis is a widely used method to analyze whether a gene set is statistically enriched on certain biological pathway network. Current gene pathway enrichment methods commonly consider local importance of genes in pathways without considering the interactions between genes. In this paper, we propose a gene pathway enrichment method (GIGSEA) based on improved TF-IDF algorithm. This method employs gene interaction data to calculate the influence of genes based on the local importance in a pathway as well as the global specificity. Computational experiment result shows that, compared with traditional gene set enrichment analysis method, our proposed method in this paper can find more specific enriched pathways related to phenotype with higher efficiency.
引用
收藏
页数:8
相关论文
共 50 条
[31]   A Sentiment analysis-based hotel recommendation using TF-IDF Approach [J].
Mishra, Ram Krishn ;
Urolagin, Siddhaling ;
Jothi, Angel Arul J. .
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND KNOWLEDGE ECONOMY (ICCIKE' 2019), 2019, :811-815
[32]   Efficient TF-IDF method for alignment-free DNA sequence similarity analysis [J].
Delibas, Emre .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2025, 137
[33]   Rating and Comments Mining Using TF-IDF and SO-PMI for Improved Priority Ratings [J].
Kim, Jinah ;
Moon, Nammee .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (11) :5321-5334
[34]   A method for clustering unlabeled BIM objects using entropy and TF-IDF with RDF encoding [J].
Ali, Mostafa ;
Mohamed, Yasser .
ADVANCED ENGINEERING INFORMATICS, 2017, 33 :154-163
[35]   Research and realization of internet public opinion analysis based on improved TF - IDF algorithm [J].
Yang, Yanxia .
2017 16TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2017, :80-83
[36]   A Method of K-Means Clustering Based on TF-IDF for Software Requirements Documents Written in Chinese Language [J].
Zhu, Jing ;
Huang, Song ;
Shi, Yaqing ;
Wu, Kaishun ;
Wang, Yanqiu .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (04) :736-754
[37]   Exploiting tf-idf in deep Convolutional Neural Networks for Content Based Image Retrieval [J].
Nikolaos Kondylidis ;
Maria Tzelepi ;
Anastasios Tefas .
Multimedia Tools and Applications, 2018, 77 :30729-30748
[38]   Exploiting tf-idf in deep Convolutional Neural Networks for Content Based Image Retrieval [J].
Kondylidis, Nikolaos ;
Tzelepi, Maria ;
Tefas, Anastasios .
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (23) :30729-30748
[39]   Determining Feature-Size for Text to Numeric Conversion based on BOW and TF-IDF [J].
Alyamani, Hasan J. .
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (01) :283-287
[40]   LogTIW:A log anomaly detection model based on TF-IDF weighted semantic features [J].
Kang, Jia ;
Zhao, Junfeng ;
Li, Zhengxin .
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024, 2024,