A gene pathway enrichment method based on improved TF-IDF algorithm

Cited by: 2
Authors
Xu, Shutan [1, 2]
Leng, Yinhui [1]
Feng, Guofu [1]
Zhang, Chenjing [1]
Chen, Ming [1, 2]
Affiliations
[1] Shanghai Ocean Univ, Coll Informat Technol, Shanghai 201306, Peoples R China
[2] Minist Agr, Key Lab Fisheries Informat, Shanghai 201306, Peoples R China
Keywords
Pathway enrichment; TF-IDF; Gene interaction; Gene set enrichment analysis; EXPRESSION
DOI
10.1016/j.bbrep.2023.101421
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Gene pathway enrichment analysis is a widely used method for testing whether a gene set is statistically enriched in a particular biological pathway network. Current gene pathway enrichment methods commonly consider the local importance of genes within pathways but ignore the interactions between genes. In this paper, we propose a gene pathway enrichment method (GIGSEA) based on an improved TF-IDF algorithm. The method uses gene interaction data to calculate the influence of each gene from both its local importance in a pathway and its global specificity. Computational experiments show that, compared with the traditional gene set enrichment analysis method, the proposed method finds more specific enriched pathways related to the phenotype, with higher efficiency.
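As a rough illustration of the TF-IDF idea the abstract refers to, the sketch below scores genes with plain, textbook TF-IDF, treating each pathway as a document and each gene as a term. It is not the GIGSEA method: the improved weighting and the use of gene interaction data are not described in this record. The function name `gene_tfidf`, the pathway names, and the toy gene lists are all hypothetical.

```python
import math
from collections import Counter


def gene_tfidf(pathways):
    """Textbook TF-IDF with pathways as documents and genes as terms.

    pathways: dict mapping pathway name -> list of gene symbols.
    Returns a dict mapping pathway name -> {gene: score}.
    This is an assumption-laden sketch, not the paper's improved algorithm.
    """
    n_pathways = len(pathways)
    # Document frequency: number of pathways in which each gene appears.
    df = Counter(g for genes in pathways.values() for g in set(genes))
    scores = {}
    for name, genes in pathways.items():
        counts = Counter(genes)
        total = len(genes)
        # TF = local share of the gene in this pathway,
        # IDF = log(number of pathways / pathways containing the gene).
        scores[name] = {
            g: (c / total) * math.log(n_pathways / df[g])
            for g, c in counts.items()
        }
    return scores


# Hypothetical toy data, for illustration only.
toy = {
    "pathway_A": ["TP53", "MDM2", "CDKN1A"],
    "pathway_B": ["TP53", "AKT1", "MTOR"],
    "pathway_C": ["TP53", "EGFR", "AKT1"],
}
scores = gene_tfidf(toy)
```

In this toy input, a gene that appears in every pathway receives a score of zero, while a gene found in only one pathway scores highest there. That is one simple way to read "global specificity" alongside "local importance"; the published method may define both differently.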
Pages: 8