RNA-Seq analysis for breast cancer detection: a study on paired tissue samples using hybrid optimization and deep learning techniques

被引:18
作者
Yaqoob, Abrar [1 ]
Verma, Navneet Kumar [1 ]
Aziz, Rabia Musheer [2 ]
Shah, Mohd Asif [3 ,4 ,5 ]
机构
[1] VIT Bhopal Univ, Sch Adv Sci & Language, Bhopal 466114, Madhya Pradesh, India
[2] State Planning Inst, Planning Dept, New Div, Lucknow 226001, Utter Pradesh, India
[3] Kardan Univ, Dept Econ, Parwane Du, Kabul 1001, Afghanistan
[4] Lovely Profess Univ, Div Res & Dev, Phagwara 144001, Punjab, India
[5] Chitkara Univ, Inst Engn & Technol, Ctr Res Impact & Outcome, Rajpura 140401, Punjab, India
基金
英国科研创新办公室;
关键词
Breast cancer; Harris Hawk algorithm; Whale optimization algorithm; Deep learning; GENE SELECTION; ARTIFICIAL-INTELLIGENCE; ALGORITHM; CLASSIFICATION; COMBINATION;
D O I
10.1007/s00432-024-05968-z
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
ProblemBreast cancer is a leading global health issue, contributing to high mortality rates among women. The challenge of early detection is exacerbated by the high dimensionality and complexity of gene expression data, which complicates the classification process.AimThis study aims to develop an advanced deep learning model that can accurately detect breast cancer using RNA-Seq gene expression data, while effectively addressing the challenges posed by the data's high dimensionality and complexity.MethodsWe introduce a novel hybrid gene selection approach that combines the Harris Hawk Optimization (HHO) and Whale Optimization (WO) algorithms with deep learning to improve feature selection and classification accuracy. The model's performance was compared to five conventional optimization algorithms integrated with deep learning: Genetic Algorithm (GA), Artificial Bee Colony (ABC), Cuckoo Search (CS), and Particle Swarm Optimization (PSO). RNA-Seq data was collected from 66 paired samples of normal and cancerous tissues from breast cancer patients at the Jawaharlal Nehru Cancer Hospital & Research Centre, Bhopal, India. Sequencing was performed by Biokart Genomics Lab, Bengaluru, India.ResultsThe proposed model achieved a mean classification accuracy of 99.0%, consistently outperforming the GA, ABC, CS, and PSO methods. The dataset comprised 55 female breast cancer patients, including both early and advanced stages, along with age-matched healthy controls.ConclusionOur findings demonstrate that the hybrid gene selection approach using HHO and WO, combined with deep learning, is a powerful and accurate tool for breast cancer detection. This approach shows promise for early detection and could facilitate personalized treatment strategies, ultimately improving patient outcomes.
引用
收藏
页数:27
相关论文
共 62 条
[1]   A Novel Weighted Support Vector Machine Based on Particle Swarm Optimization for Gene Selection and Tumor Classification [J].
Abdi, Mohammad Javad ;
Hosseini, Seyed Mohammad ;
Rezghi, Mansoor .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2012, 2012
[2]   Puma optimizer (PO): a novel metaheuristic optimization algorithm and its application in machine learning [J].
Abdollahzadeh, Benyamin ;
Khodadadi, Nima ;
Barshandeh, Saeid ;
Trojovsky, Pavel ;
Gharehchopogh, Farhad Soleimanian ;
El-kenawy, El-Sayed M. ;
Abualigah, Laith ;
Mirjalili, Seyedali .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (04) :5235-5283
[3]   Metaheuristic Algorithms on Feature Selection: A Survey of One Decade of Research (2009-2019) [J].
Agrawal, Prachi ;
Abutarboush, Hattan F. ;
Ganesh, Talari ;
Mohamed, Ali Wagdy .
IEEE ACCESS, 2021, 9 :26766-26791
[4]   Harris hawks optimization: a comprehensive review of recent variants and applications [J].
Alabool, Hamzeh Mohammad ;
Alarabiat, Deemah ;
Abualigah, Laith ;
Heidari, Ali Asghar .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (15) :8939-8980
[5]   Regularized logistic regression with adjusted adaptive elastic net for gene selection in high dimensional cancer classification [J].
Algamal, Zakariya Yahya ;
Lee, Muhammad Hisyam .
COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 67 :136-145
[6]   Improved intelligent water drop-based hybrid feature selection method for data [J].
Alhenawi, Esra'a ;
Al-Sayyed, Rizik ;
Hudaib, Amjad ;
Mirjalili, Seyedali .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2023, 103
[7]  
Ali AM, 2023, International Journal of Mathematics Statistics and Computer Science, V2, P114, DOI [10.59543/ijmscs.v2i.8703, 10.59543/ijmscs.v2i.8703, DOI 10.59543/IJMSCS.V2I.8703]
[8]   Hybrid Filter and Genetic Algorithm-Based Feature Selection for Improving Cancer Classification in High-Dimensional Microarray Data [J].
Ali, Waleed ;
Saeed, Faisal .
PROCESSES, 2023, 11 (02)
[9]   A New Algorithm for Cancer Biomarker Gene Detection Using Harris Hawks Optimization [J].
AlMazrua, Halah ;
AlShamlan, Hala .
SENSORS, 2022, 22 (19)
[10]   mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling [J].
Alshamlan, Hala ;
Badr, Ghada ;
Alohali, Yousef .
BIOMED RESEARCH INTERNATIONAL, 2015, 2015