Machine-learning diagnostics of breast cancer using piRNA biomarkers

被引:0
作者
Zhao, Amy R. [1 ]
Kouznetsova, Valentina L. [2 ,3 ,4 ]
Kesari, Santosh [5 ]
Tsigelny, Igor F. [2 ,3 ,4 ,6 ]
机构
[1] CureSci Inst, Scholars Program, San Diego, CA USA
[2] Univ Calif San Diego, San Diego Supercomp Ctr, La Jolla, CA USA
[3] BIAna Inst, San Diego, CA USA
[4] CureScience Inst, San Diego, CA USA
[5] Pacific Neurosci Inst, Dept Neurooncol, Santa Monica, CA USA
[6] Univ Calif San Diego, Dept Neurosci, La Jolla, CA USA
关键词
Biomarkers; breast cancer; blood-based piRNAs; circulating piRNAs; machine learning; PIWI-INTERACTING RNA; BIOGENESIS; EXPRESSION; HALLMARKS; PROTEINS; ELEMENTS;
D O I
10.1080/1354750X.2025.2461067
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background and objectivesPrior studies have shown that small non-coding RNAs (sncRNAs) are associated with cancer occurrence or development. Recently, a newly discovered class of small ncRNAs known as PIWI-interacting RNAs (piRNAs) have been found to play a vital role in physiological processes and cancer initiation. This study aims to utilize piRNAs as innovative, noninvasive diagnostic biomarkers for breast cancer. Our objective is to develop computational methods that leverage piRNA attributes for breast cancer prediction and its application in diagnostics.MethodsWe created a set of piRNA sequence descriptors using information extracted from the piRNA sequences. To ensure accuracy, we found a path to convert non-standard piRNA names to standard ones to enable precise identification of these sequences. Using these descriptors, we applied machine-learning (ML) techniques in WEKA (Waikato Environment for Knowledge Analysis) to a dataset of piRNA to assess the predictive accuracy of the following classifiers: Logistic Regression model, Sequential Minimal Optimization (SMO), Random Forest classifier, and Logistic Model Tree (LMT). Furthermore, we performed Shapley additive explanations (SHAP) Analysis to understand which descriptors were the most relevant to the prediction accuracy. The ML models were then validated on an independent dataset to evaluate their effectiveness in predicting breast cancer.ResultsThe top three performing classifiers in WEKA were Logistic Regression, SMO, and LMT. The Logistic Regression model achieved an accuracy of 90.7% in predicting breast cancer, while SMO and LMT attained 89.7% and 85.65%, respectively.ConclusionsOur study demonstrates the effectiveness of using ML-based piRNA classifiers in diagnosing breast cancer and contributes to the growing body of evidence supporting piRNAs as biomarkers in cancer diagnosis. However, additional research is needed to validate these findings and further assess the clinical applicability of this approach.
引用
收藏
页码:167 / 177
页数:11
相关论文
共 47 条
  • [1] Aravind V.A., Kouznetsova V.L., Kesari S., Tsigelny I.F., Using machine learning and miRNA for the diagnosis of esophageal cancer, J Appl Lab Med, 9, 4, pp. 684-695, (2024)
  • [2] Babski J., Maier L.-K., Heyer R., Jaschinski K., Prasse D., Jager D., Randau L., Schmitz R.A., Marchfelder A., Soppa J., Et al., Small regulatory RNAs in archaea, RNA Biol, 11, 5, pp. 484-493, (2014)
  • [3] Chalbatani G.M., Dana H., Gharagouzloo E., Memari F., Ghasemi M., Akbarian A., Rakhshani N., Kheirandish P., Mohammadi F., Jalali S.A., PIWI-interacting RNAs (piRNAs) in cancer: biogenesis, function, and clinical aspects, J Hematol Oncol, 11, 1, (2018)
  • [4] Dakal T.C., Dhabhai B., Pant A., Moar K., Chaudhary K., Yadav V., Ranga V., Sharma N.K., Kumar A., Maurya P.K., Et al., Oncogenes and tumor suppressor genes: functions and roles in cancers, MedComm (2020), 5, 6, (2024)
  • [5] Deng X., Liao T., Xie J., Kang D., He Y., Sun Y., Wang Z., Jiang Y., Miao X., Yan Y., Et al., The burgeoning importance of PIWI-interacting RNAs in cancer progression, Sci China Life Sci, 67, 4, pp. 653-662, (2024)
  • [6] Grosshans H., Filipowicz W., Molecular biology: the expanding world of small RNAs, Nature, 451, 7177, pp. 414-416, (2008)
  • [7] Guo B., Li D., Du L., Zhu X., piRNAs: biogenesis and their potential roles in cancer, Cancer Metastasis Rev, 39, 2, pp. 567-575, (2020)
  • [8] Hanahan D., Hallmarks of cancer: new dimensions, Cancer Discov, 12, 1, pp. 31-46, (2022)
  • [9] Hanahan D., Weinberg R.A., The hallmarks of cancer, Cell, 100, 1, pp. 57-70, (2000)
  • [10] Hanahan D., Weinberg R.A., Hallmarks of cancer: the next generation, Cell, 144, 5, pp. 646-674, (2011)