Identifying Biomarkers Using Support Vector Machine to Understand the Racial Disparity in Triple-Negative Breast Cancer

被引:2
作者
Sahoo, Bikram [1 ,3 ]
Pinnix, Zandra [2 ]
Sims, Seth [1 ]
Zelikovsky, Alex [1 ,3 ]
机构
[1] Georgia State Univ, Dept Comp Sci, Atlanta, GA USA
[2] Univ North Carolina Wilmington, Dept Biol & Marine Biol, Wilmington, NC USA
[3] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
关键词
gene expression; machine learning and feature engineering; racial disparity; RNA sequencing; SVM; triple-negative breast cancer; GENE-EXPRESSION DATA; OVARIAN-CANCER; KLK10; EXPRESSION; METASTASIS;
D O I
10.1089/cmb.2022.0422
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
With the properties of aggressive cancer and heterogeneous tumor biology, triple-negative breast cancer (TNBC) is a type of breast cancer known for its poor clinical outcome. The lack of estrogen, progesterone, and human epidermal growth factor receptor in the tumors of TNBC leads to fewer treatment options in clinics. The incidence of TNBC is higher in African American (AA) women compared with European American (EA) women with worse clinical outcomes. The significant factors responsible for the racial disparity in TNBC are socioeconomic lifestyle and tumor biology. The current study considered the open-source gene expression data of triple-negative breast cancer samples' racial information. We implemented a state-of-the-art classification Support Vector Machine (SVM) method with a recurrent feature elimination approach to the gene expression data to identify significant biomarkers deregulated in AA women and EA women. We also included Spearman's rho and Ward's linkage method in our feature selection workflow. Our proposed method generates 24 features/genes that can classify the AA and EA samples 98% accurately. We also performed the Kaplan-Meier analysis and log-rank test on the 24 features/genes. We only discussed the correlation between deregulated expression and cancer progression with a poor survival rate of 2 genes, KLK10 and LRRC37A2, out of 24 genes. We believe that further improvement of our method with a higher number of RNA-seq gene expression data will more accurately provide insight into racial disparity in TNBC.
引用
收藏
页码:502 / 517
页数:16
相关论文
共 47 条
  • [1] Clinical significance of kallikrein-related peptidase (KLK10) mRNA expression in colorectal cancer
    Alexopoulou, Dimitra K.
    Papadopoulos, Iordanis N.
    Scorilas, Andreas
    [J]. CLINICAL BIOCHEMISTRY, 2013, 46 (15) : 1453 - 1461
  • [2] Kallikrein-Related Peptidase 10 (KLK10) Expression and Single Nucleotide Polymorphisms in Ovarian Cancer Survival
    Batra, Jyotsna
    Tan, Olivia L.
    O'Mara, Tracy
    Zammit, Rebecca
    Nagle, Christina M.
    Clements, Judith A.
    Kedda, Mary-Anne
    Spurdle, Amanda B.
    [J]. INTERNATIONAL JOURNAL OF GYNECOLOGICAL CANCER, 2010, 20 (04) : 529 - 536
  • [3] The emerging roles of human tissue kallikreins in cancer
    Borgoño, CA
    Diamandis, EP
    [J]. NATURE REVIEWS CANCER, 2004, 4 (11) : 876 - 890
  • [4] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [5] Racial Disparities in Breast Cancer Diagnosis and Treatment by Hormone Receptor and HER2 Status
    Chen, Lu
    Li, Christopher I.
    [J]. CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2015, 24 (11) : 1666 - 1672
  • [6] Evaluation of Racial/Ethnic Differences in Treatment and Mortality Among Women With Triple-Negative Breast Cancer
    Cho, Beomyoung
    Han, Yunan
    Lian, Min
    Colditz, Graham A.
    Weber, Jason D.
    Ma, Cynthia
    Liu, Ying
    [J]. JAMA ONCOLOGY, 2021, 7 (07) : 1016 - 1023
  • [7] sigFeature: Novel Significant Feature Selection Method for Classification of Gene Expression Data Using Support Vector Machine and t Statistic
    Das, Pijush
    Roychowdhury, Anirban
    Das, Subhadeep
    Roychoudhury, Susanta
    Tripathy, Sucheta
    [J]. FRONTIERS IN GENETICS, 2020, 11
  • [8] Triple-negative breast cancer in African-American women: disparities versus biology
    Dietze, Eric C.
    Sistrunk, Christopher
    Miranda-Carboni, Gustavo
    O'Regan, Ruth
    Seewaldt, Victoria L.
    [J]. NATURE REVIEWS CANCER, 2015, 15 (04) : 248 - 254
  • [9] Metastasis of ovarian cancer is mediated by kallikrein related peptidases
    Dong, Ying
    Loessner, Daniela
    Irving-Rodgers, Helen
    Obermair, Andreas
    Nicklin, James L.
    Clements, Judith A.
    [J]. CLINICAL & EXPERIMENTAL METASTASIS, 2014, 31 (01) : 135 - 147
  • [10] KLK10 exon 3 unmethylated PCR product concentration: a new potential early diagnostic marker in ovarian cancer? - A pilot study
    El Sherbini, Mustafa A.
    Mansour, Amal A.
    Sallam, Maha M.
    Shaban, Emtiaz A.
    ElDin, Zeinab A. Shehab
    El-Shalakany, Amr H.
    [J]. JOURNAL OF OVARIAN RESEARCH, 2018, 11