Raman spectroscopy and topological machine learning for cancer grading

被引:13
作者
Conti, Francesco [1 ,2 ]
D'Acunto, Mario [3 ]
Caudai, Claudia [1 ]
Colantonio, Sara [1 ]
Gaeta, Raffaele [4 ]
Moroni, Davide [1 ]
Pascali, Maria Antonietta [1 ]
机构
[1] Natl Res Council Italy, Inst Informat Sci & Technol, Via G Moruzzi 1, I-56124 Pisa, Italy
[2] Univ Pisa, Dept Math, Largo B Pontecorvo, I-56126 Pisa, Italy
[3] Natl Res Council Italy, Inst Biophys, Via G Moruzzi 1, I-56124 Pisa, Italy
[4] Univ Pisa, Dept Surg Med Mol Pathol & Crit Area, Div Surg Pathol, Via Paradisa 2, I-56124 Pisa, Italy
关键词
DIAGNOSIS;
D O I
10.1038/s41598-023-34457-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the last decade, Raman Spectroscopy is establishing itself as a highly promising technique for the classification of tumour tissues as it allows to obtain the biochemical maps of the tissues under investigation, making it possible to observe changes among different tissues in terms of biochemical constituents (proteins, lipid structures, DNA, vitamins, and so on). In this paper, we aim to show that techniques emerging from the cross-fertilization of persistent homology and machine learning can support the classification of Raman spectra extracted from cancerous tissues for tumour grading. In more detail, topological features of Raman spectra and machine learning classifiers are trained in combination as an automatic classification pipeline in order to select the best-performing pair. The case study is the grading of chondrosarcoma in four classes: cross and leave-one-patient-out validations have been used to assess the classification accuracy of the method. The binary classification achieves a validation accuracy of 81% and a test accuracy of 90%. Moreover, the test dataset has been collected at a different time and with different equipment. Such results are achieved by a support vector classifier trained with the Betti Curve representation of the topological features extracted from the Raman spectra, and are excellent compared with the existing literature. The added value of such results is that the model for the prediction of the chondrosarcoma grading could easily be implemented in clinical practice, possibly integrated into the acquisition system.
引用
收藏
页数:16
相关论文
共 44 条
[1]  
Adams H, 2017, J MACH LEARN RES, V18
[2]   RELATIONSHIP BETWEEN VARIABLE SELECTION AND DATA AUGMENTATION AND A METHOD FOR PREDICTION [J].
ALLEN, DM .
TECHNOMETRICS, 1974, 16 (01) :125-127
[3]  
Ankerst M, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P49
[4]  
Bergholt M.S., 2013, J Gastroint. Dig. Syst. S, V1, P008
[5]   Raman endoscopy for in vivo differentiation between benign and malignant ulcers in the stomach [J].
Bergholt, Mads Sylvest ;
Zheng, Wei ;
Lin, Kan ;
Ho, Khek Yu ;
Teh, Ming ;
Yeoh, Khay Guan ;
So, Jimmy Bok Yan ;
Huang, Zhiwei .
ANALYST, 2010, 135 (12) :3162-3168
[6]   The Accumulated Persistence Function, a New Useful Functional Summary Statistic for Topological Data Analysis, With a View to Brain Artery Trees and Spatial Point Process Applications [J].
Biscio, Christophe A. N. ;
Moller, Jesper .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (03) :671-681
[7]  
Bubenik P, 2015, J MACH LEARN RES, V16, P77
[8]   An algebraic topological method for feature identification [J].
Carlsson, Erik ;
Carlsson, Gunnar ;
De Silva, Vin .
INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2006, 16 (04) :291-314
[9]   TOPOLOGY AND DATA [J].
Carlsson, Gunnar .
BULLETIN OF THE AMERICAN MATHEMATICAL SOCIETY, 2009, 46 (02) :255-308
[10]  
Chazal F., 2014, P 30 ANN S COMPUTATI, P474, DOI [10.1145/2582112.2582128, DOI 10.1145/2582112.2582128]