Dimensionality Reduction for Identification of Hepatic Tumor Samples Based on Terahertz Time-Domain Spectroscopy

Cited by: 41
Authors
Liu, Haishun [1, 2]
Zhang, Zhenwei [1, 2]
Zhang, Xin [3 ]
Yang, Yuping [4 ]
Zhang, Zhuoyong [3 ]
Liu, Xiangyi [5 ]
Wang, Fan [5 ]
Han, Yiding [6 ]
Zhang, Cunlin [1, 2]
Affiliations
[1] Capital Normal Univ, Minist Educ, Beijing Key Lab Terahertz Spect & Imaging, Key Lab Terahertz Optoelect, Beijing 100048, Peoples R China
[2] Capital Normal Univ, Dept Phys, Beijing Adv Innovat Ctr Imaging Technol, Beijing 100048, Peoples R China
[3] Capital Normal Univ, Dept Chem, Beijing 100048, Peoples R China
[4] Minzu Univ China, Sch Sci, Beijing 100081, Peoples R China
[5] Capital Med Univ, Beijing Tongren Hosp, Dept Lab Med, Beijing 100730, Peoples R China
[6] Capital Med Univ, Beijing Tongren Hosp, Dept Pathol, Beijing 100730, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Dimensionality reduction; Isomap; locality preserving projections (LPPs); principal component analysis (PCA); terahertz (THz); PULSED SPECTROSCOPY; CLASSIFICATION; CANCER; DISCRIMINATION; ISOMAP;
DOI
10.1109/TTHZ.2018.2813085
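Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract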
Terahertz time-domain spectroscopy (THz-TDS) combined with chemometric methods is proposed for the identification of hepatic tumors. Two linear compression methods, principal component analysis (PCA) and locality preserving projections (LPP), and one nonlinear method, Isomap, were used to reduce the dimensionality of the measured dataset. Among the two-dimensional (2-D) representations produced by these three techniques, only the 2-D Isomap plot separated the two classes for the THz time-domain data, while LPP was able to distinguish the two types of samples from the frequency-domain data. The best classification accuracies on the 2-D time-domain data were 99.81 ± 0.30% and 99.69 ± 0.61%, obtained with Isomap combined with a probabilistic neural network (PNN) and with a support vector machine (SVM), respectively; the best results on the 2-D frequency-domain data were 100.00 ± 0.00% and 99.75 ± 0.32%, obtained with LPP-PNN and LPP-SVM, respectively. The results show that Isomap and LPP are appropriate techniques for capturing the nonlinear manifold of the THz data. THz technology in either the time domain or the frequency domain, coupled with Isomap-PNN or LPP-PNN, offers a potential procedure for identifying hepatic tumors.
Pages: 271-277
Number of pages: 7
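
The abstract describes classifying samples from a 2-D embedding with a probabilistic neural network (PNN). The sketch below illustrates the general PNN idea (a Gaussian Parzen-window classifier) on such 2-D points; it is not the authors' implementation. The synthetic two-cluster data, the class labels "tumor" and "non_tumor", the smoothing parameter sigma = 0.5, and the hold-out split are all assumptions made for illustration, and the embedding step (Isomap or LPP) is taken as already done.

```python
import math
import random


def pnn_predict(train_points, train_labels, query, sigma=0.5):
    """Classify `query` with a Gaussian-kernel PNN (Parzen-window classifier)."""
    sums, counts = {}, {}
    for point, label in zip(train_points, train_labels):
        # Pattern layer: one Gaussian kernel centred on each training sample.
        sq_dist = sum((a - b) ** 2 for a, b in zip(point, query))
        activation = math.exp(-sq_dist / (2.0 * sigma ** 2))
        # Summation layer: accumulate activations per class.
        sums[label] = sums.get(label, 0.0) + activation
        counts[label] = counts.get(label, 0) + 1
    # Decision layer: choose the class with the largest mean activation.
    scores = {label: sums[label] / counts[label] for label in sums}
    return max(scores, key=scores.get)


def make_synthetic_embedding(n_per_class=50, seed=0):
    """Return hypothetical 2-D points standing in for an Isomap/LPP embedding."""
    rng = random.Random(seed)
    points, labels = [], []
    # Assumed cluster centres; real embeddings would come from measured spectra.
    for label, (cx, cy) in (("non_tumor", (-2.0, 0.0)), ("tumor", (2.0, 0.0))):
        for _ in range(n_per_class):
            points.append((rng.gauss(cx, 0.5), rng.gauss(cy, 0.5)))
            labels.append(label)
    return points, labels


points, labels = make_synthetic_embedding()

# Simple hold-out split (an assumption): every fourth sample is used for testing.
test_idx = set(range(0, len(points), 4))
train_X = [p for i, p in enumerate(points) if i not in test_idx]
train_y = [l for i, l in enumerate(labels) if i not in test_idx]
test_X = [p for i, p in enumerate(points) if i in test_idx]
test_y = [l for i, l in enumerate(labels) if i in test_idx]

predictions = [pnn_predict(train_X, train_y, q) for q in test_X]
correct = sum(pred == truth for pred, truth in zip(predictions, test_y))
accuracy = correct / len(test_y)
print(f"hold-out accuracy on synthetic data: {accuracy:.2%}")
```

The accuracy printed here reflects only the well-separated synthetic clusters and says nothing about the figures reported in the abstract. In practice sigma would need tuning, for example by cross-validation.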