GAN-based one dimensional medical data augmentation

被引:23
作者
Zhang, Ye [1 ]
Wang, Zhixiang [2 ]
Zhang, Zhen [2 ]
Liu, Junzhuo [1 ]
Feng, Ying [3 ]
Wee, Leonard [2 ]
Dekker, Andre [2 ]
Chen, Qiaosong [1 ]
Traverso, Alberto [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Key Lab Data Engn & Visual Comp, Chongqing 400065, Peoples R China
[2] Maastricht Univ, GROW Sch Oncol, Dept Radiat Oncol Maastro, Med Ctr, Maastricht, Netherlands
[3] Capital Med Univ, Beijing Friendship Hosp, Dept Ultrasound, Beijing, Peoples R China
关键词
Generative adversarial networks; SMOTE; Medical data augmentation; Deep learning; Artificial intelligence;
D O I
10.1007/s00500-023-08345-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the continuous development of human life and society, the medical field is constantly improving. However, modern medicine still faces many limitations, including challenging and previously unsolvable problems. In these cases, artificial intelligence (AI) can provide solutions. The research and application of generative adversarial networks (GAN) are a clear example. While most researchers focus on image augmentation, there are few one-dimensional data augmentation examples. The radiomics feature extracted from RT and CT images is one-dimensional data. As far as we know, we are the first to apply the WGAN-GP algorithm to generate radiomics data in the medical field. In this paper, we input a portion of the original real data samples into the model. The model learns the distribution of the input data samples and generates synthetic data samples with similar distribution to the original real data, which can solve the problem of obtaining annotated medical data samples. We have conducted experiments on the public dataset Heart Disease Cleveland and the private dataset. Compared with the traditional method of Synthetic Minority Oversampling Technique (SMOTE) and common GAN for data augmentation, our method has significantly improved the AUC and SEN values under different data proportions. At the same time, our method has also shown varying levels of improvement in ACC and SPE values. This demonstrates that our method is effective and feasible.
引用
收藏
页码:10481 / 10491
页数:11
相关论文
共 36 条
[1]  
Arjovsky M., 2017, Towards principled methods for training generative adversarial networks, DOI DOI 10.48550/ARXIV.1701.04862
[2]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[3]   MWMOTE-Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning [J].
Barua, Sukarna ;
Islam, Md. Monirul ;
Yao, Xin ;
Murase, Kazuyuki .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (02) :405-425
[4]  
Bhagat V, 2019, 2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), P574, DOI 10.1109/ICIIP47207.2019.8985892
[5]   CT-based radiomics for differentiating renal tumours: a systematic review [J].
Bhandari, Abhishta ;
Ibrahim, Muhammad ;
Sharma, Chinmay ;
Liong, Rebecca ;
Gustafson, Sonja ;
Prior, Marita .
ABDOMINAL RADIOLOGY, 2021, 46 (05) :2052-2063
[6]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[7]   Cross-efficiency aggregation method based on prospect consensus process [J].
Chen, Lei ;
Wang, Ying-Ming ;
Huang, Yan .
ANNALS OF OPERATIONS RESEARCH, 2020, 288 (01) :115-135
[8]  
Cui S, 2017, 2017 2 IEEE INT C CO
[9]   RACOG and wRACOG: Two Probabilistic Oversampling Techniques [J].
Das, Barnan ;
Krishnan, Narayanan C. ;
Cook, Diane J. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (01) :222-234
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848