An oversampling method for multi-class imbalanced data based on composite weights
被引:9
|
作者:
Deng, Mingyang
论文数: 0引用数: 0
h-index: 0
机构:
Changan Univ, Sch Automobile, Xian, Peoples R China
Changchun Univ Technol, Coll Automobile Engn, Coll Humanities & Informat, Changchun, Peoples R ChinaChangan Univ, Sch Automobile, Xian, Peoples R China
Deng, Mingyang
[1
,2
]
Guo, Yingshi
论文数: 0引用数: 0
h-index: 0
机构:
Changan Univ, Sch Automobile, Xian, Peoples R ChinaChangan Univ, Sch Automobile, Xian, Peoples R China
Guo, Yingshi
[1
]
Wang, Chang
论文数: 0引用数: 0
h-index: 0
机构:
Changan Univ, Sch Automobile, Xian, Peoples R ChinaChangan Univ, Sch Automobile, Xian, Peoples R China
Wang, Chang
[1
]
Wu, Fuwei
论文数: 0引用数: 0
h-index: 0
机构:
Changan Univ, Sch Automobile, Xian, Peoples R ChinaChangan Univ, Sch Automobile, Xian, Peoples R China
Wu, Fuwei
[1
]
机构:
[1] Changan Univ, Sch Automobile, Xian, Peoples R China
[2] Changchun Univ Technol, Coll Automobile Engn, Coll Humanities & Informat, Changchun, Peoples R China
来源:
PLOS ONE
|
2021年
/
16卷
/
11期
基金:
国家重点研发计划;
中国国家自然科学基金;
关键词:
ALGORITHM;
CLASSIFICATION;
SMOTE;
D O I:
10.1371/journal.pone.0259227
中图分类号:
O [数理科学和化学];
P [天文学、地球科学];
Q [生物科学];
N [自然科学总论];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
To solve the oversampling problem of multi-class small samples and to improve their classification accuracy, we develop an oversampling method based on classification ranking and weight setting. The designed oversampling algorithm sorts the data within each class of dataset according to the distance from original data to the hyperplane. Furthermore, iterative sampling is performed within the class and inter-class sampling is adopted at the boundaries of adjacent classes according to the sampling weight composed of data density and data sorting. Finally, information assignment is performed on all newly generated sampling data. The training and testing experiments of the algorithm are conducted by using the UCI imbalanced datasets, and the established composite metrics are used to evaluate the performance of the proposed algorithm and other algorithms in comprehensive evaluation method. The results show that the proposed algorithm makes the multi-class imbalanced data balanced in terms of quantity, and the newly generated data maintain the distribution characteristics and information properties of the original samples. Moreover, compared with other algorithms such as SMOTE and SVMOM, the proposed algorithm has reached a higher classification accuracy of about 90%. It is concluded that this algorithm has high practicability and general characteristics for imbalanced multi-class samples.
机构:
Univ Southern Mississippi, Sch Comp Sci & Comp Engn, Hattiesburg, MS 39406 USAZhengzhou Univ, Natl Supercomp Ctr Zhengzhou, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
Luttrell, Joseph
Gong, Ping
论文数: 0引用数: 0
h-index: 0
机构:
US Army Engineer Res & Dev Ctr, Environm Lab, Vicksburg, MS 39180 USAZhengzhou Univ, Natl Supercomp Ctr Zhengzhou, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
Gong, Ping
Zhang, Chaoyang
论文数: 0引用数: 0
h-index: 0
机构:
Univ Southern Mississippi, Sch Comp Sci & Comp Engn, Hattiesburg, MS 39406 USAZhengzhou Univ, Natl Supercomp Ctr Zhengzhou, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
机构:
Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
Minist Narcot Control, Antinarcot Force, Islamabad 46000, PakistanUniv Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
Palli, Abdul Sattar
Jaafar, Jafreezal
论文数: 0引用数: 0
h-index: 0
机构:
Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
UTP, Ctr Res Data Sci, Seri Iskandar 32610, Perak, MalaysiaUniv Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
Jaafar, Jafreezal
Hashmani, Manzoor Ahmed
论文数: 0引用数: 0
h-index: 0
机构:
Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
UTP, High Performance Cloud Comp Ctr HPC3, Seri Iskandar 32610, Perak, MalaysiaUniv Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
Hashmani, Manzoor Ahmed
Gomes, Heitor Murilo
论文数: 0引用数: 0
h-index: 0
机构:
Univ Waikato Wellington, AI Inst, Hamilton 3240, New Zealand
Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6012, New ZealandUniv Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
Gomes, Heitor Murilo
Gilal, Abdul Rehman
论文数: 0引用数: 0
h-index: 0
机构:
Univ Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
Sukkur IBA Univ, Dept Comp Sci, Sukkur 65200, Sindh, PakistanUniv Teknol PETRONAS, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia