共 50 条
AWGAN: An adaptive weighting GAN approach for oversampling imbalanced datasets
被引:21
作者:

Guan, Shaopeng
论文数: 0 引用数: 0
h-index: 0
机构:
Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China

Zhao, Xiaoyan
论文数: 0 引用数: 0
h-index: 0
机构:
Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China

Xue, Yuewei
论文数: 0 引用数: 0
h-index: 0
机构:
Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China

Pan, Hao
论文数: 0 引用数: 0
h-index: 0
机构:
Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China
机构:
[1] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Peoples R China
关键词:
Imbalanced dataset;
Oversampling technique;
Generative adversarial networks;
Overlapping;
Intra-class imbalance;
SMOTE;
D O I:
10.1016/j.ins.2024.120311
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Oversampling is a widely employed technique for addressing imbalanced datasets, facing challenges like class overlaps, intra-class imbalance, and noise. In this paper, we introduce an adaptive weighted oversampling algorithm grounded in generative adversarial networks, which we term AWGAN. To begin, our method computes the local and global densities for each instance, confirming its distribution within its local neighborhood, thereby enabling accurate identification and elimination of noisy instances. Subsequently, we devise a weight calculation strategy based on boundary division. Minority class instances are classified into safe and boundary instances, and weights are calculated based on the density of each instance and its distance from the surrounding instances, assigning different weights to overlapping and non -overlapping regions, and sparse and dense region instances, in order to solve the problems of class overlap and intraclass imbalance. Finally, GAN is used to construct a balanced dataset by adaptively generating minority class instances that match the real data distribution based on the weights. We evaluate AWGAN against six traditional oversampling methods and five GAN-based oversampling methods. The experimental results demonstrate that AWGAN significantly enhances classifier performance, as evident in its F1 -Score, AUC, G -mean, and MCC on 21 diverse datasets.
引用
收藏
页数:24
相关论文
共 50 条
[1]
RN-SMOTE: Reduced Noise SMOTE based on DBSCAN for enhancing imbalanced data classification
[J].
Arafa, Ahmed
;
El-Fishawy, Nawal
;
Badawy, Mohammed
;
Radad, Marwa
.
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES,
2022, 34 (08)
:5059-5074

Arafa, Ahmed
论文数: 0 引用数: 0
h-index: 0
机构:
Menoufia Univ, Fac Elect Engn, El Gish St,Box 32951, Menoufia 32951, Egypt Menoufia Univ, Fac Elect Engn, El Gish St,Box 32951, Menoufia 32951, Egypt

El-Fishawy, Nawal
论文数: 0 引用数: 0
h-index: 0
机构:
Menoufia Univ, Fac Elect Engn, El Gish St,Box 32951, Menoufia 32951, Egypt Menoufia Univ, Fac Elect Engn, El Gish St,Box 32951, Menoufia 32951, Egypt

论文数: 引用数:
h-index:
机构:

Radad, Marwa
论文数: 0 引用数: 0
h-index: 0
机构:
Menoufia Univ, Fac Elect Engn, El Gish St,Box 32951, Menoufia 32951, Egypt Menoufia Univ, Fac Elect Engn, El Gish St,Box 32951, Menoufia 32951, Egypt
[2]
SMOTE-LOF for noise identification in imbalanced data classification
[J].
Asniar, Nur Ulfa
;
Maulidevi, Nur Ulfa
;
Surendro, Kridanto
.
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES,
2022, 34 (06)
:3413-3423

Asniar, Nur Ulfa
论文数: 0 引用数: 0
h-index: 0
机构:
Inst Teknol Bandung, Sch Elect Engn & Informat, Jl Ganesha 10, Bandung, Indonesia
Telkom Univ, Sch Appl Sci, Bandung, Indonesia Inst Teknol Bandung, Sch Elect Engn & Informat, Jl Ganesha 10, Bandung, Indonesia

Maulidevi, Nur Ulfa
论文数: 0 引用数: 0
h-index: 0
机构:
Inst Teknol Bandung, Sch Elect Engn & Informat, Jl Ganesha 10, Bandung, Indonesia
PUI PT AI VLB Artificial Intelligence Vis, Nat Language Proc & Big Data Analyt, Malang, Indonesia Inst Teknol Bandung, Sch Elect Engn & Informat, Jl Ganesha 10, Bandung, Indonesia

Surendro, Kridanto
论文数: 0 引用数: 0
h-index: 0
机构:
Inst Teknol Bandung, Sch Elect Engn & Informat, Jl Ganesha 10, Bandung, Indonesia Inst Teknol Bandung, Sch Elect Engn & Informat, Jl Ganesha 10, Bandung, Indonesia
[3]
An Investigation of SMOTE Based Methods for Imbalanced Datasets With Data Complexity Analysis
[J].
Azhar, Nur Athirah
;
Pozi, Muhammad Syafiq Mohd
;
Din, Aniza Mohamed
;
Jatowt, Adam
.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING,
2023, 35 (07)
:6651-6672

Azhar, Nur Athirah
论文数: 0 引用数: 0
h-index: 0
机构:
Malaysian Investment Dev Author, Kuala Lumpur 50470, Malaysia Malaysian Investment Dev Author, Kuala Lumpur 50470, Malaysia

Pozi, Muhammad Syafiq Mohd
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Utara Malaysia, Inst Adv & Smart Digital Opportun, Sch Comp, Bukit Kayu Hitam 06010, Kedah, Malaysia
Univ Kebangsaan Malaysia, Inst IR 4 0, Bangi 43600, Selangor, Malaysia Malaysian Investment Dev Author, Kuala Lumpur 50470, Malaysia

Din, Aniza Mohamed
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Utara Malaysia, Sch Comp, Data Sci Res Lab, Bukit Kayu Hitam 06010, Kedah, Malaysia Malaysian Investment Dev Author, Kuala Lumpur 50470, Malaysia

Jatowt, Adam
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Innsbruck, Digital Sci Ctr, Dept Comp Sci, A-6020 Innsbruck, Tirol, Austria Malaysian Investment Dev Author, Kuala Lumpur 50470, Malaysia
[4]
Rule extraction in unsupervised anomaly detection for model explainability: Application to OneClass SVM
[J].
Barbado, Alberto
;
Corcho, Oscar
;
Benjamins, Richard
.
EXPERT SYSTEMS WITH APPLICATIONS,
2022, 189

Barbado, Alberto
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Politecn Madrid, Dept Inteligencia Artificial, Madrid 28223, Spain
Telefon IoT & Big Data Tech SA, Madrid, Spain Univ Politecn Madrid, Dept Inteligencia Artificial, Madrid 28223, Spain

Corcho, Oscar
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Politecn Madrid, Dept Inteligencia Artificial, Madrid 28223, Spain Univ Politecn Madrid, Dept Inteligencia Artificial, Madrid 28223, Spain

Benjamins, Richard
论文数: 0 引用数: 0
h-index: 0
机构:
Telefon IoT & Big Data Tech SA, Madrid, Spain Univ Politecn Madrid, Dept Inteligencia Artificial, Madrid 28223, Spain
[5]
MWMOTE-Majority Weighted Minority Oversampling Technique for Imbalanced Data Set Learning
[J].
Barua, Sukarna
;
Islam, Md. Monirul
;
Yao, Xin
;
Murase, Kazuyuki
.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING,
2014, 26 (02)
:405-425

Barua, Sukarna
论文数: 0 引用数: 0
h-index: 0
机构:
Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh

Islam, Md. Monirul
论文数: 0 引用数: 0
h-index: 0
机构:
Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh

Yao, Xin
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Birmingham, Nat Computat Grp, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh

Murase, Kazuyuki
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Fukui, Dept Human & Artificial Intelligence Syst, Fukui 9108507, Japan Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, Dhaka 1000, Bangladesh
[6]
LOF: Identifying density-based local outliers
[J].
Breunig, MM
;
Kriegel, HP
;
Ng, RT
;
Sander, J
.
SIGMOD RECORD,
2000, 29 (02)
:93-104

Breunig, MM
论文数: 0 引用数: 0
h-index: 0
机构: Univ Munich, Inst Comp Sci, D-80538 Munich, Germany

Kriegel, HP
论文数: 0 引用数: 0
h-index: 0
机构: Univ Munich, Inst Comp Sci, D-80538 Munich, Germany

Ng, RT
论文数: 0 引用数: 0
h-index: 0
机构: Univ Munich, Inst Comp Sci, D-80538 Munich, Germany

Sander, J
论文数: 0 引用数: 0
h-index: 0
机构: Univ Munich, Inst Comp Sci, D-80538 Munich, Germany
[7]
SMOTE: Synthetic minority over-sampling technique
[J].
Chawla, Nitesh V.
;
Bowyer, Kevin W.
;
Hall, Lawrence O.
;
Kegelmeyer, W. Philip
. 2002, American Association for Artificial Intelligence (16)

Chawla, Nitesh V.
论文数: 0 引用数: 0
h-index: 0
机构:
Department of Computer Science and Engineering, ENB 118, University of South Florida, 4202 E. Fowler Ave, Tampa, FL 33620-5399, United States Department of Computer Science and Engineering, ENB 118, University of South Florida, 4202 E. Fowler Ave, Tampa, FL 33620-5399, United States

Bowyer, Kevin W.
论文数: 0 引用数: 0
h-index: 0
机构:
Department of Computer Science and Engineering, 384 Fitzpatrick Hall, University of Notre Dame, Notre Dame, IN 46556, United States Department of Computer Science and Engineering, ENB 118, University of South Florida, 4202 E. Fowler Ave, Tampa, FL 33620-5399, United States

Hall, Lawrence O.
论文数: 0 引用数: 0
h-index: 0
机构:
Department of Computer Science and Engineering, ENB 118, University of South Florida, 4202 E. Fowler Ave, Tampa, FL 33620-5399, United States Department of Computer Science and Engineering, ENB 118, University of South Florida, 4202 E. Fowler Ave, Tampa, FL 33620-5399, United States

Kegelmeyer, W. Philip
论文数: 0 引用数: 0
h-index: 0
机构:
Sandia National Laboratories, Biosystems Research Department, MS 9951, P.O. Box 969, Livermore, CA, United States Department of Computer Science and Engineering, ENB 118, University of South Florida, 4202 E. Fowler Ave, Tampa, FL 33620-5399, United States
[8]
Class-Imbalanced Deep Learning via a Class-Balanced Ensemble
[J].
Chen, Zhi
;
Duan, Jiang
;
Kang, Li
;
Qiu, Guoping
.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS,
2022, 33 (10)
:5626-5640

Chen, Zhi
论文数: 0 引用数: 0
h-index: 0
机构:
Southwestern Univ Finance & Econ, Blockchain Res Ctr China, Sch Econ Informat Engn, Chengdu 611130, Peoples R China Southwestern Univ Finance & Econ, Blockchain Res Ctr China, Sch Econ Informat Engn, Chengdu 611130, Peoples R China

Duan, Jiang
论文数: 0 引用数: 0
h-index: 0
机构:
Southwestern Univ Finance & Econ, Blockchain Res Ctr China, Sch Econ Informat Engn, Chengdu 611130, Peoples R China Southwestern Univ Finance & Econ, Blockchain Res Ctr China, Sch Econ Informat Engn, Chengdu 611130, Peoples R China

Kang, Li
论文数: 0 引用数: 0
h-index: 0
机构:
Southwestern Univ Finance & Econ, Blockchain Res Ctr China, Sch Econ Informat Engn, Chengdu 611130, Peoples R China Southwestern Univ Finance & Econ, Blockchain Res Ctr China, Sch Econ Informat Engn, Chengdu 611130, Peoples R China

Qiu, Guoping
论文数: 0 引用数: 0
h-index: 0
机构:
Shenzhen Univ, Coll Elect & Informat Engn, Guangdong Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China
Shenzhen Univ, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518060, Peoples R China
Univ Nottingham, Sch Comp Sci, Nottingham NG8 1BB, England Southwestern Univ Finance & Econ, Blockchain Res Ctr China, Sch Econ Informat Engn, Chengdu 611130, Peoples R China
[9]
Class-overlap undersampling based on Schur decomposition for Class-imbalance problems
[J].
Dai, Qi
;
Liu, Jian-wei
;
Shi, Yong-hui
.
EXPERT SYSTEMS WITH APPLICATIONS,
2023, 221

Dai, Qi
论文数: 0 引用数: 0
h-index: 0
机构:
China Univ Petr, Coll Informat Sci & Engn, Dept Automat, Beijing, Peoples R China China Univ Petr, Coll Informat Sci & Engn, Dept Automat, Beijing, Peoples R China

Liu, Jian-wei
论文数: 0 引用数: 0
h-index: 0
机构:
China Univ Petr, Coll Informat Sci & Engn, Dept Automat, Beijing, Peoples R China
China Univ Petr, 260 mailbox, Beijing 102249, Peoples R China China Univ Petr, Coll Informat Sci & Engn, Dept Automat, Beijing, Peoples R China

Shi, Yong-hui
论文数: 0 引用数: 0
h-index: 0
机构:
North China Univ Sci & Technol, Coll Sci, Tangshan, Peoples R China China Univ Petr, Coll Informat Sci & Engn, Dept Automat, Beijing, Peoples R China
[10]
Class-imbalanced positive instances augmentation via three-line hybrid
[J].
Dai, Qi
;
Liu, Jian-wei
;
Yang, Jia-peng
.
KNOWLEDGE-BASED SYSTEMS,
2022, 257

Dai, Qi
论文数: 0 引用数: 0
h-index: 0
机构:
China Univ Petr, Coll Informat Sci & Engn, Dept Automation, Beijing, Peoples R China China Univ Petr, Coll Informat Sci & Engn, Dept Automation, Beijing, Peoples R China

Liu, Jian-wei
论文数: 0 引用数: 0
h-index: 0
机构:
China Univ Petr, Coll Informat Sci & Engn, Dept Automation, Beijing, Peoples R China
China Univ Petr, 260 mailbox, Beijing 102249, Peoples R China China Univ Petr, Coll Informat Sci & Engn, Dept Automation, Beijing, Peoples R China

Yang, Jia-peng
论文数: 0 引用数: 0
h-index: 0
机构:
North China Univ Sci & Technol NCUST, Coll Sci, Tangshan, Peoples R China China Univ Petr, Coll Informat Sci & Engn, Dept Automation, Beijing, Peoples R China