Stacking density estimation and its oversampling method for continuously imbalanced data in chemometrics

被引:0
|
作者
Zhao, Xin-Ru [1 ]
Yi, Lun-Zhao [2 ]
Fu, Guang-Hui [1 ]
机构
[1] Kunming Univ Sci & Technol, Sch Sci, Kunming 650500, Peoples R China
[2] Kunming Univ Sci & Technol, Fac Food Sci & Engn, Kunming 650500, Peoples R China
基金
中国国家自然科学基金;
关键词
Imbalanced regression; Density estimation; Stacking; Oversampling; Rare value prediction; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.chemolab.2025.105366
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Continuously imbalanced data means that the target variable is continuous and its distribution This kind of data is widespread in many practical application areas. However, methods to handle continuously imbalanced data have been relatively scarce, and there is an urgent need corresponding imbalance regression methods to enhance the capability of handling continuously data. Firstly, we propose a Stacking-based density estimation (SDE) method to solve the density problem of continuously imbalanced target variables. SDE links density estimation with the Ensemble algorithm called Stacking, and its core concept is the "fusion of multiple perspectives for accurate Performing SDE enhances the model's understanding of complex data structures and makes it more and accurate in identifying rare values. Subsequently, we investigate an SDE-based oversampling (SDE-OS). SDE-OS uses SDE to synthesize new rare instances in the rare-value region, achieving customization of rare-value additions. In a series of numerical experiments, SDE has been estimated accurately than the kernel density estimation method on ANLL. SDE-OS outperforms conventional methods such as SMOGN and SMOTER in various metrics. Therefore, the proposed SDE and SDE-OS competitive and effective tools for addressing the imbalanced regression problem.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Adaptive Oversampling via Density Estimation for Online Imbalanced Classification
    Lee, Daeun
    Kim, Hyunjoong
    INFORMATION, 2025, 16 (01)
  • [2] Oversampling Method for Imbalanced Data Using Credible Counterfactual
    Gao, Feng
    Song, Mei
    Zhu, Yi
    Computer Engineering and Applications, 2024, 60 (05) : 165 - 171
  • [3] LD-SMOTE: A Novel Local Density Estimation-Based Oversampling Method for Imbalanced Datasets
    Lyu, Jiacheng
    Yang, Jie
    Su, Zhixun
    Zhu, Zilu
    SYMMETRY-BASEL, 2025, 17 (02):
  • [4] A Novel Oversampling Method for Imbalanced Datasets Based on Density Peaks Clustering
    Cao, Jie
    Shi, Yong
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2021, 28 (06): : 1813 - 1819
  • [5] A three-way decision ensemble method for imbalanced data oversampling
    Yan, Yuan Ting
    Wu, Zeng Bao
    Du, Xiu Quan
    Chen, Jie
    Zhao, Shu
    Zhang, Yan Ping
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2019, 107 (1-16) : 1 - 16
  • [6] A new boundary-degree-based oversampling method for imbalanced data
    Yueqi Chen
    Witold Pedrycz
    Jie Yang
    Applied Intelligence, 2023, 53 : 26518 - 26541
  • [7] A new boundary-degree-based oversampling method for imbalanced data
    Chen, Yueqi
    Pedrycz, Witold
    Yang, Jie
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26518 - 26541
  • [8] Generative Oversampling Method for Imbalanced Data on Bearing Fault Detection and Diagnosis
    Suh, Sungho
    Lee, Haebom
    Jo, Jun
    Lukowicz, Paul
    Lee, Yong Oh
    APPLIED SCIENCES-BASEL, 2019, 9 (04):
  • [9] Generative Oversampling Method (GenOMe) for Imbalanced Data on Apnea Detection using ECG Data
    Sanabila, H. R.
    Kusuma, Ilham
    Jatmiko, Wisnu
    2016 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2016, : 572 - 577
  • [10] An oversampling method for multi-class imbalanced data based on composite weights
    Deng, Mingyang
    Guo, Yingshi
    Wang, Chang
    Wu, Fuwei
    PLOS ONE, 2021, 16 (11):