Mixture of Experts with Entropic Regularization for Data Classification

被引:5
|
作者
Peralta, Billy [1 ]
Saavedra, Ariel [2 ]
Caro, Luis [2 ]
Soto, Alvaro [3 ]
机构
[1] Andres Bello Univ, Dept Engn Sci, Santiago 7500971, Chile
[2] Catholic Univ Temuco, Dept Informat Engn, Temuco 4781312, Chile
[3] Pontificia Univ Catolica Chile, Dept Comp Sci, Santiago 7820436, Chile
关键词
mixture-of-experts; regularization; entropy; classification;
D O I
10.3390/e21020190
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Today, there is growing interest in the automatic classification of a variety of tasks, such as weather forecasting, product recommendations, intrusion detection, and people recognition. Mixture-of-experts is a well-known classification technique; it is a probabilistic model consisting of local expert classifiers weighted by a gate network that is typically based on softmax functions, combined with learnable complex patterns in data. In this scheme, one data point is influenced by only one expert; as a result, the training process can be misguided in real datasets for which complex data need to be explained by multiple experts. In this work, we propose a variant of the regular mixture-of-experts model. In the proposed model, the cost classification is penalized by the Shannon entropy of the gating network in order to avoid a winner-takes-all output for the gating network. Experiments show the advantage of our approach using several real datasets, with improvements in mean accuracy of 3-6% in some datasets. In future work, we plan to embed feature selection into this model.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Understanding Entropic Regularization in GANs
    Reshetova, Daria
    Bai, Yikun
    Wu, Xiugang
    Ozgur, Ayfer
    2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 825 - 830
  • [22] Mixture of Experts for EEG-Based Seizure Subtype Classification
    Du, Zhenbang
    Peng, Ruimin
    Liu, Wenzhong
    Li, Wei
    Wu, Dongrui
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 4781 - 4789
  • [23] A MIXTURE OF EXPERTS APPROACH TOWARDS INTELLIGIBILITY CLASSIFICATION OF PATHOLOGICAL SPEECH
    Gupta, Rahul
    Audhkhasi, Kartik
    Narayanan, Shrikanth
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 1986 - 1990
  • [24] Wavelet/mixture of experts network structure for EEG signals classification
    Uebeyli, Elif Derya
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (03) : 1954 - 1962
  • [25] Root-quatric mixture of experts for complex classification problems
    Abbasi, Elham
    Shiri, Mohammad Ebrahim
    Ghatee, Mehdi
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 53 : 192 - 203
  • [26] ENTROPIC VOLTERRA CLASSIFIER (EVC) FOR USE IN DATA CLASSIFICATION
    LASENBY, J
    FITZGERALD, WJ
    ELECTRONICS LETTERS, 1994, 30 (01) : 53 - 54
  • [27] Entropic Regularization of Markov Decision Processes
    Belousov, Boris
    Peters, Jan
    ENTROPY, 2019, 21 (07)
  • [28] Soft Quantization Using Entropic Regularization
    Lakshmanan, Rajmadan
    Pichler, Alois
    ENTROPY, 2023, 25 (10)
  • [29] Mixture of experts for audio classification: an application to male female classification and musical genre recognition
    Harb, H
    Chen, LM
    Auloge, JY
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1351 - 1354
  • [30] Joint Classification of Hyperspectral Images and LiDAR Data Based on Candidate Pseudo Labels Pruning and Dual Mixture of Experts
    Kong, Yi
    Yu, Shaocai
    Cheng, Yuhu
    Chen, C. L. Philip
    Wang, Xuesong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63