SAFE: Unsupervised image feature extraction using self-attention based feature extraction network

被引:0
作者
Choi, Yeoung Je [1 ]
Lee, Gyeong Taek [2 ]
Kim, Chang Ouk [1 ,3 ]
机构
[1] Yonsei Univ, Dept Ind Engn, Seoul, South Korea
[2] Gachon Univ, Dept Mech Smart & Ind Engn, Seongnam, South Korea
[3] Yonsei Univ, Dept Ind Engn, Seoul 03722, South Korea
基金
新加坡国家研究基金会;
关键词
autoencoder; deep learning; feature representation; image processing; self-attention mechanism;
D O I
10.1111/exsy.13583
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to extract high-quality features from data is critical for machine learning applications. With the development of deep learning, various methods have been developed for image feature extraction, and unsupervised techniques have gained popularity due to their ability to operate without response variables. Autoencoders with encoder-decoder architectures are a common example of such techniques, but they are limited by a lack of proportional relationship between model reconstruction and encoder feature extraction performance. If the decoder is composed of multiple layers and mapping to a higher dimension is easier, the feature extraction performance of the encoder is likely to decrease. However, previous research has not adequately addressed this limitation. This study identifies the limitations of conventional unsupervised feature extraction techniques that utilize the encoder-decoder architecture, and proposes a novel feature extraction technique called SAFE, which utilizes a self-attention mechanism to eliminate decoder effects and improve the performance of encoder. To validate the effectiveness of the proposed model, we conducted experiments using diverse datasets (MNIST, Fashion MNIST, SVHN, and WM811K). The results of the experiments demonstrated that our proposed method exhibited, on average, 2%-10% higher performance in terms of accuracy and F-measure compared to the existing feature extraction techniques in the classification problem. While our research has limitations, specifically in its applicability only to the selection of image features, future studies should be undertaken to explore its potential application in various fields.
引用
收藏
页数:16
相关论文
共 29 条
  • [1] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [2] Baldi P, 2012, P ICML WORKSH UNS TR, P37, DOI DOI 10.1561/2200000006
  • [3] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [4] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [5] Chen Ting, 2020, INT C MACH LEARN, P1597
  • [6] Masked Autoencoders Are Scalable Vision Learners
    He, Kaiming
    Chen, Xinlei
    Xie, Saining
    Li, Yanghao
    Dollar, Piotr
    Girshick, Ross
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15979 - 15988
  • [7] Support vector machines
    Hearst, MA
    [J]. IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1998, 13 (04): : 18 - 21
  • [8] Self-supervised learning for medical image classification: a systematic review and implementation guidelines
    Huang, Shih-Cheng
    Pareek, Anuj
    Jensen, Malte
    Lungren, Matthew P.
    Yeung, Serena
    Chaudhari, Akshay S.
    [J]. NPJ DIGITAL MEDICINE, 2023, 6 (01)
  • [9] Independent component analysis:: algorithms and applications
    Hyvärinen, A
    Oja, E
    [J]. NEURAL NETWORKS, 2000, 13 (4-5) : 411 - 430
  • [10] An enhanced deep learning approach for brain cancer MRI images classification using residual networks
    Ismael, Sarah Ali Abdelaziz
    Mohammed, Ammar
    Hefny, Hesham
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2020, 102