SAFE: Unsupervised image feature extraction using self-attention based feature extraction network

Citations: 0

Authors:

Choi, Yeoung Je ^{[1]}

Lee, Gyeong Taek ^{[2]}

Kim, Chang Ouk ^{[1,3]}

Affiliations:

[1] Yonsei Univ, Dept Ind Engn, Seoul, South Korea

[2] Gachon Univ, Dept Mech Smart & Ind Engn, Seongnam, South Korea

[3] Yonsei Univ, Dept Ind Engn, Seoul 03722, South Korea

Source:

EXPERT SYSTEMS | 2024 / Vol. 41 / Issue 08

Funding:

National Research Foundation, Singapore;

Keywords:

autoencoder; deep learning; feature representation; image processing; self-attention mechanism;

DOI:

10.1111/exsy.13583

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The ability to extract high-quality features from data is critical for machine learning applications. With the development of deep learning, various methods have been proposed for image feature extraction, and unsupervised techniques have gained popularity because they operate without response variables. Autoencoders with encoder-decoder architectures are a common example of such techniques, but they are limited by the lack of a proportional relationship between the model's reconstruction performance and the encoder's feature extraction performance. If the decoder is composed of multiple layers and mapping to a higher dimension is easier, the feature extraction performance of the encoder is likely to decrease. However, previous research has not adequately addressed this limitation. This study identifies the limitations of conventional unsupervised feature extraction techniques that use the encoder-decoder architecture and proposes a novel feature extraction technique called SAFE, which uses a self-attention mechanism to eliminate decoder effects and improve the performance of the encoder. To validate the effectiveness of the proposed model, we conducted experiments on diverse datasets (MNIST, Fashion MNIST, SVHN, and WM811K). The results showed that, in the classification problem, the proposed method achieved on average 2%-10% higher accuracy and F-measure than existing feature extraction techniques. While our research has limitations, specifically in its applicability only to the selection of image features, future studies should explore its potential application in various fields.


Pages: 16
