Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

Cited by: 4
Authors
Mang, Loredana Daria [1]
Martinez, Francisco David Gonzalez [1]
Munoz, Damian Martinez [1]
Galan, Sebastian Garcia [1]
Cortina, Raquel [2]
Affiliations
[1] Univ Jaen, Dept Telecommun Engn, Linares 23700, Spain
[2] Univ Oviedo, Dept Comp Sci, Oviedo 33003, Spain
Keywords
classification; adventitious sounds; cochleogram; vision transformers; deep learning; accuracy; LUNG SOUNDS; FRACTAL DIMENSION; TIME-FREQUENCY; NEURAL-NETWORK; CNN MODEL; SEPARATION; FACTORIZATION; SYSTEM;
DOI
10.3390/s24020682
Chinese Library Classification number
O65 [Analytical Chemistry]
Subject classification codes
070302; 081704
Abstract
Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the condition of the respiratory system and identifying abnormalities. The main contribution of this study is to investigate classification performance when the cochleogram is used as input to the Vision Transformer (ViT) architecture; to our knowledge, this is the first time this input-classifier combination has been applied to adventitious sound classification. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by using the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with that of other state-of-the-art CNN approaches using the spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance of combining the cochleogram with ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to ongoing efforts to develop automatic intelligent techniques that significantly increase the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.
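The abstract mentions that ViT applies self-attention to patches of a time-frequency image. As a rough illustration of that patching step only, the sketch below splits a 2-D array into flattened, non-overlapping patches. It is not the authors' implementation: the function name `to_patches`, the 16 x 16 patch size, the 64 x 128 input shape, and the synthetic stand-in for a cochleogram are all assumptions made for this example.

```python
from typing import List

Matrix = List[List[float]]


def to_patches(tf_image: Matrix, patch: int = 16) -> List[List[float]]:
    """Split a 2-D time-frequency image into flattened non-overlapping patches.

    Rows are frequency channels, columns are time frames. Any remainder that
    does not fill a whole patch is cropped (an assumption; padding is an
    equally valid choice).
    """
    n_f = len(tf_image) // patch      # number of patches along frequency
    n_t = len(tf_image[0]) // patch   # number of patches along time
    tokens: List[List[float]] = []
    for i in range(n_f):
        for j in range(n_t):
            # Flatten one patch row by row into a single token vector
            flat = [
                tf_image[i * patch + r][j * patch + c]
                for r in range(patch)
                for c in range(patch)
            ]
            tokens.append(flat)
    return tokens


# Synthetic stand-in for a cochleogram: 64 channels x 128 frames (hypothetical sizes)
demo = [[float(f * 128 + t) for t in range(128)] for f in range(64)]
tokens = to_patches(demo, patch=16)
print(len(tokens), len(tokens[0]))  # 32 256
```

In a full ViT pipeline each flattened patch would then be linearly projected and combined with a positional embedding before the self-attention layers; those stages are omitted here.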
Pages: 23