Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

Cited by: 4
Authors
Mang, Loredana Daria [1]
Martinez, Francisco David Gonzalez [1]
Munoz, Damian Martinez [1]
Galan, Sebastian Garcia [1]
Cortina, Raquel [2]
Affiliations
[1] Univ Jaen, Dept Telecommun Engn, Linares 23700, Spain
[2] Univ Oviedo, Dept Comp Sci, Oviedo 33003, Spain
Keywords
classification; adventitious sounds; cochleogram; vision transformers; deep learning; accuracy; LUNG SOUNDS; FRACTAL DIMENSION; TIME-FREQUENCY; NEURAL-NETWORK; CNN MODEL; SEPARATION; FACTORIZATION; SYSTEM;
DOI
10.3390/s24020682
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry];
Discipline classification code
070302; 081704;
Abstract
Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the condition of the respiratory system and identifying abnormalities. The main contribution of this study is to investigate classification performance when the input data, represented as a cochleogram, is fed to the Vision Transformer (ViT) architecture; to our knowledge, this is the first time this input-classifier combination has been applied to adventitious sound classification. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by using the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with that of state-of-the-art CNN approaches using the spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance of combining the cochleogram and ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to ongoing efforts to develop automatic intelligent techniques that aim to significantly increase the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.
Pages: 23
Related papers
50 in total
  • [21] Vision Transformers for Anomaly Classification and Localization in Optical Networks Using SOP Spectrograms
    Abdelli, Khouloud
    Lonardi, Matteo
    Boitier, Fabien
    Correa, Diego
    Gripp, Jurgen
    Olsson, Samuel
    Layec, Patricia
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2025, 43 (04) : 1902 - 1914
  • [22] Distilling Knowledge From an Ensemble of Vision Transformers for Improved Classification of Breast Ultrasound
    Zhou, George
    Mosadegh, Bobak
    ACADEMIC RADIOLOGY, 2024, 31 (01) : 104 - 120
  • [23] Adaptive Knowledge Distillation for Classification of Hand Images Using Explainable Vision Transformers
    Thanh Thi Nguyen
    Wilson, Campbell
    Dalins, Janis
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-RESEARCH TRACK AND DEMO TRACK, PT VIII, ECML PKDD 2024, 2024, 14948 : 235 - 252
  • [24] Enhancing furcation involvement classification on panoramic radiographs with vision transformers
    Zhang, Xuan
    Guo, Enting
    Liu, Xu
    Zhao, Hong
    Yang, Jie
    Li, Wen
    Wu, Wenlei
    Sun, Weibin
    BMC ORAL HEALTH, 2025, 25 (01)
  • [25] CLASSIFICATION OF BRAIN TISSUES IN HYPERSPECTRAL IMAGES USING VISION TRANSFORMERS
    Cruz-Guerrero, Ines A.
    Mendoza-Chavarria, Juan N.
    Campos-Delgado, Daniel U.
    Fabelo, Himar
    Ortega, Samuel
    Marrero Callico, Gustavo
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023
  • [26] Art authentication with vision transformers
    Schaerf, Ludovica
    Postma, Eric
    Popovici, Carina
    NEURAL COMPUTING & APPLICATIONS, 2023, 36 (20) : 11849 - 11858
  • [27] Advancing precision in breast cancer detection: a fusion of vision transformers and CNNs for calcification mammography classification
    Boudouh, Saida Sarra
    Bouakkaz, Mustapha
    APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8170 - 8183
  • [28] Lv-Adapter: Adapting Vision Transformers for Visual Classification with Linear-layers and Vectors
    Xu, Guangyi
    Ye, Junyong
    Liu, Xinyuan
    Wen, Xubin
    Li, Youwei
    Wang, Jingjing
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 246
  • [29] Person Identification and Gender Classification Based on Vision Transformers for Periocular Images
    Suravarapu, Vasu Krishna
    Patil, Hemprasad Yashwant
    APPLIED SCIENCES-BASEL, 2023, 13 (05)
  • [30] Optimizing Vision Transformers for Histopathology: Pretraining and Normalization in Breast Cancer Classification
    Baroni, Giulia Lucrezia
    Rasotto, Laura
    Roitero, Kevin
    Tulisso, Angelica
    Di Loreto, Carla
    Della Mea, Vincenzo
    JOURNAL OF IMAGING, 2024, 10 (05)