Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

Cited by: 4
Authors
Mang, Loredana Daria [1]
Martinez, Francisco David Gonzalez [1]
Munoz, Damian Martinez [1]
Galan, Sebastian Garcia [1]
Cortina, Raquel [2]
Affiliations
[1] Univ Jaen, Dept Telecommun Engn, Linares 23700, Spain
[2] Univ Oviedo, Dept Comp Sci, Oviedo 33003, Spain
Keywords
classification; adventitious sounds; cochleogram; vision transformers; deep learning; accuracy; LUNG SOUNDS; FRACTAL DIMENSION; TIME-FREQUENCY; NEURAL-NETWORK; CNN MODEL; SEPARATION; FACTORIZATION; SYSTEM
DOI
10.3390/s24020682
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Subject classification code
070302; 081704
Abstract
Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the condition of the respiratory system and identifying abnormalities. The main contribution of this study is to investigate classification performance when the input data, represented as a cochleogram, is fed to the Vision Transformer (ViT) architecture; to our knowledge, this is the first time this input-classifier combination has been applied to adventitious sound classification. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by using the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with that of other state-of-the-art CNN approaches using the spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance of combining the cochleogram with ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to ongoing efforts to develop automatic intelligent techniques that aim to significantly improve the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.
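As a rough illustration of what "applying self-attention to patches" of a time-frequency image involves, the sketch below shows only the first step: cutting a 2-D representation (such as a cochleogram) into non-overlapping patches, each flattened into one token vector. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `patchify`, the patch size, and the toy values are hypothetical, and computing the cochleogram itself, the linear projection, and the attention layers are not shown.

```python
from typing import List

Matrix = List[List[float]]


def patchify(image: Matrix, patch_h: int, patch_w: int) -> List[List[float]]:
    """Split a 2-D time-frequency image into non-overlapping patches.

    Each patch is flattened row by row into one vector (one "token").
    Trailing rows/columns that do not fill a whole patch are dropped.
    """
    if patch_h <= 0 or patch_w <= 0:
        raise ValueError("patch dimensions must be positive")
    n_rows = len(image)
    n_cols = len(image[0]) if image else 0
    if any(len(row) != n_cols for row in image):
        raise ValueError("image rows must have equal length")

    tokens: List[List[float]] = []
    # Walk the image in patch-sized steps, top to bottom, left to right.
    for top in range(0, n_rows - patch_h + 1, patch_h):
        for left in range(0, n_cols - patch_w + 1, patch_w):
            patch = [
                image[r][c]
                for r in range(top, top + patch_h)
                for c in range(left, left + patch_w)
            ]
            tokens.append(patch)
    return tokens


# Toy 4x6 "cochleogram" (frequency channels x time frames); values are
# hypothetical placeholders, not data from the paper.
toy = [[float(r * 6 + c) for c in range(6)] for r in range(4)]
tokens = patchify(toy, patch_h=2, patch_w=3)
print(len(tokens), len(tokens[0]))
```

In a full ViT pipeline, each token would then be linearly projected, combined with a positional embedding, and passed through the self-attention layers.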
Pages: 23