Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

被引:4
作者
Mang, Loredana Daria [1 ]
Martinez, Francisco David Gonzalez [1 ]
Munoz, Damian Martinez [1 ]
Galan, Sebastian Garcia [1 ]
Cortina, Raquel [2 ]
机构
[1] Univ Jaen, Dept Telecommun Engn, Linares 23700, Spain
[2] Univ Oviedo, Dept Comp Sci, Oviedo 33003, Spain
关键词
classification; adventitious sounds; cochleogram; vision transformers; deep learning; accuracy; LUNG SOUNDS; FRACTAL DIMENSION; TIME-FREQUENCY; NEURAL-NETWORK; CNN MODEL; SEPARATION; FACTORIZATION; SYSTEM;
D O I
10.3390/s24020682
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the respiratory system's condition and identifying abnormalities. The main contribution of this study is to investigate the performance when the input data, represented by cochleogram, is used to feed the Vision Transformer (ViT) architecture, since this input-classifier combination is the first time it has been applied to adventitious sound classification to our knowledge. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by applying the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with other state-of-the-art CNN approaches using spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance combining cochleogram and ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to the ongoing efforts in developing automatic intelligent techniques with the aim to significantly augment the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Cleft Lip and Palate Classification Through Vision Transformers and Siamese Neural Networks
    Nantha, Oraphan
    Sathanarugsawait, Benjaporn
    Praneetpolgrang, Prasong
    [J]. JOURNAL OF IMAGING, 2024, 10 (11)
  • [42] Data Augmentation in Histopathological Classification: An Analysis Exploring GANs with XAI and Vision Transformers
    Rozendo, Guilherme Botazzo
    Garcia, Bianca Lanconi de Oliveira
    Borgue, Vinicius Augusto Toreli
    Lumini, Alessandra
    Tosta, Thaina Aparecida Azevedo
    do Nascimento, Marcelo Zanchetta
    Neves, Leandro Alves
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [43] Explainable AI and vision transformers for detection and classification of brain tumor: a comprehensive survey
    Khalid M. Hosny
    Mahmoud A. Mohammed
    [J]. Artificial Intelligence Review, 58 (9)
  • [44] A Novel Approach for Breast Tumor MRI Classification: Vision Transformers and Majority Integration
    Xue, Junpei
    Zhou, Leilei
    Chen, Yuchen
    Zheng, Jin-Xia
    Liu, Jiang
    [J]. ICC 2024 - IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2024, : 226 - 231
  • [45] Enhancing sugarcane leaf disease classification using vision transformers over CNNs
    Saritha Miryala
    Krupa Rasane
    [J]. Discover Artificial Intelligence, 5 (1):
  • [46] Taming vision transformers for clinical laryngoscopy assessment
    Zhang, Xinzhu
    Zhao, Jing
    Zong, Daoming
    Ren, Henglei
    Gao, Chunli
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2025, 162
  • [47] Explainable Vision Transformers for Vein Biometric Recognition
    Albano, Rocco
    Giusti, Lorenzo
    Maiorana, Emanuele
    Campisi, Patrizio
    [J]. IEEE ACCESS, 2024, 12 : 60436 - 60446
  • [48] Vision Transformers for Lung Segmentation on CXR Images
    Ghali R.
    Akhloufi M.A.
    [J]. SN Computer Science, 4 (4)
  • [49] Cluster analysis and classification of heart sounds
    Amit, Guy
    Gavriely, Noam
    Intrator, Nathan
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (01) : 26 - 36
  • [50] Through-Ice Acoustic Source Tracking Using Vision Transformers with Ordinal Classification
    Whitaker, Steven
    Barnard, Andrew
    Anderson, George D.
    Havens, Timothy C.
    [J]. SENSORS, 2022, 22 (13)