Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

被引：4

作者：

Mang, Loredana Daria ^{[1
]}

Martinez, Francisco David Gonzalez ^{[1
]}

Munoz, Damian Martinez ^{[1
]}

Galan, Sebastian Garcia ^{[1
]}

Cortina, Raquel ^{[2
]}

机构：

[1] Univ Jaen, Dept Telecommun Engn, Linares 23700, Spain

[2] Univ Oviedo, Dept Comp Sci, Oviedo 33003, Spain

来源：

SENSORS | 2024年 / 24卷 / 02期

关键词：

classification; adventitious sounds; cochleogram; vision transformers; deep learning; accuracy; LUNG SOUNDS; FRACTAL DIMENSION; TIME-FREQUENCY; NEURAL-NETWORK; CNN MODEL; SEPARATION; FACTORIZATION; SYSTEM;

D O I：

10.3390/s24020682

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the respiratory system's condition and identifying abnormalities. The main contribution of this study is to investigate the performance when the input data, represented by cochleogram, is used to feed the Vision Transformer (ViT) architecture, since this input-classifier combination is the first time it has been applied to adventitious sound classification to our knowledge. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by applying the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with other state-of-the-art CNN approaches using spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance combining cochleogram and ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to the ongoing efforts in developing automatic intelligent techniques with the aim to significantly augment the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.

引用

页数：23

共 50 条

[21] Vision Transformers for Anomaly Classification and Localization in Optical Networks Using SOP Spectrograms
Abdelli, Khouloud
Lonardi, Matteo
Boitier, Fabien
Correa, Diego
Gripp, Jurgen
Olsson, Samuel
Layec, Patricia
JOURNAL OF LIGHTWAVE TECHNOLOGY, 2025, 43 (04) : 1902 - 1914
[22] Distilling Knowledge From an Ensemble of Vision Transformers for Improved Classification of Breast Ultrasound
Zhou, George
Mosadegh, Bobak
ACADEMIC RADIOLOGY, 2024, 31 (01) : 104 - 120
[23] Adaptive Knowledge Distillation for Classification of Hand Images Using Explainable Vision Transformers
Thanh Thi Nguyen
Wilson, Campbell
Dalins, Janis
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-RESEARCH TRACK AND DEMO TRACK, PT VIII, ECML PKDD 2024, 2024, 14948 : 235 - 252
[24] Enhancing furcation involvement classification on panoramic radiographs with vision transformers
Zhang, Xuan
Guo, Enting
Liu, Xu
Zhao, Hong
Yang, Jie
Li, Wen
Wu, Wenlei
Sun, Weibin
BMC ORAL HEALTH, 2025, 25 (01):
[25] CLASSIFICATION OF BRAIN TISSUES IN HYPERSPECTRAL IMAGES USING VISION TRANSFORMERS
Cruz-Guerrero, Ines A.
Mendoza-Chavarria, Juan N.
Campos-Delgado, Daniel U.
Fabelo, Himar
Ortega, Samuel
Marrero Callico, Gustavo
2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
[26] Art authentication with vision transformers
Schaerf, Ludovica
Postma, Eric
Popovici, Carina
NEURAL COMPUTING & APPLICATIONS, 2023, 36 (20): : 11849 - 11858
[27] Advancing precision in breast cancer detection: a fusion of vision transformers and CNNs for calcification mammography classification
Boudouh, Saida Sarra
Bouakkaz, Mustapha
APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8170 - 8183
[28] Lv-Adapter: Adapting Vision Transformers for Visual Classification with Linear-layers and Vectors
Xu, Guangyi
Ye, Junyong
Liu, Xinyuan
Wen, Xubin
Li, Youwei
Wang, Jingjing
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 246
[29] Person Identification and Gender Classification Based on Vision Transformers for Periocular Images
Suravarapu, Vasu Krishna
Patil, Hemprasad Yashwant
APPLIED SCIENCES-BASEL, 2023, 13 (05):
[30] Optimizing Vision Transformers for Histopathology: Pretraining and Normalization in Breast Cancer Classification
Baroni, Giulia Lucrezia
Rasotto, Laura
Roitero, Kevin
Tulisso, Angelica
Di Loreto, Carla
Della Mea, Vincenzo
JOURNAL OF IMAGING, 2024, 10 (05)

← 1 2 3 4 5 →