Classification of Adventitious Sounds Combining Cochleogram and Vision Transformers

被引：4

作者：

Mang, Loredana Daria ^{[1
]}

Martinez, Francisco David Gonzalez ^{[1
]}

Munoz, Damian Martinez ^{[1
]}

Galan, Sebastian Garcia ^{[1
]}

Cortina, Raquel ^{[2
]}

机构：

[1] Univ Jaen, Dept Telecommun Engn, Linares 23700, Spain

[2] Univ Oviedo, Dept Comp Sci, Oviedo 33003, Spain

来源：

SENSORS | 2024年 / 24卷 / 02期

关键词：

classification; adventitious sounds; cochleogram; vision transformers; deep learning; accuracy; LUNG SOUNDS; FRACTAL DIMENSION; TIME-FREQUENCY; NEURAL-NETWORK; CNN MODEL; SEPARATION; FACTORIZATION; SYSTEM;

D O I：

10.3390/s24020682

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Early identification of respiratory irregularities is critical for improving lung health and reducing global mortality rates. The analysis of respiratory sounds plays a significant role in characterizing the respiratory system's condition and identifying abnormalities. The main contribution of this study is to investigate the performance when the input data, represented by cochleogram, is used to feed the Vision Transformer (ViT) architecture, since this input-classifier combination is the first time it has been applied to adventitious sound classification to our knowledge. Although ViT has shown promising results in audio classification tasks by applying self-attention to spectrogram patches, we extend this approach by applying the cochleogram, which captures specific spectro-temporal features of adventitious sounds. The proposed methodology is evaluated on the ICBHI dataset. We compare the classification performance of ViT with other state-of-the-art CNN approaches using spectrogram, Mel frequency cepstral coefficients, constant-Q transform, and cochleogram as input data. Our results confirm the superior classification performance combining cochleogram and ViT, highlighting the potential of ViT for reliable respiratory sound classification. This study contributes to the ongoing efforts in developing automatic intelligent techniques with the aim to significantly augment the speed and effectiveness of respiratory disease detection, thereby addressing a critical need in the medical field.

引用

页数：23

共 50 条

[1] Cochleogram-based adventitious sounds classification using convolutional neural networks
Mang, L. D.
Canadas-Quesada, F. J.
Carabias-Orti, J. J.
Combarro, E. F.
Ranilla, J.
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 82
[2] Quantum Vision Transformers for Quark-Gluon Classification
Comajoan Cara, Marcal
Dahale, Gopal Ramesh
Dong, Zhongtian
Forestano, Roy T.
Gleyzer, Sergei
Justice, Daniel
Kong, Kyoungchul
Magorsch, Tom
Matchev, Konstantin T.
Matcheva, Katia
Unlu, Eyup B.
AXIOMS, 2024, 13 (05)
[3] WASTE CLASSIFICATION USING VISION TRANSFORMERS
Puchianu, Dan Constantin
SCIENTIFIC PAPERS-SERIES E-LAND RECLAMATION EARTH OBSERVATION & SURVEYING ENVIRONMENTAL ENGINEERING, 2024, 13 : 727 - 733
[4] Methodology for Automatic Classification of Adventitious Lung Sounds
Riella, R. J.
Nohama, P.
Maia, J. M.
WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING, VOL 25, PT 4: IMAGE PROCESSING, BIOSIGNAL PROCESSING, MODELLING AND SIMULATION, BIOMECHANICS, 2010, 25 : 1392 - 1395
[5] Hybrid Quantum Vision Transformers for Event Classification in High Energy Physics
Unlu, Eyup B.
Cara, Marcal Comajoan
Dahale, Gopal Ramesh
Dong, Zhongtian
Forestano, Roy T.
Gleyzer, Sergei
Justice, Daniel
Kong, Kyoungchul
Magorsch, Tom
Matchev, Konstantin T.
Matcheva, Katia
AXIOMS, 2024, 13 (03)
[6] Vision Transformers for Brain Tumor Classification
Simon, Eliott
Briassouli, Alexia
PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES (BIOIMAGING), VOL 2, 2021, : 123 - 130
[7] Classification and analysis of non-stationary characteristics of crackle and rhonchus lung adventitious sounds
Icer, Semra
Gengec, Serife
DIGITAL SIGNAL PROCESSING, 2014, 28 : 18 - 27
[8] Combining EfficientNet and Vision Transformers for Video Deepfake Detection
Coccomini, Davide Alessandro
Messina, Nicola
Gennaro, Claudio
Falchi, Fabrizio
IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT III, 2022, 13233 : 219 - 229
[9] Vision Transformers for Remote Sensing Image Classification
Bazi, Yakoub
Bashmal, Laila
Rahhal, Mohamad M. Al
Dayil, Reham Al
Ajlan, Naif Al
REMOTE SENSING, 2021, 13 (03) : 1 - 20
[10] Vision Transformers Applied to Indoor Room Classification
Veiga, Bruno
Pinto, Tiago
Teixeira, Ruben
Ramos, Carlos
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT II, 2023, 14116 : 561 - 573

← 1 2 3 4 5 →