Digit Recognition Applied to Reconstructed Audio Signals Using Deep Learning

被引:0
|
作者
Toufa, Anastasia-Sotiria [1 ]
Kotropoulos, Constantine [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
关键词
D O I
10.1109/ICPR48806.2021.9413183
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Compressed sensing allows signal reconstruction from a few measurements. This work proposes a complete pipeline for digit recognition applied to audio reconstructed signals. The reconstruction procedure exploits the assumption that the original signal lies in the range of a generator. A pretrained generator of a Generative Adversarial Network generates audio digits. A new method for reconstruction is proposed, using only the most active segment of the signal, i.e., the segment with the highest energy. The underlying assumption is that such segment offers a more compact representation, preserving the meaningful content of signal. Cases when the reconstruction produces noise, instead of digit, are treated as outliers. In order to detect and reject them, three unsupervised indicators are used, namely, the total energy of reconstructed signal, the predictions of an one-class Support Vector Machine, and the confidence of a pretrained classifier used for recognition. This classifier is based on neural networks architectures and is pretrained on original audio recordings, employing three input representations, i.e., raw audio, spectrogram, and gammatonegram. Experiments are conducted, analyzing both the quality of reconstruction and the performance of classifiers in digit recognition, demonstrating that the proposed method yields higher performance in both the quality of reconstruction and digit recognition accuracy.
引用
收藏
页码:3050 / 3057
页数:8
相关论文
共 50 条
  • [1] Analysis of Audio Signals Using Deep Learning Algorithms Applied to COVID Diagnostic Systems
    Bello Rivera, Miguel Angel
    Quintero Flores, Perfecto Malaquias
    Perez Loaiza, Rodolfo Eleazar
    Gomez Rivera, Leticia
    2022 IEEE MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE (ENC), 2022,
  • [2] Handwritten Geez Digit Recognition Using Deep Learning
    Ali Nur, Mukerem
    Abebe, Mesfin
    Rajendran, Rajesh Sharma
    APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2022, 2022
  • [3] A Novel Technique for Handwritten Digit Recognition Using Deep Learning
    Ahmed, Syed Sohail
    Mehmood, Zahid
    Awan, Imran Ahmad
    Yousaf, Rehan Mehmood
    JOURNAL OF SENSORS, 2023, 2023
  • [4] Audio Recognition Using Deep Learning for Edge Devices
    Kulkarni, Aditya
    Jabade, Vaishali
    Patil, Aniket
    ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT II, 2022, 1614 : 186 - 198
  • [5] Deep Learning for Activity Recognition Using Audio and Video
    Reinolds, Francisco
    Neto, Cristiana
    Machado, Jose
    ELECTRONICS, 2022, 11 (05)
  • [6] Deep Learning Accelerator on FPGA Using Handwritten Digit Recognition for Example
    Vo Thanh Phat
    Pham Huu Tho
    Ha Binh Dat
    Chou, Chung-Han
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [7] Improved Handwritten Digit Recognition method using Deep Learning Algorithm
    Jantayev, Ruslan
    Amirgaliyev, Yedilkhan
    2019 15TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION (ICECCO), 2019,
  • [8] Persian handwritten digit, character and word recognition using deep learning
    Bonyani, Mahdi
    Jahangard, Simindokht
    Daneshmand, Morteza
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2021, 24 (1-2) : 133 - 143
  • [9] Persian handwritten digit, character and word recognition using deep learning
    Mahdi Bonyani
    Simindokht Jahangard
    Morteza Daneshmand
    International Journal on Document Analysis and Recognition (IJDAR), 2021, 24 : 133 - 143
  • [10] Speech Emotion Recognition Using Deep Learning on audio recordings
    Suganya, S.
    Charles, E. Y. A.
    2019 19TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER - 2019), 2019,