Spoken Language Identification with Deep Convolutional Neural Network and Data Augmentation

被引:0
|
作者
Korkut, Can [1 ]
Haznedaroglu, Ali [1 ]
Arslan, Levent M. [1 ,2 ]
机构
[1] Sestek, Istanbul, Turkey
[2] Bogazici Univ, Elekt Elekt Muhendisligi Bolumu, Istanbul, Turkey
关键词
Spoken Language Identification; CNN; Data Augmentation; SPEECH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a spoken language detection system based on deep convolutional neural networks is presented. The neural network model is trained and tested on a speech dataset containing five languages. Speech signals are first converted into mel-spectrogram features and these features are fed into the deep convolutional neural network. Flattened outputs of the deep convolutional network are then fed into a recurrent layer, and a dense layer with softmax activation function is used as an output layer to predict the output language probabilities. This network results in 0.89 F1-score in our test data. We also used a data augmentation method, namely Spec Augment, which increased the F1-score to 0.94.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Application of Deep Convolutional Neural Networks in Attention-Deficit/Hyperactivity Disorder Classification: Data Augmentation and Convolutional Neural Network Transfer Learning
    Zhu, Li
    Chang, Weike
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (08) : 1717 - 1724
  • [42] A convolutional neural network approach for gender and language variety identification
    Gomez-Adorno, Helena
    Fuentes-Alba, Roddy
    Markov, Ilia
    Sidorov, Grigori
    Gelbukh, Alexander
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 4845 - 4855
  • [43] Language Identification using Stacked Convolutional Neural Network (SCNN)
    Bohra, Navdeep
    Bhatnagar, Vishal
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 20 - 25
  • [44] Alcoholism Detection by Data Augmentation and Convolutional Neural Network with Stochastic Pooling
    Shui-Hua Wang
    Yi-Ding Lv
    Yuxiu Sui
    Shuai Liu
    Su-Jing Wang
    Yu-Dong Zhang
    Journal of Medical Systems, 2018, 42
  • [45] Roman Amphitheater Classification Using Convolutional Neural Network and Data Augmentation
    Nakouri, Haifa
    PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PDCAT 2021, 2022, 13148 : 476 - 484
  • [46] Data augmentation in EUV lithography simulation based on convolutional neural network
    Tanabe, Hiroyoshi
    Takahashi, Atsushi
    DTCO AND COMPUTATIONAL PATTERNING, 2022, 12052
  • [47] Cut-Thumbnail: A Novel Data Augmentation for Convolutional Neural Network
    Xie, Tianshu
    Cheng, Xuan
    Wang, Xiaomin
    Liu, Minghui
    Deng, Jiali
    Zhou, Tao
    Liu, Ming
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1627 - 1635
  • [48] Alcoholism Detection by Data Augmentation and Convolutional Neural Network with Stochastic Pooling
    Wang, Shui-Hua
    Lv, Yi-Ding
    Sui, Yuxiu
    Liu, Shuai
    Wang, Su-Jing
    Zhang, Yu-Dong
    JOURNAL OF MEDICAL SYSTEMS, 2018, 42 (01)
  • [49] Facial Expression Recognition using Convolutional Neural Network with Data Augmentation
    Ahmed, Tawsin Uddin
    Hossain, Sazzad
    Hossain, Mohammad Shahadat
    Ul Islam, Raihan
    Andersson, Karl
    2019 JOINT 8TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS & VISION (ICIEV) AND 2019 3RD INTERNATIONAL CONFERENCE ON IMAGING, VISION & PATTERN RECOGNITION (ICIVPR) WITH INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING (ABC), 2019, : 336 - 341
  • [50] Convolutional Neural Network and Data Augmentation Method for Electricity Theft Detection
    Zhou, Yu
    Zhang, Xuecen
    Tang, Yi
    Mu, Zhuowen
    Shao, Xuesong
    Li, Yue
    Cai, Qixin
    2021 IEEE IAS INDUSTRIAL AND COMMERCIAL POWER SYSTEM ASIA (IEEE I&CPS ASIA 2021), 2021, : 1525 - 1530