Improving Language-Universal Feature Extraction with Deep Maxout and Convolutional Neural Networks

被引:0
|
作者
Miao, Yajie [1 ]
Metze, Florian [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Sch Comp Sci, Pittsburgh, PA 15213 USA
来源
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 | 2014年
基金
美国国家科学基金会;
关键词
language-universal feature extraction; deep maxout networks; deep convolutional networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When deployed in automated speech recognition (ASR), deep neural networks (DNNs) can be treated as a complex feature extractor plus a simple linear classifier. Previous work has investigated the utility of multilingual DNNs acting as language-universal feature extractors (LUFEs). In this paper, we explore different strategies to further improve LUFEs. First, we replace the standard sigmoid nonlinearity with the recently proposed maxout units. The resulting maxout LUFEs have the nice property of generating sparse feature representations. Second, the convolutional neural network (CNN) architecture is applied to obtain more invariant feature space. We evaluate the performance of LUFEs on a cross-language ASR task. Each of the proposed techniques results in word error rate reduction compared with the existing DNN-based LUFEs. Combining the two methods together brings additional improvement on the target language.
引用
收藏
页码:800 / 804
页数:5
相关论文
共 50 条
  • [21] Hyperspectral Remote Sensing Images Deep Feature Extraction Based on Mixed Feature and Convolutional Neural Networks
    Liu, Jing
    Yang, Zhe
    Liu, Yi
    Mu, Caihong
    REMOTE SENSING, 2021, 13 (13)
  • [22] Topology Reduction in Deep Convolutional Feature Extraction Networks
    Wiatowski, Thomas
    Grohs, Philipp
    Bolcskei, Helmut
    WAVELETS AND SPARSITY XVII, 2017, 10394
  • [23] Development of Convolutional Neural Networks (CNNs) for Feature Extraction
    Eikmeier, Nicole
    Westerkamp, Rachel
    Zelnio, Edmund
    ALGORITHMS FOR SYNTHETIC APERTURE RADAR IMAGERY XXV, 2018, 10647
  • [24] Deep Convolutional Neural Networks as Generic Feature Extractors
    Hertel, Lars
    Barth, Erhardt
    Kaester, Thomas
    Martinetz, Thomas
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [25] Discriminative Feature Extraction with Deep Neural Networks
    Stuhlsatz, Andre
    Lippel, Jens
    Zielke, Thomas
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [26] IMPROVING THE CAPACITY OF VERY DEEP NETWORKS WITH MAXOUT UNITS
    Oyedotun, Oyebade K.
    Shabayek, Abd El Rahman
    Aouada, Djamila
    Ottersten, Bjorn
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2971 - 2975
  • [27] Feature extraction and classification using deep convolutional neural networks, PCA and SVC for face recognition
    Benkaddour, Mohammed Kamel
    Bounoua, Abdennacer
    TRAITEMENT DU SIGNAL, 2017, 34 (1-2) : 77 - 91
  • [28] Context extraction module for deep convolutional neural networks
    Singh, Pravendra
    Mazumder, Pratik
    Namboodiri, Vinay P.
    PATTERN RECOGNITION, 2022, 122
  • [29] Deep Convolutional Neural Networks for Sign Language Recognition
    Rao, G. Anantha
    Syamala, K.
    Kishore, P. V. V.
    Sastry, A. S. C. S.
    2018 CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION ENGINEERING SYSTEMS (SPACES), 2018, : 194 - 197
  • [30] Improving Performance of Convolutional Neural Networks via Feature Embedding
    Ghoshal, Torumoy
    Zhang, Silu
    Dang, Xin
    Wilkins, Dawn
    Chen, Yixin
    PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019), 2019, : 31 - 38