Improving Language-Universal Feature Extraction with Deep Maxout and Convolutional Neural Networks

被引:0
|
作者
Miao, Yajie [1 ]
Metze, Florian [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Sch Comp Sci, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
language-universal feature extraction; deep maxout networks; deep convolutional networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When deployed in automated speech recognition (ASR), deep neural networks (DNNs) can be treated as a complex feature extractor plus a simple linear classifier. Previous work has investigated the utility of multilingual DNNs acting as language-universal feature extractors (LUFEs). In this paper, we explore different strategies to further improve LUFEs. First, we replace the standard sigmoid nonlinearity with the recently proposed maxout units. The resulting maxout LUFEs have the nice property of generating sparse feature representations. Second, the convolutional neural network (CNN) architecture is applied to obtain more invariant feature space. We evaluate the performance of LUFEs on a cross-language ASR task. Each of the proposed techniques results in word error rate reduction compared with the existing DNN-based LUFEs. Combining the two methods together brings additional improvement on the target language.
引用
收藏
页码:800 / 804
页数:5
相关论文
共 50 条
  • [1] Improving deep convolutional neural networks with mixed maxout units
    Zhao, Hui-zhen
    Liu, Fu-xian
    Li, Long-yue
    PLOS ONE, 2017, 12 (07):
  • [2] Improving Deep Neural Networks with Multilayer Maxout Networks
    Sun, Weichen
    Su, Fei
    Wang, Leiquan
    2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, 2014, : 334 - 337
  • [3] COMO: Efficient Deep Neural Networks Expansion With COnvolutional MaxOut
    Zhao, Baoxin
    Xiong, Haoyi
    Bian, Jiang
    Guo, Zhishan
    Xu, Cheng-Zhong
    Dou, Dejing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1722 - 1730
  • [4] IMPROVING DEEP CONVOLUTIONAL NEURAL NETWORKS WITH UNSUPERVISED FEATURE LEARNING
    Kien Nguyen
    Fookes, Clinton
    Sridharan, Sridha
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2270 - 2274
  • [5] A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction
    Wiatowski, Thomas
    Bolcskei, Helmut
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (03) : 1845 - 1866
  • [6] Regularized Deep Convolutional Neural Networks for Feature Extraction and Classification
    Jayech, Khaoula
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT II, 2017, 10635 : 431 - 439
  • [7] Maxout neurons for deep convolutional and LSTM neural networks in speech recognition
    Cai, Meng
    Liu, Jia
    SPEECH COMMUNICATION, 2016, 77 : 53 - 64
  • [8] Deep Convolutional Neural Networks for Feature Extraction in Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    HUMAN-COMPUTER INTERACTION. RECOGNITION AND INTERACTION TECHNOLOGIES, HCI 2019, PT II, 2019, 11567 : 117 - 132
  • [9] Convolutional Deep Maxout Networks for Phone Recognition
    Toth, Laszlo
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1078 - 1082
  • [10] Convolutional Maxout Neural Networks for Speech Separation
    Hui, Like
    Cai, Meng
    Guo, Cong
    He, Liang
    Zhang, Wei-Qiang
    Liu, Jia
    2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 24 - 27