Experiments in Character-level Neural Network Models for Punctuation

被引:7
作者
Gale, William [1 ]
Parthasarathy, Sarangarajan [2 ]
机构
[1] Univ Adelaide, Adelaide, SA, Australia
[2] Microsoft, Redmond, WA USA
来源
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年
关键词
speech recognition; punctuation prediction; neural networks;
D O I
10.21437/Interspeech.2017-1710
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We explore character-level neural network models for inferring punctuation from text-only input. Punctuation inference is treated as a sequence tagging problem where the input is a sequence of un-punctuated characters, and the output is a corresponding sequence of punctuation tags. We experiment with six architectures, all of which use a long short-term memory (LSTM) network for sequence modeling. They differ in the way the context and lookahead for a given character is derived: from simple character embedding and delayed output to enable lookahead, to complex convolutional neural networks (CNN) to capture context. We demonstrate that the accuracy of proposed character-level models are competitive with the accuracy of a state-of-the-art word-level Conditional Random Field (CRF) baseline with carefully crafted features.
引用
收藏
页码:2794 / 2798
页数:5
相关论文
共 50 条
[41]   A general steganographic framework for neural network models [J].
Yang, Ziyun ;
Wang, Zichi ;
Zhang, Xinpeng .
INFORMATION SCIENCES, 2023, 643
[42]   Neural network models in a set selection problem [J].
Romanovsky, YR ;
Brovko, ED .
NEURAL NETWORKS, 1999, 12 (01) :135-143
[43]   Autoprogressive training of neural network constitutive models [J].
Ghaboussi, J ;
Pecknold, DA ;
Zhang, MF ;
Haj-Ali, RM .
INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 1998, 42 (01) :105-126
[44]   Neural network models for breast cancer prognosis [J].
Ripley, RM ;
Harris, AL ;
Tarassenko, L .
NEURAL COMPUTING & APPLICATIONS, 1998, 7 (04) :367-375
[45]   Model structure determination in neural network models [J].
Henrique, HM ;
Lima, EL ;
Seborg, DE .
CHEMICAL ENGINEERING SCIENCE, 2000, 55 (22) :5457-5469
[46]   The dual of the space of interactions in neural network models [J].
De Martino, D. .
INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2016, 27 (06)
[47]   Neural network models for time series forecasts [J].
Hill, T ;
OConnor, M ;
Remus, W .
MANAGEMENT SCIENCE, 1996, 42 (07) :1082-1092
[48]   SCALING RECURRENT NEURAL NETWORK LANGUAGE MODELS [J].
Williams, Will ;
Prasad, Niranjani ;
Mrva, David ;
Ash, Tom ;
Robinson, Tony .
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, :5391-5395
[49]   Why Scholars Are Diagramming Neural Network Models [J].
Marshall, Guy Clarke ;
Jay, Caroline ;
Freitas, Andre .
DIAGRAMMATIC REPRESENTATION AND INFERENCE, DIAGRAMS 2022, 2022, 13462 :202-209
[50]   NEURAL NETWORK MODELS AND THE PREDICTION OF BANK BANKRUPTCY [J].
TAM, KY .
OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 1991, 19 (05) :429-445