WARPED LANGUAGE MODELS FOR NOISE ROBUST LANGUAGE UNDERSTANDING

被引：4

作者：

Namazifar, Mahdi ^{[1
]}

Tur, Gokhan ^{[1
]}

Hakkani-Tur, Dilek ^{[1
]}

机构：

[1] Amazon Alexa AI, Seattle, WA 98109 USA

来源：

2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT) | 2021年

关键词：

D O I：

10.1109/SLT48900.2021.9383493

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Masked Language Models (MLM) are self-supervised neural networks trained to fill in the blanks in a given sentence with masked tokens. Despite the tremendous success of MLMs for various text based tasks, they are not robust for spoken language understanding, especially for spontaneous conversational speech recognition noise. In this work we introduce Warped Language Models (WLM) in which input sentences at training time go through the same modifications as in MLM, plus two additional modifications, namely inserting and dropping random tokens. These two modifications extend and contract the sentence in addition to the modifications in MLMs, hence the word "warped" in the name. The insertion and drop modification of the input text during training of WLM resemble the types of noise due to Automatic Speech Recognition (ASR) errors, and as a result WLMs are likely to be more robust to ASR noise. Through computational results we show that natural language understanding systems built on top of WLMs perform better compared to those built based on MLMs, especially in the presence of ASR errors.

引用

页码：981 / 988

页数：8

共 22 条

[1]

[Anonymous], 2019, ADV NEURAL INFORM PR

[2]

Chen Qian, 2019, BERT for joint intent classification and slot filling

[3]

Clark Kevin, 2020, ICLR

[4]

Dahl Deborah A., 1994, HUMAN LANGUAGE TECHN

[5]

De Meulder F, 2003, NAACL

[6]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

[7]

Hemphill C. T., 1990, SPEECH NATURAL LANGU

[8]

Hrinchuk O, 2020, INT CONF ACOUST SPEE, P7074, DOI [10.1109/ICASSP40776.2020.9053051, 10.1109/icassp40776.2020.9053051]

[9]

Huang CW, 2020, INT CONF ACOUST SPEE, P8009, DOI [10.1109/ICASSP40776.2020.9054689, 10.1109/icassp40776.2020.9054689]

[10]

Irie Kazuki, 2019, INTERSPEECH

← 1 2 3 →