MULTITURNCLEANUP: A Benchmark for Multi-Turn Spoken Conversational Transcript Cleanup

被引:0
|
作者
Shen, Hua [1 ,2 ]
Zayats, Vicky [2 ]
Rocholl, Johann C. [2 ]
Walker, Daniel D. [2 ]
Padfield, Dirk [2 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Google Res, Mountain View, CA 94043 USA
来源
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023) | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current disfluency detection models focus on individual utterances each from a single speaker. However, numerous discontinuity phenomena in spoken conversational transcripts occur across multiple turns, which can not be identified by disfluency detection models. This study addresses these phenomena by proposing an innovative Multi-Turn Cleanup task for spoken conversational transcripts and collecting a new dataset, MultiTurnCleanup(1). We design a data labeling schema to collect the high-quality dataset and provide extensive data analysis. Furthermore, we leverage two modeling approaches for experimental evaluation as benchmarks for future research.
引用
收藏
页码:9895 / 9903
页数:9
相关论文
共 50 条
  • [1] An Empirical Analysis on Multi-turn Conversational Recommender Systems
    Zhang, Lu
    Li, Chen
    Lei, Yu
    Sun, Zhu
    Liu, Guanfeng
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 841 - 851
  • [2] System Initiative Prediction for Multi-turn Conversational Information Seeking
    Meng, Chuan
    Aliannejadi, Mohammad
    de Rijke, Maarten
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 1807 - 1817
  • [3] System Initiative Prediction for Multi-turn Conversational Information Seeking
    Meng, Chuan
    Aliannejadi, Mohammad
    de Rijke, Maarten
    International Conference on Information and Knowledge Management, Proceedings, 2023, : 1807 - 1817
  • [4] KNOWLEDGE AUGMENTED BERT MUTUAL NETWORK IN MULTI-TURN SPOKEN DIALOGUES
    Wu, Ting-Wei
    Juang, Biing-Hwang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7487 - 7491
  • [5] LEARNING SEMANTIC-MATCHING ATTENTION FOR MULTI-TURN SPOKEN LANGUAGE UNDERSTANDING
    Jiang, Ming
    Cao, Kaiqiang
    Li, Pengfei
    Zhang, Min
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2021, 83 (01): : 175 - 186
  • [6] Learning semantic-matching attention for multi-turn spoken language understanding
    Jiang, Ming
    Cao, Kaiqiang
    Li, Pengfei
    Zhang, Min
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2021, 83 (01): : 175 - 186
  • [7] UX Design and Evaluation on Conversational Bot Supporting Multi-Turn and Multi-Domain Dialogues
    Bang, Junseong
    Ahn, Sunghee
    2022 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON22), 2022, : 92 - 97
  • [8] Tri-level Joint Natural Language Understanding for Multi-turn Conversational Datasets
    Weld, Henry
    Hu, Sijia
    Long, Siqu
    Poon, Josiah
    Han, Soyeon Caren
    INTERSPEECH 2023, 2023, : 700 - 704
  • [9] A Simple but Effective Method to Incorporate Multi-turn Context with BERT for Conversational Machine Comprehension
    Ohsugi, Yasuhito
    Saito, Itsumi
    Nishida, Kyosuke
    Asano, Hisako
    Tomita, Junji
    NLP FOR CONVERSATIONAL AI, 2019, : 11 - 17
  • [10] Adaptive Global-Local Context Fusion for Multi-Turn Spoken Language Understanding
    Thanh Tran
    Wei, Kai
    Ruan, Weitong
    McGowan, Ross
    Susanj, Nathan
    Strimel, Grant P.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 12622 - 12628