Automatic summarization of open-domain multiparty dialogues in diverse genres

被引:57
|
作者
Zechner, K [1 ]
机构
[1] Educ Testing Serv, Princeton, NJ 08541 USA
关键词
D O I
10.1162/089120102762671945
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic summarization of open-domain spoken dialogues is a relatively new research area. This article introduces the task and the challenges involved and motivates and presents an approach for obtaining automatic-extract summaries for human transcripts of multiparty dialogues of four different genres, without any restriction on domain. We address the following issues, which are intrinsic to spoken-dialogue summarization and typically can be ignored when summarizing written text such as news wire data: (1) detection and removal of speech disfluencies; (2) detection and insertion of sentence boundaries; and (3) detection and linking of cross-speaker information units (question-answer pairs). A system evaluation is performed using a corpus of 23 dialogue excerpts with an average duration of about 10 minutes, comprising 80 topical segments and about 47,000 words total. The corpus was manually annotated for relevant text spans by six human annotators. The global evaluation shows that for the two more informal genres, our summarization system using dialogue-specific components significantly outperforms two baselines: (1) a maximum-marginal-relevance ranking algorithm using TF*IDF term weighting, and (2) a LEAD baseline that extracts the first n words from a text.
引用
收藏
页码:447 / 485
页数:39
相关论文
共 50 条
  • [21] Personality prediction from task-oriented and open-domain human-machine dialogues
    Guo, Ao
    Hirai, Ryu
    Ohashi, Atsumoto
    Chiba, Yuya
    Tsunomori, Yuiko
    Higashinaka, Ryuichiro
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [22] Adapting Generative Pre-trained Language Model for Open-domain Multimodal Sentence Summarization
    Lin, Dengtian
    Jing, Liqiang
    Song, Xuemeng
    Liu, Meng
    Sun, Teng
    Nie, Liqiang
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 195 - 204
  • [23] Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems
    Ghazarian, Sarik
    Weischedel, Ralph
    Galstyan, Aram
    Peng, Nanyun
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 7789 - 7796
  • [24] ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain Dialogue Systems
    Ghazarian, Sarik
    Shao, Yijia
    Han, Rujun
    Galstyan, Aram
    Peng, Nanyun
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4398 - 4419
  • [25] PONE: A Novel Automatic Evaluation Metric for Open-domain Generative Dialogue Systems
    Lan, Tian
    Mao, Xian-Ling
    Wei, Wei
    Gao, Xiaoyan
    Huang, Heyan
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2020, 39 (01)
  • [26] Advances in open-domain question answering
    Zhang, Zhi-Chang
    Zhang, Yu
    Liu, Ting
    Li, Sheng
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (05): : 1058 - 1069
  • [27] vBLEu: Uncertainty-Aware Automatic Evaluation Method for Open-Domain Dialogue Systems
    Tsuta, Yuma
    Yoshinaga, Naoki
    Toyoda, Masashi
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): STUDENT RESEARCH WORKSHOP, 2020, : 199 - 206
  • [28] Recipes for Building an Open-Domain Chatbot
    Roller, Stephen
    Dinan, Emily
    Goyal, Naman
    Ju, Da
    Williamson, Mary
    Liu, Yinhan
    Xu, Jing
    Ott, Myle
    Shuster, Kurt
    Smith, Eric M.
    Boureau, Y-Lan
    Weston, Jason
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 300 - 325
  • [29] Unsupervised Open-domain Keyphrase Generation
    Lam Thanh Do
    Akash, Pritom Saha
    Chang, Kevin Chen-Chuan
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 10614 - 10627
  • [30] Towards Open-Domain Topic Classification
    Ding, Hantian
    Yang, Jinrui
    Deng, Yuqian
    Zhang, Hongming
    Roth, Dan
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2022, : 90 - 98