Improving Context Modeling in Neural Topic Segmentation

Authors
Xing, Linzi [1]
Hackinen, Brad [2]
Carenini, Giuseppe [1]
Trebbi, Francesco [3]
Affiliations
[1] Univ British Columbia, Vancouver, BC, Canada
[2] Ivey Business Sch, London, ON, Canada
[3] Univ Calif Berkeley, Berkeley, CA 94720 USA
Source
1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020) | 2020
Keywords
TEXT
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Topic segmentation is critical in key NLP tasks, and recent works favor highly effective neural supervised approaches. However, current neural solutions are arguably limited in how they model context. In this paper, we enhance a segmenter based on a hierarchical attention BiLSTM network to better model context by adding a coherence-related auxiliary task and restricted self-attention. Our optimized segmenter outperforms SOTA approaches when trained and tested on three datasets. We also show the robustness of our proposed model in a domain transfer setting by training a model on a large-scale dataset and testing it on four challenging real-world benchmarks. Furthermore, we apply our proposed strategy to two other languages (German and Chinese) and show its effectiveness in multilingual scenarios.
Pages: 626-636
Page count: 11