MetaPETR: An Effective Model for Handling Class-Imbalanced Data About Event Temporal Relations

被引:1
作者
Zhang, Xiaobin [1 ,2 ]
Zang, Liangjun [1 ]
Liu, Qianwen [1 ,2 ]
Wei, Shuchong [1 ]
Hu, Songlin [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing 100085, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 100049, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT I, ICIC 2024 | 2024年 / 14875卷
关键词
Temporal Relation; Class Imbalance; Meta-Learning;
D O I
10.1007/978-981-97-5663-6_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal relation (TempRel) extraction is an important task in natural language processing, but limited high-quality training data hinder its performance. Moreover, existing datasets suffer from the class imbalance problem, causing the model's insufficient training forminority class labels. To enhance the training process, we have developed a cloze-based prompt template model that incorporates meta-learning. Additionally, we have devised an attentive re-weighting sampling strategy that leverages the training attention under the self-adjusting dice loss function. This finding highlights that significant improvements can effectively improve the model's performance through reasonable data sampling strategies, appropriate training methods, and suitable loss functions. Across three predominant datasets in this field, our method demonstrates state-of-the-art performances on TB-Dense and MATRES. For TDDiscourse, we achieve top results on one subset and second-best on the other. Moreover, we observed that meta-learning effectively improves recall for minority class labels, while dice loss further enhances the precision across all labels.
引用
收藏
页码:390 / 401
页数:12
相关论文
共 23 条
[1]  
Cassidy T, 2014, PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P501
[2]   Classifying Temporal Relations by Bidirectional LSTM over Dependency Paths [J].
Cheng, Fei ;
Miyao, Yusuke .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, :1-6
[3]  
Finn C, 2017, PR MACH LEARN RES, V70
[4]  
Han RJ, 2021, 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), P5367
[5]  
Huang QZ, 2023, PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, P9631
[6]  
Li XY, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P465, DOI 10.1007/978-981-15-3863-6_51
[7]  
Liu J, 2021, PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, P3871
[8]  
Ma MD, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: DEMONSTRATIONS (NAACL-HLT 2021), P56
[9]  
Man H., 2022, AAAI C ART INT INT
[10]  
Mathur P, 2021, ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, P524