Cross-Lingual Transfer Learning for Medical Named Entity Recognition

被引:2
作者
Ding, Pengjie [1 ,2 ]
Wang, Lei [2 ]
Liang, Yaobo [3 ]
Lu, Wei [1 ]
Li, Linfeng [2 ,4 ]
Wang, Chun [6 ]
Tang, Buzhou [5 ]
Yan, Jun [2 ]
机构
[1] Renmin Univ China, Sch Informat & DEKE, Beijing, Peoples R China
[2] Yidu Cloud Beijing Technol Co Ltd, Beijing, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
[4] Beijing Jiaotong Univ, Inst Informat Sci, Beijing, Peoples R China
[5] Harbin Inst Technol, Shenzhen, Peoples R China
[6] China Med Univ, Dept Cardiac Surg, Hosp 1, Shenyang, Peoples R China
来源
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I | 2020年 / 12112卷
关键词
Transfer learning; Cross-lingual pretraining; Word embedding alignment; Medical terminology systems; Medical NER; INFORMATION EXTRACTION; SYSTEM;
D O I
10.1007/978-3-030-59410-7_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extensive technologies have been employed to explore a best way for cross-lingual transfer learning. In medical domain, Named Entity Recognition is pivotal for many downstream tasks, such as medical entity linking and clinical decision support systems. Nevertheless, the lack of annotation limits the applicability in many languages without enough labeled data. To alleviate this issue and make use of languages with sufficient annotated data, we find a new way to obtain medical parallel corpus from medical terminology systems and knowledge bases and propose a methodology which combines cross-lingual language model pretraining and bilingual word embedding alignment with the help of the parallel corpus. Moreover, our combined architecture which maintains the framework of pretrained model can not only be used for NER task but also other downstream NLP tasks. Experiments demonstrated that incorporating Chinese and English medical data can effectively improve the performance for an English medical NER dataset (i2b2).
引用
收藏
页码:403 / 418
页数:16
相关论文
共 50 条
[21]   Transfer Learning for Arabic Named Entity Recognition With Deep Neural Networks [J].
Al-Smadi, Mohammad ;
Al-Zboon, Saad ;
Jararweh, Yaser ;
Juola, Patrick .
IEEE ACCESS, 2020, 8 :37736-37745
[22]   A Research Toward Chinese Named Entity Recognition Based on Transfer Learning [J].
Hui Kang ;
Jingwu Xiao ;
Yunpeng Zhang ;
Lei Zhang ;
Xu Zhao ;
Tie Feng .
International Journal of Computational Intelligence Systems, 16
[23]   A Research Toward Chinese Named Entity Recognition Based on Transfer Learning [J].
Kang, Hui ;
Xiao, Jingwu ;
Zhang, Yunpeng ;
Zhang, Lei ;
Zhao, Xu ;
Feng, Tie .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
[24]   Transfer Learning for Domain-Specific Named Entity Recognition in German [J].
Torge, Sunna ;
Hahn, Waldemar ;
Jaekel, Rene .
2020 6TH IEEE CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'20), 2020, :321-327
[25]   Dynamically Transfer Entity Span Information for Cross-domain Chinese Named Entity Recognition [J].
Wu B.-C. ;
Deng C.-L. ;
Guan B. ;
Chen X.-L. ;
Zan D.-G. ;
Chang Z.-J. ;
Xiao Z.-Y. ;
Qu D.-C. ;
Wang Y.-J. .
Ruan Jian Xue Bao/Journal of Software, 2022, 33 (10) :3776-3792
[26]   Domain Mismatch Doesn't Always Prevent Cross-Lingual Transfer Learning [J].
Edmiston, Daniel ;
Keung, Phillip ;
Smith, Noah A. .
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, :892-899
[27]   Cross-lingual transfer learning during supervised training in low resource scenarios [J].
Das, Amit ;
Hasegawa-Johnson, Mark .
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, :3531-3535
[28]   Cross-lingual learning for text processing: A survey [J].
Pikuliak, Matus ;
Simko, Marian ;
Bielikova, Maria .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
[29]   Pivot-based Candidate Retrieval for Cross-lingual Entity Linking [J].
Liu, Qian ;
Geng, Xiubo ;
Lu, Jie ;
Jiang, Daxin .
PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, :1076-1085
[30]   TL-NER: A Transfer Learning Model for Chinese Named Entity Recognition [J].
Peng, DunLu ;
Wang, YinRui ;
Liu, Cong ;
Chen, Zhang .
INFORMATION SYSTEMS FRONTIERS, 2020, 22 (06) :1291-1304