Adversarial training for named entity recognition of rail fault text

被引:0
作者
Qu, J. [1 ]
Su, S. [1 ,2 ]
Li, R. [1 ]
Wang, G. [3 ]
机构
[1] Beijing Jiaotong Univ, State Key Lab Traff Control & Safety, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Frontiers Sci Ctr Smart High Speed Railway Syst, Beijing, Peoples R China
[3] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08854 USA
来源
2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC) | 2021年
关键词
Rail fault texts; Named entity recognition; Adversarial training;
D O I
10.1109/ITSC48978.2021.9565087
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
At present, most rail faults in metro systems are recorded in the form of text. Due to the lack of effective mining and analysis tools, information in the massive textual data is not fully utilized. Learning from past fault texts and identifying some key concepts are essential to analyze faults and help decision making. In this paper, a word-enhanced adversarial training model (AT-BiLSTM-CRF) is proposed to address this problem. In this model, the named entity recognition (NER) is achieved by bi-directional long short-term memory (BiLSTM) with conditional random field (CRF). At the same time, the Chinese word segmentation (CWS) task is introduced to conduct adversarial training with the NER task. The structure of adversarial training is to make full use of the boundary information and filter out the noise caused by introducing the CWS task. More importantly, the experiments on five different train fault datasets are conducted in the rail field. The results show that the model performs better than the state-of-the-art baselines, which indicates it has the potential to lay the foundation for textual data analysis in the rail field.
引用
收藏
页码:1353 / 1358
页数:6
相关论文
共 50 条
[41]   Named Entity Recognition Algorithms Comparison For Judicial Text Data [J].
Aibek, Kuralbayev ;
Bobur, Mukhsimbayev ;
Abay, Bekbaganbetov ;
Hajiyev, Fuad .
2020 IEEE 14TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2020), 2020,
[42]   Persian Automatic Text Summarization Based on Named Entity Recognition [J].
Khademi, Mohammad Ebrahim ;
Fakhredanesh, Mohammad .
IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2020,
[43]   Named Entity Recognition in Vietnamese Text Using Label Propagation [J].
Huong Thanh Le ;
Rathany Chan Sam ;
Hoan Cong Nguyen ;
Thuy Thanh Nguyen .
2013 INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2013, :366-370
[44]   Object-Aware Multimodal Named Entity Recognition in Social Media Posts With Adversarial Learning [J].
Zheng, Changmeng ;
Wu, Zhiwei ;
Wang, Tao ;
Cai, Yi ;
Li, Qing .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :2520-2532
[45]   Chinese Named Entity Recognition Method for Domain-Specific Text [J].
Liu, He ;
Ma, Yuekun ;
Gao, Chang ;
Jia, Qi ;
Zhang, Dezheng .
TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2023, 30 (06) :1799-1808
[46]   Automatic Text Summarization using Document Clustering Named Entity Recognition [J].
Selvan, R. . Senthamizh ;
Arutchelvan, K. .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) :537-543
[47]   Radar technical language modeling with named entity recognition and text classification [J].
Zaunegger, Jackson S. ;
Singerman, Paul G. ;
Narayanan, Ram M. ;
O'Rourke, Sean M. ;
Rangaswamy, Muralidhar .
RADAR SENSOR TECHNOLOGY XXVI, 2022, 12108
[48]   A real time Named Entity Recognition system for Arabic text mining [J].
Al-Jumaily, Harith ;
Martinez, Paloma ;
Martinez-Fernandez, Jose L. ;
Van der Goot, Erik .
LANGUAGE RESOURCES AND EVALUATION, 2012, 46 (04) :543-563
[49]   Research on College Academic Text Named Entity Recognition and Dataset Construction [J].
He, Chen ;
Yuan, Yingchun ;
Wang, Kejian ;
Tao, Jia .
Computer Engineering and Applications, 2023, 59 (22) :322-328
[50]   Machine reading comprehension based named entity recognition for medical text [J].
Zhang, Ziqi ;
Zheng, Xiangwei ;
Zhang, Jinsong .
Multimedia Tools and Applications, 2025, 84 (28) :33431-33451