Text Error Correction Method in the Construction Industry Based on Transfer Learning

被引:1
作者
Hou, Zhenguo [1 ]
Yang, Weitao [1 ]
He, Haiying [1 ]
Zhang, Peicong [1 ]
Wang, Ziyu [2 ]
Ji, Xiaosheng [3 ]
机构
[1] China Construct Seventh Engn Bur Co Ltd, Zhengzhou 450000, Henan, Peoples R China
[2] Hohai Univ, Ind Technol Res Inst, Changzhou 213022, Jiangsu, Peoples R China
[3] Hohai Univ, Coll IoT Engn, Changzhou 213022, Jiangsu, Peoples R China
来源
COMMUNICATIONS AND NETWORKING (CHINACOM 2021) | 2022年
关键词
Text error correction; Transfer learning; BERT model; Multi-domain text;
D O I
10.1007/978-3-030-99200-2_22
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Text error correction is of great value in the review of texts in the construction industry. For construction industry texts, which are compound texts with multi-domain proper nouns, the lack of labeled data leads to poor error correction algorithms based on deep learning. For this reason, this paper proposes a text error correction method in the construction industry based on transfer learning. Based on the pre-trained BERT model, we transfer some parameters to the target error correction model after unsupervised training by unlabeled related field dataset, and then retrain the model through the training samples of the construction document corpus dataset to obtain better error correction effects. Meanwhile, we dynamically adjust the pre-training task in transfer learning to improve the performance of the word order correction task. Experimental results show that our proposed model has higher precision rate, recall rate and lower false positive rate in the error correction task than other models.
引用
收藏
页码:277 / 290
页数:14
相关论文
共 15 条
[1]   COMPARATIVE ANALYSES OF BERT, ROBERTA, DISTILBERT, AND XLNET FOR TEXT-BASED EMOTION RECOGNITION [J].
Adoma, Acheampong Francisca ;
Henry, Nunoo-Mensah ;
Chen, Wenyu .
2020 17TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2020, :117-121
[2]  
Duan J., 2016, NEW TECHNOL LIB INF, V2, P34
[3]   Discriminative Fisher Embedding Dictionary Transfer Learning for Object Recognition [J].
Fan, Zizhu ;
Shi, Linrui ;
Liu, Qiang ;
Li, Zhengming ;
Zhang, Zheng .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (01) :64-78
[4]  
Fu K., 2018, CCF INT C NATURAL LA
[5]  
Huan Liang, 2019, 2019 IEEE 19th International Conference on Communication Technology (ICCT), P1516, DOI 10.1109/ICCT46805.2019.8947072
[6]  
Jing MX, 2019, Arxiv, DOI arXiv:1805.04686
[7]  
Kaliyar RK, 2020, PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, P336, DOI [10.1109/Confluence47617.2020.9058044, 10.1109/confluence47617.2020.9058044]
[8]  
Li Q., 2021, YANGTZE RIVER INFORM
[9]  
Mahima Y., 2020, 2020 12th International Conference on Advanced Infocomm Technology (ICAIT), P119, DOI 10.1109/ICAIT51223.2020.9315570
[10]  
Singh Shashank, 2018, 2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA), P1076, DOI 10.1109/ICECA.2018.8474700