Application of Data Mining Methods in Internet of Things Technology for the Translation Systems in Traditional Ethnic Books

被引:8
作者
Luo, Yujing [1 ,2 ]
Xiang, Yueting [3 ]
机构
[1] Southwest Minzu Univ, Inst Southwest Minzu Res, Chengdu 610041, Peoples R China
[2] Sichuan Int Studies Univ, Coll Translat & Interpreting, Chongqing 400031, Peoples R China
[3] Xichang Univ, Sch Foreign Languages & Culture, Xichang 615000, Peoples R China
关键词
Internet of Things; traditional national classic; Corpora; translation; data mining; MACHINE TRANSLATION; ADAPTATION;
D O I
10.1109/ACCESS.2020.2994551
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to translate the ethnic classics, based on the research on the Internet of things, machine learning, and translation technology of ethnic classics, the log-linear model is combined with the national corpus scale and the grammatical structure characteristics, and the phrase statistical machine translation is used to establish a discontinuous phrase extraction model. Then, the translation technology is studied from the three aspects of model definition, training, and decoding. Finally, the algorithm is compared with the traditional phrase extraction algorithm to verify its effectiveness. The results show that the extraction number of discontinuous phrase extraction model is significantly higher than that of traditional phrase extraction model, and the model can extract more phrases, handle larger and more complex text, and score higher in translation fluency. From the evaluation indexes scores of Bilingual Evaluation Understudy (B.L.E.U.) and National Institute of Standards and Technology (N.I.S.T.), it can be found that the B.L.E.U. and N.I.S.T. values of the traditional phrase extraction algorithm are lower than those of the discontinuous phrase extraction model algorithm. The discontinuous phrase extraction algorithm can not only extract the regular continuous phrase, but also obtain the discontinuous text, and the translation effect is better. In conclusion, the combination of Internet of things and machine learning can be used in the translation of ethnic classics to achieve high-quality translation of discontinuous phrases, which is of guiding significance for the study of machine translation.
引用
收藏
页码:93398 / 93407
页数:10
相关论文
共 31 条
[1]  
[Anonymous], 2017, ARXIV170102901
[2]   Leveraging bilingual terminology to improve machine translation in a CAT environment [J].
Arcan, Mihael ;
Turchi, Marco ;
Tonelli, Sara ;
Buitelaar, Paul .
NATURAL LANGUAGE ENGINEERING, 2017, 23 (05) :763-788
[3]   Translation and validation of European Organization for Research and Treatment for Cancer quality of life questionnaire-OV-28 module into Indian languages (Hindi and Marathi) to study quality of life of ovarian cancer patients from a tertiary care cancer center [J].
Bajpai, Jyoti ;
Panda, Pankaj Kumar ;
Kagwade, Shraddha ;
Govilkar, Madhavi ;
Velaskar, Shruti ;
Kembhavi, Yogesh ;
Gupta, Sudeep ;
Ghosh, Jaya ;
Deodhar, Jayita .
SOUTH ASIAN JOURNAL OF CANCER, 2018, 7 (01) :37-41
[4]   Optimization Methods for Large-Scale Machine Learning [J].
Bottou, Leon ;
Curtis, Frank E. ;
Nocedal, Jorge .
SIAM REVIEW, 2018, 60 (02) :223-311
[5]  
Boyd-Graber J., 2017, ARXIV170707402
[6]   A simplification-translation-restoration framework for domain adaptation in statistical machine translation: A case study in medical record translation [J].
Chen, Han-Bin ;
Huang, Hen-Hsen ;
Hsieh, An-Chang ;
Chen, Hsin-Hsi .
COMPUTER SPEECH AND LANGUAGE, 2017, 42 :59-80
[7]   Discriminative ridge regression algorithm for adaptation in statistical machine translation [J].
Chinea-Rios, Mara ;
Sanchis-Trilles, German ;
Casacuberta, Francisco .
PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (04) :1293-1305
[8]   A preordering model based on phrasal dependency tree [J].
Farzi, Saeed ;
Faili, Heshaam ;
Kianian, Sahar .
DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2018, 33 (04) :748-765
[9]   Incident-Driven Machine Translation and Name Tagging for Low-resource Languages [J].
Hermjakob, Ulf ;
Li, Qiang ;
Marcu, Daniel ;
May, Jonathan ;
Mielke, Sebastian J. ;
Pourdamghani, Nima ;
Pust, Michael ;
Shi, Xing ;
Knight, Kevin ;
Levinboim, Tomer ;
Murray, Kenton ;
Chiang, David ;
Zhang, Boliang ;
Pan, Xiaoman ;
Lu, Di ;
Lin, Ying ;
Ji, Heng .
MACHINE TRANSLATION, 2018, 32 (1-2) :59-89
[10]   Improving SIMD Parallelism via Dynamic Binary Translation [J].
Hong, Ding-Yong ;
Liu, Yu-Ping ;
Fu, Sheng-Yu ;
Wu, Jan-Jan ;
Hsu, Wei-Chung .
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2018, 17 (03)