Towards the implementation of an Attention-based Neural Machine Translation with artificial pronunciation for Nahuatl as a mobile application

被引:1
作者
Bello Garcia, Sergio Khalil [1 ]
Sanchez Lucero, Eduardo [1 ]
Pedroza Mendez, Blanca Estela [1 ]
Hernandez Hernandez, Jose Crispin [1 ]
Bonilla Huerta, Edmundo [1 ]
Ramirez Cruz, Jose Federico [1 ]
机构
[1] Inst Tecnol Apizaco ITA, Div Estudios Posgrad & Invest, Tecnol Nacl Mexico TecNM, Tlaxcala, Mexico
来源
2020 8TH EDITION OF THE INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION (CONISOFT 2020) | 2020年
关键词
Nahuatl; NMT; mobile; translation; attention; machine learning; CoreML; neural network; Mel spectrogram;
D O I
10.1109/CONISOFT50191.2020.00041
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
There are great translation systems online. However, even though this technology is available for the majority of languages, it is not the case of Nahuatl [1]. For this reason, this paper outlines a master's degree thesis proposal which is aimed to use a Neural Network for Translation with an attention mechanism and Long Short-Term Memory (LSTM) like the one used by Google [2]. In addition, it seeks to implement an artificial Text To Speech (TTS) system trained with a Neural Network with a given dataset of Mel spectrograms in [3] from a person speaking in Nahuatl and it attempts to achieve a natural voice output as a spectrogram, then process it and obtain the sound desired. Finally, once trained, these models can be prepared for being used within mobile devices, and even taking advantage of the neural engine some of them are equipped with. In this way, this technology can reach more people and help to preserve and even spread the language. The early results showed how the limited resources of this language could cause a strong bias in the outputs and also how there could be some loss of information given the morphemes Nahuatl has, given its polysynthetic nature. This also highlights the way it can be tokenized, playing an important role in how the results turn out obtaining a BLEU score of 0.34 at best. Finally, this application and research can be an interesting framework of how a polysynthetic language can be manipulated to be used for fusional languages like Spanish or English. This research work was carried out at the "Tecnologico Nacional de Mexico" (TecNM), campus of the "Instituto Tecnologico de Apizaco" (ITA)
引用
收藏
页码:235 / 244
页数:10
相关论文
共 28 条
[1]  
A. Inc, 2019, CONV TRAIN MOD COR M CONV TRAIN MOD COR M
[2]  
A. Inc, 2020, LANG SUPP LANG SUPP
[3]  
[Anonymous], 2019, LANGUAGE MODELS ARE
[4]  
[Anonymous], 2010, Commun. Surveys Tuts., DOI DOI 10.1038/nature14539
[5]  
[Anonymous], 2016, P 9 ISCA WORKSH SPEE
[6]  
ARM, ARM ARCHITECTURE REF, P32
[7]  
Bahdanau D, 2014, 3 INT C LEARN REPR
[8]  
Banerjee D., 2018, A microarchitectural study on apple's a11 bionic processor
[9]  
Carolina E., 2015, RECOPILACION CORPUS
[10]  
Choi H., 2018, ABS180311407 CORR ABS180311407 CORR, P11407