AwezaMed: A Multilingual, Multimodal Speech-To-Speech Translation Application for Maternal Health Care

Cited by: 0
Authors
Marais, Laurette [1]
Louw, Johannes A. [1]
Badenhorst, Jaco [1]
Calteaux, Karen [1]
Wilken, Ilana [1]
van Niekerk, Nina [1]
Stein, Glenn [2]
Affiliations
[1] CSIR, Next Generat Enterprises & Inst, Digital Audio Visual Technol Res Grp, Pretoria, South Africa
[2] Aweza, Cape Town, South Africa
Source
PROCEEDINGS OF 2020 23RD INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2020) | 2020
Keywords
speech-to-speech translation; machine translation; automatic speech recognition; text-to-speech; mobile application
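DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract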
The language contexts of multilingual developing countries such as South Africa are often characterised by communication challenges resulting from language differences. AwezaMed is a multilingual, multimodal speech-to-speech translation application for the health care domain, which was designed to assist in bridging communication barriers and mitigating the risks of miscommunication. The application focuses on the domain of maternal health care. It uses English as the source language and Afrikaans, isiXhosa and isiZulu as target languages to enable health care providers to communicate with patients in their own language. It incorporates automatic speech recognition, machine translation and text-to-speech to deliver speech-to-speech translation functionality in a scalable way via a REST API to an Android mobile application. It is being piloted at various health care facilities across South Africa.
Pages: 669-676
Page count: 8
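
The abstract describes a cascade of automatic speech recognition, machine translation and text-to-speech. The sketch below shows how such a cascade might be chained for the one source and three target languages it names. It is illustrative only: the class, the function names, the stub stages and the language codes as a configuration choice are all assumptions, and nothing here reflects the actual AwezaMed implementation or its REST API.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stage signatures (assumptions, not taken from the paper).
Recognizer = Callable[[bytes], str]        # English audio -> English text
Translator = Callable[[str, str], str]     # (English text, target code) -> target text
Synthesizer = Callable[[str, str], bytes]  # (target text, target code) -> audio

# ISO 639-1 codes for Afrikaans, isiXhosa and isiZulu.
TARGET_LANGUAGES = {"af", "xh", "zu"}


@dataclass
class SpeechToSpeechPipeline:
    """Cascaded speech-to-speech translation: ASR, then MT, then TTS."""
    recognize: Recognizer
    translate: Translator
    synthesize: Synthesizer

    def run(self, audio: bytes, target: str) -> dict:
        if target not in TARGET_LANGUAGES:
            raise ValueError(f"unsupported target language: {target}")
        source_text = self.recognize(audio)
        target_text = self.translate(source_text, target)
        target_audio = self.synthesize(target_text, target)
        # Returning the intermediate text lets a client show it next to the audio.
        return {
            "source_text": source_text,
            "target_text": target_text,
            "target_audio": target_audio,
        }


# Stub stages so the sketch runs without any speech models.
def stub_recognize(audio: bytes) -> str:
    return audio.decode("utf-8")  # pretends the audio bytes are a transcript


def stub_translate(text: str, target: str) -> str:
    return f"[{target}] {text}"   # tags the text instead of translating it


def stub_synthesize(text: str, target: str) -> bytes:
    return text.encode("utf-8")   # pretends the encoded text is audio


pipeline = SpeechToSpeechPipeline(stub_recognize, stub_translate, stub_synthesize)
result = pipeline.run(b"How are you feeling today?", "zu")
print(result["target_text"])
```

Keeping each stage behind a plain callable means a real recognizer, translator or synthesizer could replace a stub without changing the pipeline itself.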