Speech Technologies for Advanced Applications in Service Robotics

Cited by: 0
Authors
Ondas, Stanislav [1]
Juhar, Jozef [1]
Pleva, Matus [1]
Lojka, Martin [1]
Kiktova, Eva [1]
Sulir, Martin [1]
Cizmar, Anton [1]
Holcer, Roland [2]
Affiliations
[1] Tech Univ Kosice, FEI, Dept Elect & Multimedia Commun, Kosice 04120, Slovakia
[2] Res Dev Design & Supply Co, ZTS VVU KOSICE As, Kosice 04124, Slovakia
Keywords
service robots; speech technologies; speech recognition; speech synthesis; multimodal interface; acoustic event detection; classification
DOI
Not available
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
A multimodal interface was designed for controlling the functions of a complex modular robotic system that can be deployed in difficult conditions such as rescue work, natural disasters, fires, and decontamination. The interface involves several fundamental technologies: speech recognition, speech synthesis, and dialogue management. To enable a human operator to cooperate with the robotic system, a sophisticated architecture was designed and these technologies were implemented. The automatic speech recognition system is based on hidden Markov models and allows the operator to control the system's functions with a set of voice commands. A text-to-speech system was prepared to give feedback to the operator, and a dialogue manager was adopted to handle the information exchange between the operator and the robotic system. The proposed system is further extended with an acoustic event detection system, which consists of a set of five microphones integrated on the robotic vehicle, a post-processing unit, and a detection unit.
Pages: 45-61
Number of pages: 17