A Human Machine Interface Framework for Autonomous Vehicle Control

Cited by: 0
Authors
Nakagawa, Takuma [1]
Nishimura, Ryota [1]
Iribe, Yurie [2]
Ishiguro, Yoshio [3]
Ohsuga, Shin [4]
Kitaoka, Norihide [1]
Affiliations
[1] Tokushima Univ, Tokushima, Japan
[2] Aichi Prefectural Univ, Nagakute, Aichi, Japan
[3] Nagoya Univ, Nagoya, Aichi, Japan
[4] Aisin Seiki Co Ltd, Kariya, Aichi, Japan
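Source
2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) | 2017
Funding
Japan Science and Technology Agency
Keywords
Multimodal interaction; speech; gesture; eye-gaze; finite state transducer; autonomous vehicle
DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification codes
0808; 0809
Abstract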
The recent development of autonomous vehicles has attracted much attention, and technologies that let people without technical knowledge operate autonomous vehicles easily are in high demand. We are developing an intuitive multimodal interface system that uses speech, gesture, and eye-gaze recognition. We designed the multimodal understanding component and the dialog control component of the interface system separately, using finite state transducers. In terms of implementation, conventional dialogue systems are controlled by a finite state transducer that takes user actions as inputs and produces system actions as outputs. Our multimodal understanding and dialog control components can be viewed as a cascade of two separate transducers, and cascaded transducers can be composed into a single transducer. Our system can operate a virtual car in an autonomous car simulator.
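
The cascade-and-compose idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the actual transducers, so every state name, input symbol, and action label below is hypothetical, and the `FST` class and `compose` function are illustrative helpers rather than the authors' implementation. The sketch is also restricted to deterministic transducers that emit exactly one output symbol per input symbol (no epsilon transitions), which is a simplifying assumption.

```python
from dataclasses import dataclass


@dataclass
class FST:
    """Deterministic transducer: one output symbol per input symbol."""
    start: object
    finals: frozenset
    arcs: dict  # (state, input symbol) -> (next state, output symbol)

    def run(self, inputs):
        """Return the output sequence, or None if the input is rejected."""
        state, outputs = self.start, []
        for sym in inputs:
            if (state, sym) not in self.arcs:
                return None
            state, out = self.arcs[(state, sym)]
            outputs.append(out)
        return outputs if state in self.finals else None


def compose(a, b):
    """Build one transducer equivalent to feeding a's output into b."""
    start = (a.start, b.start)
    arcs, seen, stack = {}, {start}, [start]
    while stack:
        qa, qb = stack.pop()
        for (src, x), (qa2, y) in a.arcs.items():
            # Keep an arc only if b can consume a's output symbol here.
            if src != qa or (qb, y) not in b.arcs:
                continue
            qb2, z = b.arcs[(qb, y)]
            nxt = (qa2, qb2)
            arcs[((qa, qb), x)] = (nxt, z)
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    finals = frozenset(s for s in seen if s[0] in a.finals and s[1] in b.finals)
    return FST(start, finals, arcs)


# Hypothetical multimodal understanding: recognizer events -> user actions.
understanding = FST("u0", frozenset({"u0"}), {
    ("u0", "speech:turn_left"): ("u0", "REQ_TURN_LEFT"),
    ("u0", "gesture:point_left"): ("u0", "REQ_TURN_LEFT"),
    ("u0", "speech:stop"): ("u0", "REQ_STOP"),
    ("u0", "speech:yes"): ("u0", "CONFIRM"),
})

# Hypothetical dialog control: user actions -> system actions.
dialog = FST("idle", frozenset({"idle"}), {
    ("idle", "REQ_TURN_LEFT"): ("confirming", "ASK_CONFIRM_LEFT"),
    ("confirming", "CONFIRM"): ("idle", "EXEC_TURN_LEFT"),
    ("idle", "REQ_STOP"): ("idle", "EXEC_STOP"),
})

events = ["gesture:point_left", "speech:yes", "speech:stop"]
cascaded = dialog.run(understanding.run(events))  # two transducers in sequence
composed = compose(understanding, dialog)         # one merged transducer
print(cascaded)
print(composed.run(events))
```

In this sketch the composed transducer maps recognizer events directly to system actions, producing the same output as running the two transducers one after the other on the accepted example sequence.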
Pages: 3