Speech and Hands-free Interaction: Myths, Challenges, and Opportunities

Cited by: 0
Authors
Munteanu, Cosmin [1]
Penn, Gerald [2]
Affiliations
[1] Univ Toronto Mississauga, Inst Commun Culture Informat & Technol, Mississauga, ON, Canada
[2] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
Source
PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION WITH MOBILE DEVICES AND SERVICES (MOBILEHCI '17) | 2017
Keywords
H.5.2 [User interfaces]: Voice I/O; Natural language; User-centered design; Evaluation/methodology
DOI
10.1145/3098279.3119919
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
HCI research has long been dedicated to facilitating information transfer between humans and machines in better and more natural ways. Unfortunately, humans' most natural form of communication, speech, is also one of the most difficult modalities for machines to understand, despite, and perhaps because of, its being the highest-bandwidth communication channel we possess. While significant research efforts, from engineering to linguistics to the cognitive sciences, have been spent on improving machines' ability to understand speech, the MobileHCI community (and the HCI field at large) has been relatively timid in embracing this modality as a central focus of research. This can be attributed in part to the unexpected variations in error rates when processing speech, in contrast with often-unfounded claims of success from industry, but also to the intrinsic difficulty of designing and especially evaluating speech and natural language interfaces. As such, the development of interactive speech-based systems is mostly driven by engineering efforts to improve such systems with respect to largely arbitrary performance metrics. Such developments have often been devoid of any user-centered design principles or consideration for usability or usefulness. The goal of this course is to inform the MobileHCI community of the current state of speech and natural language research, to dispel some of the myths surrounding speech-based interaction, and to provide an opportunity for researchers and practitioners to learn more about how speech recognition and speech synthesis work, what their limitations are, and how they could be used to enhance current interaction paradigms. Through this, we hope that HCI researchers and practitioners will learn how to combine recent advances in speech processing with user-centred principles in designing more usable and useful speech-based interactive systems.
Pages: 4