An AAC Application for Generating Japanese Response Phrases Using GPT-4

被引:0
作者
Kitayama, Suzuna [1 ]
Hirotomi, Tetsuya [2 ]
机构
[1] Shimane Univ, Interdisciplinary Fac Sci & Engn, 1060 Nishikawatsu Cho, Matsue, Shimane 6908504, Japan
[2] Shimane Univ, Inst Sci & Engn, Acad Assembly, 1060 Nishikawatsu Cho, Matsue, Shimane 6908504, Japan
来源
COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PT II, ICCHP 2024 | 2024年 / 14751卷
关键词
Augmented and Alternative Communication (AAC); GPT-4; Cerebral Palsy; Response generation;
D O I
10.1007/978-3-031-62849-8_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As augmentative and alternative communication (AAC) users increasingly share their anecdotes, experiences, and jokes, communication with them can become more interactive. We developed a new AAC application that uses speech recognition and the Generative Pre-trained Transformer 4 (GPT-4) to generate four types of Japanese response phrases: boke, tsukkomi, neutral, and backchannel phrases, where boke and tsukkomi phrases are known as Japanese jokes often used in "manzai." A 22-year-old Japanese man with cerebral palsy participated in the development as an AAC user. We conducted a demonstration case study of four sessions with him, each lasting 20 to 30 min. During these sessions, our application was used in peer-to-peer conversations with coevals and opinions for further improvements were solicited. The results were summarized as follows: (1) Our application could generate four types of Japanese phrases in a mean duration of 5.37 s (SD 2.47) after the partner's utterance; (2) The participant selected one tsukkomi, two boke, and four neutral phrases. The mean duration to produce the selected utterance from the end of the partner's utterance was 17.52 s (SD 14.71); (3) When the participant presented the generated Japanese jokes, the participant laughed and nodded at the punctuation pause or at the end of the speech output.
引用
收藏
页码:144 / 152
页数:9
相关论文
共 12 条
[1]  
Agarwal S., 2024, Gpt-4 technical report
[2]  
Beukelman DR., 2020, Augmentative and alternative communication: supporting children and adults with complex communication needs, V5th ed.
[3]  
Cook A.M., 2001, ASSISTIVE TECHNOLOGI, V2nd
[4]  
Dean Ruth Anne Kinsman, 2004, Palliat Support Care, V2, P139
[5]  
Dybala P., 2010, PUNDA numbears: proposal of goroawase generating system for Japanese, P345
[6]  
Go K., 2023, Trans. Virtual Reality Soc. Jpn., V28, P43, DOI [10.18974/tvrsj.28.143, DOI 10.18974/TVRSJ.28.143]
[7]  
International Society for Augmentative and Alternative Communication, 2014, About AAC?
[8]  
Lazar J., 2017, Research Methods in Human Computer Interaction, V7, P153
[9]   KWickChat: A Multi-Turn Dialogue System for AAC Using Context-Aware Sentence Generation by Bag-of-Keywords [J].
Shen, Junxiao ;
Yang, Boyin ;
Dudley, John ;
Kristensson, Per Ola .
IUI'22: 27TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2022, :853-867
[10]   Whole utterance approaches in AAC [J].
Todman, John ;
Alm, Norman ;
Higginbotham, Jeff ;
File, Portia .
AUGMENTATIVE AND ALTERNATIVE COMMUNICATION, 2008, 24 (03) :235-254