Evaluating conversational recommender systems A landscape of research

被引:17
作者
Jannach, Dietmar [1 ]
机构
[1] Univ Klagenfurt, Klagenfurt Am Worthersee, Austria
关键词
Conversational recommender systems; Dialogue systems; Interactive systems; Evaluation; E-COMMERCE; GENERATION; QUALITY; MODEL;
D O I
10.1007/s10462-022-10229-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conversational recommender systems aim to interactively support online users in their information search and decision-making processes in an intuitive way. With the latest advances in voice-controlled devices, natural language processing, and AI in general, such systems received increased attention in recent years. Technically, conversational recommenders are usually complex multi-component applications and often consist of multiple machine learning models and a natural language user interface. Evaluating such a complex system in a holistic way can therefore be challenging, as it requires (i) the assessment of the quality of the different learning components, and (ii) the quality perception of the system as a whole by users. Thus, a mixed methods approach is often required, which may combine objective (computational) and subjective (perception-oriented) evaluation techniques. In this paper, we review common evaluation approaches for conversational recommender systems, identify possible limitations, and outline future directions towards more holistic evaluation practices.
引用
收藏
页码:2365 / 2400
页数:36
相关论文
共 116 条
[1]   Multistakeholder recommendation: Survey and research directions [J].
Abdollahpouri, Himan ;
Adomavicius, Gediminas ;
Burke, Robin ;
Guy, Ido ;
Jannach, Dietmar ;
Kamishima, Toshihiro ;
Krasnodebski, Jan ;
Pizzato, Luiz .
USER MODELING AND USER-ADAPTED INTERACTION, 2020, 30 (01) :127-158
[2]   Effects of Online Recommendations on Consumers' Willingness to Pay [J].
Adomavicius, Gediminas ;
Bockstedt, Jesse C. ;
Curley, Shawn P. ;
Zhang, Jingjing .
INFORMATION SYSTEMS RESEARCH, 2018, 29 (01) :84-102
[3]  
Angara P.P., 2020, P 30 ANN INT C COMP, P247
[4]  
Argal A, 2018, 2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), P176, DOI 10.1109/CCWC.2018.8301732
[5]   Resilient Chatbots: Repair Strategy Preferences for Conversational Breakdowns [J].
Ashktorab, Zahra ;
Jain, Mohit ;
Liao, Q. Vera ;
Weisz, Justin D. .
CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
[6]  
Averjanova Olga, 2008, Second International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies 2008, P212, DOI 10.1109/UBICOMM.2008.16
[7]  
Baizal ZA, 2017, INF COMM TECHN ICOIC, P1, DOI DOI 10.1109/ICOICT.2017.8074656
[8]  
Balaraman V, 2021, SIGDIAL 2021: 22ND ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2021), P239
[9]  
Burke R, 1999, SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), P844
[10]  
Burke RD, 1996, PROCEEDINGS OF THE THIRTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE, VOLS 1 AND 2, P462