The Power of AI-Generated Voices: How Digital Vocal Tract Length Shapes Product Congruency and Ad Performance

被引:16
作者
Efthymiou, Fotis [1 ]
Hildebrand, Christian [2 ]
de Bellis, Emanuel [3 ]
Hampton, William H. [4 ]
机构
[1] Univ St Gallen, Inst Behav Sci & Technol, St Gallen, Switzerland
[2] Univ St Gallen, Inst Behav Sci & Technol, Mkt Analyt, St Gallen, Switzerland
[3] Univ St Gallen, Inst Behav Sci & Technol, Empir Res Methods, St Gallen, Switzerland
[4] TikTok, San Jose, CA USA
关键词
sound symbolism; vocal tract length; speech synthesis; conversational agent; voice-product congruency; voice marketing; artificial intelligence; BODY-SIZE; FORMANT FREQUENCIES; EVOLUTION; HEIGHT; GENDER; PITCH; FOOD; COMMUNICATION; PREFERENCES; LISTENERS;
D O I
10.1177/10949968231194905
中图分类号
F [经济];
学科分类号
02 ;
摘要
Can AI-generated voices be designed to improve product and brand perceptions? Akin to human voices that evoke mental images in a listener even without visual cues, artificially generated voices can be intentionally designed to elicit envisioned mental representations. Drawing from prior work on sound symbolism and computational advances in speech synthesis, the authors explore how the voice of an AI-powered conversational agent (e.g., a voice assistant such as Amazon Alexa) impacts consumer perceptions and choice. Specifically, the authors examine how altering a conversational agent's digital vocal tract length (i.e., timbre) shapes consumers' physical ascriptions of the agent and subsequent voice-product congruency evaluations. Four experiments, including a large-scale field experiment, demonstrate that increasing (decreasing) the vocal tract length promotes congruency attributions toward stereotypically masculine (feminine) products and improves advertising performance (higher click-through rates and lower costs per click). This article represents a critical first step in deepening understanding on how artificially generated voices shape the consumer experience, demonstrating how firms could enhance product congruency perceptions and advertising performance by leveraging a more theory-driven approach to voice marketing.
引用
收藏
页码:117 / 134
页数:18
相关论文
共 101 条
[1]  
Abhang P.A., 2016, INTRO EEG AND SPEECH
[2]   LOUDNESS GROWTH IN 1/2-OCTAVE BANDS (LGOB) - A PROCEDURE FOR THE ASSESSMENT OF LOUDNESS [J].
ALLEN, JB ;
HALL, JL ;
JENG, PS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 88 (02) :745-753
[3]  
Alreck PamelaL., 1994, Journal of Product Brand Management, V3, P6, DOI [DOI 10.1108/10610429410053059, 10.1108/10610429410053059]
[4]  
Amazon, 2021, AM POLL DEV GUID
[5]   Now You've Got the Shiveries: Affect, Intimacy, and the ASMR Whisper Community [J].
Andersen, Joceline .
TELEVISION & NEW MEDIA, 2015, 16 (08) :683-700
[6]   Soundgen: An open-source tool for synthesizing nonverbal vocalizations [J].
Anikin, Andrey .
BEHAVIOR RESEARCH METHODS, 2019, 51 (02) :778-792
[7]   Understanding the Omnichannel Customer Journey: Determinants of Interaction Choice [J].
Barwitz, Niklas ;
Maas, Peter .
JOURNAL OF INTERACTIVE MARKETING, 2018, 43 :116-133
[8]   Machine Talk: How Verbal Embodiment in Conversational AI Shapes Consumer-Brand Relationships [J].
Bergner, Anouk S. ;
Hildebrand, Christian ;
Haubl, Gerald .
JOURNAL OF CONSUMER RESEARCH, 2023, 50 (04) :742-764
[9]  
Boersma Paul., 2021, I PHONETIC SCI U AMS
[10]  
Breazeal C, 2001, IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RJS INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, P1388, DOI 10.1109/IROS.2001.977175