Towards Music-Aware Virtual Assistants

被引:1
作者
Lindlbauer, David [1 ]
Wang, Alexander [1 ]
Donahue, Chris [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
来源
PROCEEDINGS OF THE 37TH ANNUAL ACM SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY, USIT 2024 | 2024年
关键词
Audio; Music; Virtual Assistants; Notifcation; Interruptions; Speech; Machine Learning; TEXT-TO-SPEECH;
D O I
10.1145/3654777.3676416
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a system for modifying spoken notifcations in a manner that is sensitive to the music a user is listening to. Spoken notifcations provide convenient access to rich information without the need for a screen. Virtual assistants see prevalent use in handsfree settings such as driving or exercising, activities where users also regularly enjoy listening to music. In such settings, virtual assistants will temporarily mute a user's music to improve intelligibility. However, users may perceive these interruptions as intrusive, negatively impacting their music-listening experience. To address this challenge, we propose the concept of music-aware virtual assistants, where speech notifcations are modifed to resemble a voice singing in harmony with the user's music. We contribute a system that processes user music and notifcation text to produce a blended mix, replacing original song lyrics with the notifcation content. In a user study comparing musical assistants to standard virtual assistants, participants expressed that musical assistants ft better with music, reduced intrusiveness, and provided a more delightful listening experience overall.
引用
收藏
页数:14
相关论文
共 57 条
[11]   CATCHING THE LYRICS: INTELLIGIBILITY IN TWELVE SONG GENRES [J].
Condit-Schultz, Nathaniel ;
Huron, David .
MUSIC PERCEPTION, 2015, 32 (05) :470-483
[12]   "What Can I Help You With?": Infrequent Users' Experiences of Intelligent Personal Assistants [J].
Cowan, Benjamin R. ;
Pantidi, Nadia ;
Coyle, David ;
Morrissey, Kellie ;
Clarke, Peter ;
Al-Shehri, Sara ;
Earley, David ;
Bandeira, Natasha .
PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON HUMAN-COMPUTER INTERACTION WITH MOBILE DEVICES AND SERVICES (MOBILEHCI '17), 2017,
[13]  
Cutler Anne, 2013, Prosody: Models and measurements, V14
[14]  
Dai Shuqi, 2023, P 24 INT SOC MUS INF, V1, P4
[15]  
Davis Kendrick, 2018, The Manic Mashups Charming China's Internet
[16]  
Deutsch Diana., 2008, J. Acoust. Soc. Am, V124, P10
[17]  
Donahue C, 2019, Arxiv, DOI [arXiv:1907.04868, 10.48550/arXiv.1907.04868]
[18]  
Donahue Chris, 2021, PROC ISMIR LATE BREA
[19]  
Donahue Chris, 2022, arXiv
[20]  
Donahue J, 2021, Arxiv, DOI arXiv:2006.03575