I See What You're Hearing: Facilitating The Effect of Environment on Perceived Emotion While Teleconferencing

Cited by: 1
Authors
Marino D. [1 ]
Henry M. [1 ]
Fortin P.E. [2 ]
Bhayana R. [3 ]
Cooperstock J. [1 ]
Affiliations
[1] McGill University, Center for Intelligent Machines, Montreal, H3A 0G4, QC
[2] McGill University, Dept. of Electrical and Computer Engineering, Montreal, H3A 0G4, QC
[3] Indraprastha Institute of Information Technology, Dept. of Human Centered Design, New Delhi, Delhi
Keywords
context; multimodal; teleconferencing; visualization;
DOI
10.1145/3579495
Abstract
Our perception of emotion is highly contextual. Changes in the environment can affect our narrative framing and thus alter our emotional perception of interlocutors. User environments are typically heavily suppressed due to the technical limitations of commercial videoconferencing platforms. As a result, there is often a lack of contextual awareness while participating in a video call, and this affects how we perceive the emotions of conversants. We present a videoconferencing module that visualizes the user's aural environment to enhance awareness between interlocutors. The system visualizes environmental sound based on its semantic and acoustic properties. We found that our visualization system was about 50% effective at eliciting emotional perceptions in users similar to those elicited by the environmental sound it replaced. The contributed system provides a unique approach to facilitating ambient awareness on an implicit emotional level in situations where multimodal environmental context is suppressed. © 2023 ACM.
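The abstract states that environmental sound is visualized from its semantic and acoustic properties. The sketch below is a rough illustration only, not the authors' implementation: it extracts two acoustic descriptors with librosa and combines them with a placeholder semantic label to produce simple visual parameters. All names, thresholds, and mappings here are assumed for illustration.

    # Minimal sketch (assumed, not the published system): derive semantic and
    # acoustic properties of an environmental sound clip and map them onto
    # visual parameters a videoconferencing overlay could render.
    import librosa
    import numpy as np

    def describe_environment(wav_path: str) -> dict:
        """Return illustrative visual parameters for one environmental sound clip."""
        y, sr = librosa.load(wav_path, sr=22050, mono=True)

        # Acoustic properties: overall loudness (RMS) and brightness (spectral centroid).
        rms = float(np.mean(librosa.feature.rms(y=y)))
        centroid = float(np.mean(librosa.feature.spectral_centroid(y=y, sr=sr)))

        # Semantic property: a real system would use a sound-event classifier here;
        # this placeholder label is purely hypothetical.
        semantic_label = "unknown"

        return {
            "label": semantic_label,          # what the sound is (semantic)
            "size": min(1.0, rms * 20.0),     # louder sound -> larger glyph (arbitrary scaling)
            "hue": min(1.0, centroid / sr),   # brighter sound -> shifted hue (arbitrary mapping)
        }

    if __name__ == "__main__":
        print(describe_environment("example_environment.wav"))

In the published system, the semantic label would come from sound classification and these parameters would drive the on-screen visualization shown to remote interlocutors; the mapping above only indicates the general shape of such a pipeline.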
References
35 in total
[1]  
Adalgeirsson S.O., Breazeal C., MeBot: A robotic platform for socially embodied telepresence, 2010 5th ACM/IEEE International Conference on Human-Robot Interaction (HRI)., pp. 15-22, (2010)
[2]  
Aubrey A.J., Marshall D., Rosin P.L., Vandeventer J., Cunningham D.W., Wallraven C., Cardiff conversation database (CCDb): A database of natural dyadic conversations, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 277-282, (2013)
[3]  
Feldman Barrett L., Mesquita B., Gendron M., Context in emotion perception, Current Directions in Psychological Science, 20, 5, pp. 286-290, (2011)
[4]  
Bergstrom T., Karahalios K., Conversation Clock: Visualizing audio patterns in co-located groups, 2007 40th Annual Hawaii International Conference on System Sciences (HICSS'07). IEEE, pp. 78-78, (2007)
[5]  
Bucci P.H., Cang X.L., Mah H., Rodgers L., MacLean K.E., Real Emotions Don't Stand Still: Toward Ecologically Viable Representation of Affective Interaction, 2019 8th International Conference on Affective Computing and Intelligent Interaction (ACII). IEEE, pp. 1-7, (2019)
[6]  
Cartwright M., Seals A., Salamon J., Williams A., Mikloska S., MacConnell D., Law E., Bello J.P., Nov O., Seeing sound: Investigating the effects of visualizations and complexity on crowdsourced audio annotations, Proceedings of the ACM on Human-Computer Interaction, 1, CSCW, pp. 1-21, (2017)
[7]  
Chen Y., Huang P., Woods A., Spence C., When "Bouba" equals "Kiki": Cultural commonalities and cultural differences in sound-shape correspondences, Scientific reports, 6, 1, pp. 1-9, (2016)
[8]  
Cooperstock J.R., Multimodal telepresence systems, IEEE Signal Processing Magazine, 28, 1, pp. 77-86, (2010)
[9]  
De Saussure F., Course in general linguistics, (2011)
[10]  
Donath J., Visiphone: Connecting domestic spaces with audio, International Conference on Auditory Display, (2000)