MIMOSA: Human-AI Co-Creation of Computational Spatial Audio Effects on Videos

被引:0
|
作者
Ning, Zheng [1 ]
Zhang, Zheng [1 ]
Ban, Jerrick [1 ]
Jiang, Kaiwen [2 ]
Gan, Ruohong [3 ]
Tian, Yapeng [4 ]
Li, Toby Jia-Jun [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
[2] Univ Calif San Diego, La Jolla, CA 92093 USA
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Univ Texas Dallas, Richardson, TX 75083 USA
来源
PROCEEDINGS OF THE 16TH CONFERENCE ON CREATIVITY AND COGNITION, C&C 2024 | 2024年
关键词
video; sound effects; multimodal; creator tools;
D O I
10.1145/3635636.3656189
中图分类号
J [艺术];
学科分类号
13 ; 1301 ;
摘要
Spatial audio offers more immersive video consumption experiences to viewers; however, creating and editing spatial audio often expensive and requires specialized hardware equipment and skills, posing a high barrier for amateur video creators. We present MIMOSA, a human-AI co-creation tool that enables amateur users to computationally generate and manipulate spatial audio effects. For a video with only monaural or stereo audio, MIMOSA automatically grounds each sound source to the corresponding sounding object in the visual scene and enables users to further validate and fix errors in the location of the sounding objects. Users can also augment the spatial audio effect by flexibly manipulating the sounding source positions and creatively customizing the audio effect. The design of MIMOSA exemplifies a human-AI collaboration approach that, instead of utilizing state-of-art end-to-end "black-box" ML models, uses a multistep pipeline that aligns its interpretable intermediate results with the user's workflow. A lab user study with 15 participants demonstrates MIMOSA's usability, usefulness, expressiveness, and capability in creating immersive spatial audio effects in collaboration with users.
引用
收藏
页码:156 / 169
页数:14
相关论文
共 50 条
  • [1] AI Creativity and the Human-AI Co-creation Model
    Wu, Zhuohao
    Ji, Danwen
    Yu, Kaiwen
    Zeng, Xianxu
    Wu, Dingming
    Shidujaman, Mohammad
    HUMAN-COMPUTER INTERACTION: THEORY, METHODS AND TOOLS, HCII 2021, PT I, 2021, 12762 : 171 - 190
  • [2] Research on human-AI co-creation based on reflective design practice
    Fu, Zhiyong
    Zhou, Yuyao
    CCF TRANSACTIONS ON PERVASIVE COMPUTING AND INTERACTION, 2020, 2 (01) : 33 - 41
  • [3] ReelFramer: Human-AI Co-Creation for News-to-Video Translation
    Wang, Sitong
    Menon, Samia
    Long, Tao
    Henderson, Keren
    Li, Dingzeyu
    Crowston, Kevin
    Hansen, Mark
    Nickerson, Jefrey V.
    Chilton, Lydia B.
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
  • [4] Sand Playground: Designing Human-AI physical Interface for Co-creation in Motion
    El-Zanfaly, Dina
    Huang, Yiwei
    Dong, Yanwen
    PROCEEDINGS OF THE 14TH CREATIVITY AND COGNITION, C&C 2022, 2022, : 49 - 55
  • [5] ContextCam: Bridging Context Awareness with Creative Human-AI Image Co-Creation
    Fan, Xianzhe
    Wu, Zihan
    Yu, Chun
    Rao, Fenggui
    Shi, Weinan
    Tu, Teng
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2024, 2024,
  • [6] DeepThInk: Designing and probing human-AI co-creation in digital art therapy
    Du, Xuejun
    An, Pengcheng
    Leung, Justin
    Li, April
    Chapman, Linda E.
    Zhao, Jian
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2024, 181
  • [7] Large Language Models for Human-AI Co-Creation of Robotic Dance Performances
    De Filippo, Allegra
    Milano, Michela
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 7627 - 7635
  • [8] HAI-GEN 2020: Workshop on Human-AI Co-Creation with Generative Models
    Geyer, Werner
    Chilton, Lydia B.
    Kumar, Ranjitha
    Kalai, Adam Tauman
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES COMPANION (IUI'20), 2020, : 13 - 14
  • [9] An Evidence-based Workflow for Studying and Designing Learning Supports for Human-AI Co-creation
    Gmeiner, Frederic
    Conlin, Jamie
    Tang, Eric
    Martelaro, Nikolas
    Holstein, Kenneth
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [10] Human-AI Co-creation for Intangible Cultural Heritage Dance: Cultural Genes Retaining and Innovation
    Zhu, Hongtao
    Zhou, Xiaoxuan
    Liu, Huiwen
    HCI INTERNATIONAL 2024 POSTERS, PT III, HCII 2024, 2024, 2116 : 426 - 433