Fixation Prediction through Multimodal Analysis

被引:0
|
作者
Min, Xiongkuo [1 ]
Zhai, Guangtao [1 ]
Hu, Chunjia [1 ]
Gu, Ke [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai, Peoples R China
来源
2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP) | 2015年
关键词
Audio-visual attention; multimodal analysis; saliency; fixation prediction; attention fusion; MODEL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose to predict human fixations by incorporating both audio and visual cues. Traditional visual attention models generally make the utmost of stimuli's visual features, while discarding all audio information. But in the real world, we human beings not only direct our gaze according to visual saliency but also may be attracted by some salient audio. Psychological experiments show that audio may have some influence on visual attention, and subjects tend to be attracted the sound sources. Therefore, we propose to fuse both audio and visual information to predict fixations. In our framework, we first localize the moving-sounding objects through multimodal analysis and generate an audio attention map, in which greater value denotes higher possibility of a position being the sound source. Then we calculate the spatial and temporal attention maps using only the visual modality. At last, the audio, spatial and temporal attention maps are fused, generating our final audio-visual saliency map. We gather a set of videos and collect eye-tracking data under audio-visual test conditions. Experiment results show that we can achieve better performance when considering both audio and visual cues.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Multimodal analysis of startle type responses
    Cosic, Kresimir
    Popovic, Sinisa
    Kukolja, Davor
    Dropuljic, Branimir
    Ivanec, Dragutin
    Tonkovic, Mirjana
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2016, 129 : 186 - 202
  • [42] Multimodal Egocentric Analysis of Focused Interactions
    Bano, Sophia
    Suveges, Tamas
    Zhang, Jianguo
    McKenna, Stephen J.
    IEEE ACCESS, 2018, 6 : 37493 - 37505
  • [43] Multimodal method for landslide risk analysis
    Pollock, William
    Grant, Alex
    Wartman, Joseph
    Abou-Jaoude, Grace
    METHODSX, 2019, 6 : 827 - 836
  • [44] Beyond Doctors: Future Health Prediction from Multimedia and Multimodal Observations
    Nie, Liqiang
    Zhang, Luming
    Yang, Yi
    Wang, Meng
    Hong, Richang
    Chua, Tat-Seng
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 591 - 600
  • [45] Multimodal Memetic Framework for low-resolution protein structure prediction
    Nazmul, Rumana
    Chetty, Madhu
    Chowdhury, Ahsan Raja
    SWARM AND EVOLUTIONARY COMPUTATION, 2020, 52
  • [46] Sleep stage prediction using multimodal body network and circadian rhythm
    Waqar, Sahar
    Khan, Muhammad Usman Ghani
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [47] Multimodal survival prediction in advanced pancreatic cancer using machine learning
    Keyl, J.
    Kasper, S.
    Wiesweg, M.
    Goetze, J.
    Schoenrock, M.
    Sinn, M.
    Berger, A.
    Nasca, E.
    Kostbade, K.
    Schumacher, B.
    Markus, P.
    Albers, D.
    Treckmann, J.
    Schmid, K. W.
    Schildhaus, H-U
    Siveke, J. T.
    Schuler, M.
    Kleesiek, J.
    ESMO OPEN, 2022, 7 (05)
  • [48] Biomechanical Changes of Adjacent and Fixed Segments Through Cortical Bone Trajectory Screw Fixation versus Traditional Trajectory Screw Fixation in the Lumbar Spine: A Finite Element Analysis
    Zhang, Lai
    Li, Hui-Min
    Zhang, Renjie
    Zhang, Huaqing
    Shen, Cai-Liang
    WORLD NEUROSURGERY, 2021, 151 : E447 - E456
  • [49] Multimodal deep learning methods enhance genomic prediction of wheat breeding
    Montesinos-Lopez, Abelardo
    Rivera, Carolina
    Pinto, Francisco
    Pinera, Francisco
    Gonzalez, David
    Reynolds, Mathew
    Perez-Rodriguez, Paulino
    Li, H.
    Montesinos-Lopez, Osval A.
    Crossa, Jose
    G3-GENES GENOMES GENETICS, 2023, 13 (05):
  • [50] Innovation in audiovisuals through multimodal educational programs for primary education
    Alvarez-Rodriguez, Dolores
    EARI-EDUCACION ARTISTICA-REVISTA DE INVESTIGACION, 2019, (10): : 210 - 222