Fixation Prediction through Multimodal Analysis

被引:0
|
作者
Min, Xiongkuo [1 ]
Zhai, Guangtao [1 ]
Hu, Chunjia [1 ]
Gu, Ke [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai, Peoples R China
来源
2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP) | 2015年
关键词
Audio-visual attention; multimodal analysis; saliency; fixation prediction; attention fusion; MODEL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose to predict human fixations by incorporating both audio and visual cues. Traditional visual attention models generally make the utmost of stimuli's visual features, while discarding all audio information. But in the real world, we human beings not only direct our gaze according to visual saliency but also may be attracted by some salient audio. Psychological experiments show that audio may have some influence on visual attention, and subjects tend to be attracted the sound sources. Therefore, we propose to fuse both audio and visual information to predict fixations. In our framework, we first localize the moving-sounding objects through multimodal analysis and generate an audio attention map, in which greater value denotes higher possibility of a position being the sound source. Then we calculate the spatial and temporal attention maps using only the visual modality. At last, the audio, spatial and temporal attention maps are fused, generating our final audio-visual saliency map. We gather a set of videos and collect eye-tracking data under audio-visual test conditions. Experiment results show that we can achieve better performance when considering both audio and visual cues.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] A novel grey prediction evolution algorithm for multimodal multiobjective optimization
    Zhou, Ting
    Hu, Zhongbo
    Zhou, Quan
    Yuan, Shixiong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 100
  • [32] Patient specific tumor growth prediction using multimodal images
    Liu, Yixun
    Sadowski, Samira M.
    Weisbrod, Allison B.
    Kebebew, Electron
    Summers, Ronald M.
    Yao, Jianhua
    MEDICAL IMAGE ANALYSIS, 2014, 18 (03) : 555 - 566
  • [33] Uncertainty-Aware Yield Prediction with Multimodal Molecular Features
    Chen, Jiayuan
    Guo, Kehan
    Liu, Zhen
    Isayev, Olexandr
    Zhang, Xiangliang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 8, 2024, : 8274 - 8282
  • [34] Exploiting inter-image similarity and ensemble of extreme learners for fixation prediction using deep features
    Tavakoli, Hamed R.
    Borji, Ali
    Laaksonen, Jorma
    Rahtu, Esa
    NEUROCOMPUTING, 2017, 244 : 10 - 18
  • [35] How Do Drivers Allocate Their Potential Attention? Driving Fixation Prediction via Convolutional Neural Networks
    Deng, Tao
    Yan, Hongmei
    Qin, Long
    Thuyen Ngo
    Manjunath, B. S.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (05) : 2146 - 2154
  • [36] Price prediction of e-commerce products through Internet sentiment analysis
    Tseng, Kuo-Kun
    Lin, Regina Fang-Ying
    Zhou, Hongfu
    Kurniajaya, Kevin Jati
    Li, Qianyu
    ELECTRONIC COMMERCE RESEARCH, 2018, 18 (01) : 65 - 88
  • [37] CFN: A coarse-to-fine network for eye fixation prediction
    Xu, Binwei
    Liang, Haoran
    Liang, Ronghua
    Chen, Peng
    IET IMAGE PROCESSING, 2022, 16 (09) : 2373 - 2383
  • [38] Driver's Eye Fixation Prediction by Deep Neural Network
    Shirpour, Mohsen
    Beauchemin, Steven S.
    Bauer, Michael A.
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, : 67 - 75
  • [39] Semantic meaning modulates object importance in human fixation prediction
    Li, Aoqi
    Chen, Zhenzhong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 79
  • [40] VIDEO QUALITY METRIC BASED ON FIXATION PREDICTION AND FOVEAL IMAGING
    You, Junyong
    Ebrahimi, Touradj
    Perkis, Andrew
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1509 - 1512