SpFormer: Spatio-Temporal Modeling for Scanpaths with Transformer

被引:0
|
作者
Zhong, Wenqi [1 ]
Yu, Linzhi [1 ]
Xia, Chen [1 ]
Han, Junwei [1 ]
Zhang, Dingwen [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian, Peoples R China
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7 | 2024年
基金
中国国家自然科学基金;
关键词
VISUAL WORKING-MEMORY; EYE-MOVEMENTS; PREDICTION; TASK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Saccadic scanpath, a data representation of human visual behavior, has received broad interest in multiple domains. Scanpath is a complex eye-tracking data modality that includes the sequences of fixation positions and fixation duration, coupled with image information. However, previous methods usually face the spatial misalignment problem of fixation features and loss of critical temporal data (including temporal correlation and fixation duration). In this study, we propose a Transformer-based scanpath model, SpFormer, to alleviate these problems. First, we propose a fixation-centric paradigm to extract the aligned spatial fixation features and tokenize the scanpaths. Then, according to the visual working memory mechanism, we design a local meta attention to reduce the semantic redundancy of fixations and guide the model to focus on the meta scanpath. Finally, we progressively integrate the duration information and fuse it with the fixation features to solve the problem of ambiguous location with the Transformer block increasing. We conduct extensive experiments on four databases under three tasks. The SpFormer establishes new state-of-the-art results in distinct settings, verifying its flexibility and versatility in practical applications. The code can be obtained from https://github.com/wenqizhong/SpFormer.
引用
收藏
页码:7605 / 7613
页数:9
相关论文
共 50 条
  • [1] SoftMatch: Comparing Scanpaths Using Combinatorial Spatio-Temporal Sequences with Fractal Curves
    Newport, Robert Ahadizad
    Russo, Carlo
    Liu, Sidong
    Al Suman, Abdulla
    Di Ieva, Antonio
    SENSORS, 2022, 22 (19)
  • [2] Modeling spatio-temporal field evolution
    Borstnik Bracic, A.
    Grabec, I.
    Govekar, E.
    EUROPEAN PHYSICAL JOURNAL B, 2009, 69 (04): : 529 - 538
  • [3] Modeling of vegetation cover and spatio-temporal variations
    Isler, Buket
    Aslan, Zafer
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2021, 36 (04): : 1863 - 1874
  • [4] STTRE: A Spatio-Temporal Transformer with Relative Embeddings for multivariate time series
    Deihim, Azad
    Alonso, Eduardo
    Apostolopoulou, Dimitra
    NEURAL NETWORKS, 2023, 168 : 549 - 559
  • [5] Spatio-temporal modeling of soil characteristics for soilscape reconstruction
    Zwertvaegher, Ann
    Finke, Peter
    De Smedt, Philippe
    Gelorini, Vanessa
    Van Meirvenne, Marc
    Bats, Machteld
    De Reu, Jeroen
    Antrop, Marc
    Bourgeois, Jean
    De Maeyer, Philippe
    Verniers, Jacques
    Crombe, Philippe
    GEODERMA, 2013, 207 : 166 - 179
  • [6] Dimension-Reduced Modeling of Spatio-Temporal Processes
    Brynjarsdottir, Jenny
    Berliner, L. Mark
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (508) : 1647 - 1659
  • [7] Study of Spatio-Temporal Modeling in Video Quality Assessment
    Fang, Yuming
    Li, Zhaoqian
    Yan, Jiebin
    Sui, Xiangjie
    Liu, Hantao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2693 - 2702
  • [8] Remaining useful life estimation of bearing using spatio-temporal convolutional transformer
    Zhu, De
    Lyu, Junwen
    Gao, Qingwei
    Lu, Yixiang
    Zhao, Dawei
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (04)
  • [9] Revisiting the robustness of spatio-temporal modeling in video quality assessment
    Yan, Jiebin
    Wu, Lei
    Jiang, Wenhui
    Liu, Chuanlin
    Shen, Fei
    DISPLAYS, 2024, 81
  • [10] Regime-based precipitation modeling: A spatio-temporal approach
    Euan, Carolina
    Sun, Ying
    Reich, Brian J.
    SPATIAL STATISTICS, 2024, 60