SpFormer: Spatio-Temporal Modeling for Scanpaths with Transformer

被引:0
|
作者
Zhong, Wenqi [1 ]
Yu, Linzhi [1 ]
Xia, Chen [1 ]
Han, Junwei [1 ]
Zhang, Dingwen [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian, Peoples R China
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7 | 2024年
基金
中国国家自然科学基金;
关键词
VISUAL WORKING-MEMORY; EYE-MOVEMENTS; PREDICTION; TASK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Saccadic scanpath, a data representation of human visual behavior, has received broad interest in multiple domains. Scanpath is a complex eye-tracking data modality that includes the sequences of fixation positions and fixation duration, coupled with image information. However, previous methods usually face the spatial misalignment problem of fixation features and loss of critical temporal data (including temporal correlation and fixation duration). In this study, we propose a Transformer-based scanpath model, SpFormer, to alleviate these problems. First, we propose a fixation-centric paradigm to extract the aligned spatial fixation features and tokenize the scanpaths. Then, according to the visual working memory mechanism, we design a local meta attention to reduce the semantic redundancy of fixations and guide the model to focus on the meta scanpath. Finally, we progressively integrate the duration information and fuse it with the fixation features to solve the problem of ambiguous location with the Transformer block increasing. We conduct extensive experiments on four databases under three tasks. The SpFormer establishes new state-of-the-art results in distinct settings, verifying its flexibility and versatility in practical applications. The code can be obtained from https://github.com/wenqizhong/SpFormer.
引用
收藏
页码:7605 / 7613
页数:9
相关论文
共 50 条
  • [31] Deep learning for spatio-temporal modeling: Dynamic traffic flows and high frequency trading
    Dixon, Matthew F.
    Polson, Nicholas G.
    Sokolov, Vadim O.
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2019, 35 (03) : 788 - 807
  • [32] Spatio-Temporal Modeling and Spatial Inference Using NA-CORDEX Climate Data
    Lin, Wenyi
    Schwartzman, Armin
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2025,
  • [33] SPATIO-TEMPORAL EXCEEDANCE LOCATIONS AND CONFIDENCE REGIONS
    French, Joshua P.
    Sain, Stephan R.
    ANNALS OF APPLIED STATISTICS, 2013, 7 (03): : 1421 - 1449
  • [34] Spatio-temporal topography of saccadic overestimation of time
    Knoell, Jonas
    Morrone, M. Concetta
    Bremmer, Frank
    VISION RESEARCH, 2013, 83 : 56 - 65
  • [35] Spatio-temporal DeepKriging for interpolation and probabilistic forecasting
    Nag, Pratik
    Sun, Ying
    Reich, Brian J.
    SPATIAL STATISTICS, 2023, 57
  • [36] Learning to rank spatio-temporal event hotspots
    Mohler, George
    Porter, Michael
    Carter, Jeremy
    LaFree, Gary
    CRIME SCIENCE, 2020, 9 (01)
  • [37] A spatio-temporal interaction on the apparent motion trace
    Schwiedrzik, C. M.
    Alink, A.
    Kohler, A.
    Singer, W.
    Muckli, L.
    VISION RESEARCH, 2007, 47 (28) : 3424 - 3433
  • [38] Time varying spatio-temporal covariance models
    Ip, Ryan H. L.
    Li, W. K.
    SPATIAL STATISTICS, 2015, 14 : 269 - 285
  • [39] Fixed Rank Filtering for Spatio-Temporal Data
    Cressie, Noel
    Shi, Tao
    Kang, Emily L.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2010, 19 (03) : 724 - 745
  • [40] Multivariate Kalman filtering for spatio-temporal processes
    Ferreira, Guillermo
    Mateu, Jorge
    Porcu, Emilio
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2022, 36 (12) : 4337 - 4354