Prediction of visual attention with deep CNN on artificially degraded videos for studies of attention of patients with Dementia

被引:12
作者
Chaabouni, Souad [1 ,2 ]
Benois-Pineau, Jenny [1 ]
Tison, Francois [3 ]
Ben Amar, Chokri [2 ]
Zemmari, Akka [1 ]
机构
[1] Univ Bordeaux, LaBRI UMR 5800, F-33400 Talence, France
[2] Univ Sfax, REGIM Lab LR11ES48, Sfax 3029, Tunisia
[3] CHU Bordeaux GH Pellegrin, Bordeaux, France
关键词
Deep CNN; Saliency; Normal video; Degraded video; Dementia diseases; POPULATIONS;
D O I
10.1007/s11042-017-4796-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Studies of visual attention of patients with Dementia such as Parkinson's Disease Dementia and Alzheimer Disease is a promising way for non-invasive diagnostics. Past research showed, that people suffering from dementia are not reactive with regard to degradations on still images. Attempts are being made to study their visual attention relatively to the video content. Here the delays in their reactions on novelty and "unusual" novelty of the visual scene are expected. Nevertheless, large-scale screening of population is possible only if sufficiently robust automatic prediction models can be built. In the medical protocols the detection of Dementia behavior in visual content observation is always performed in comparison with healthy, "normal control" subjects. Hence, it is a research question per see as to develop an automatic prediction models for specific visual content to use in psycho-visual experience involving Patients with Dementia (PwD). The difficulty of such a prediction resides in a very small amount of training data. In this paper the reaction of healthy normal control subjects on degraded areas in videos was studied. Furthermore, in order to build an automatic prediction model for salient areas in intentionally degraded videos for PwD studies, a deep learning architecture was designed. Optimal transfer learning strategy for training the model in case of very small amount of training data was deployed. The comparison with gaze fixation maps and classical visual attention prediction models was performed. Results are interesting regarding the reaction of normal control subjects against degraded areas in videos.
引用
收藏
页码:22527 / 22546
页数:20
相关论文
共 33 条
  • [21] Long Mai, 2011, Proceedings of the 2011 IEEE International Symposium on Multimedia (ISM 2011), P91, DOI 10.1109/ISM.2011.23
  • [22] Retinal nerve fiber layer structure abnormalities in early Alzheimer's disease: Evidence in optical coherence tomography
    Lu, Yan
    Li, Zhen
    Zhang, Xinqing
    Ming, Baoquan
    Jia, Jianping
    Wan, Rong
    Ma, Daqing
    [J]. NEUROSCIENCE LETTERS, 2010, 480 (01) : 69 - 72
  • [23] Marszalek M., 2009, IEEE C COMP VIS PATT
  • [24] Actions in the Eye: Dynamic Gaze Datasets and Learnt Saliency Models for Visual Recognition
    Mathe, Stefan
    Sminchisescu, Cristian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (07) : 1408 - 1424
  • [25] Bottom-up and top-down attention are independent
    Pinto, Yair
    van der Leij, Andries R.
    Sligte, Ilja G.
    Lamme, Victor A. F.
    Scholte, H. Steven
    [J]. JOURNAL OF VISION, 2013, 13 (03): : 16
  • [26] Static and space-time visual saliency detection by self-resemblance
    Seo, Hae Jong
    Milanfar, Peyman
    [J]. JOURNAL OF VISION, 2009, 9 (12):
  • [27] Learning to predict eye fixations for semantic contents using multi-layer sparse network
    Shen, Chengyao
    Zhao, Qi
    [J]. NEUROCOMPUTING, 2014, 138 : 61 - 68
  • [28] Simonyan K., 2013, 13126034 ARXIV
  • [29] FEATURE-INTEGRATION THEORY OF ATTENTION
    TREISMAN, AM
    GELADE, G
    [J]. COGNITIVE PSYCHOLOGY, 1980, 12 (01) : 97 - 136
  • [30] High-throughput classification of clinical populations from natural viewing eye movements
    Tseng, Po-He
    Cameron, Ian G. M.
    Pari, Giovanna
    Reynolds, James N.
    Munoz, Douglas P.
    Itti, Laurent
    [J]. JOURNAL OF NEUROLOGY, 2013, 260 (01) : 275 - 284