Variational Autoencoder for Image-Based Augmentation of Eye-Tracking Data

被引:45
作者
Elbattah, Mahmoud [1 ]
Loughnane, Colm [2 ]
Guerin, Jean-Luc [1 ]
Carette, Romuald [1 ,3 ]
Cilia, Federica [4 ]
Dequen, Gilles [1 ]
机构
[1] Univ Picardie Jules Verne, Lab Modelisat, Informat, Syst MIS, F-80080 Amiens, France
[2] Univ Limerick, Fac Sci & Engn, Limerick V94 T9PX, Ireland
[3] Evolucare Technol, F-80800 Villers Bretonneux, France
[4] Univ Picardie Jules Verne, Lab CRP CPO, F-80000 Amiens, France
关键词
deep learning; variational autoencoder; data augmentation; eye-tracking; ATTENTION; REPRESENTATIONS;
D O I
10.3390/jimaging7050083
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Over the past decade, deep learning has achieved unprecedented successes in a diversity of application domains, given large-scale datasets. However, particular domains, such as healthcare, inherently suffer from data paucity and imbalance. Moreover, datasets could be largely inaccessible due to privacy concerns, or lack of data-sharing incentives. Such challenges have attached significance to the application of generative modeling and data augmentation in that domain. In this context, this study explores a machine learning-based approach for generating synthetic eye-tracking data. We explore a novel application of variational autoencoders (VAEs) in this regard. More specifically, a VAE model is trained to generate an image-based representation of the eye-tracking output, so-called scanpaths. Overall, our results validate that the VAE model could generate a plausible output from a limited dataset. Finally, it is empirically demonstrated that such approach could be employed as a mechanism for data augmentation to improve the performance in classification tasks.
引用
收藏
页数:14
相关论文
共 67 条
[1]  
Abadi M, 2016, ACM SIGPLAN NOTICES, V51, P1, DOI [10.1145/2951913.2976746, 10.1145/3022670.2976746]
[2]  
[Anonymous], 2017, P 5 INT C LEARN REPR
[3]  
[Anonymous], 1935, How people look at pictures: a study of the psychology and perception in art
[4]  
[Anonymous], 2017, 170806020 ARXIV
[5]  
[Anonymous], GITHUB
[6]   PathGAN: Visual Scanpath Prediction with Generative Adversarial Networks [J].
Assens, Marc ;
Giro-i-Nieto, Xavier ;
McGuinness, Kevin ;
O'Connor, Noel E. .
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT V, 2019, 11133 :406-422
[7]  
Asuncion A., UCI MACHINE LEARNING
[8]  
Bachman P., 2016, 161204739 ARXIV
[9]  
Barth E., 2017, 170202390 ARXIV
[10]  
Bengio S., 2015, 151106349 ARXIV