Top-down generation of low-resolution representations improves visual perception and imagination

被引:3
|
作者
Bi, Zedong [1 ]
Li, Haoran [2 ]
Tian, Liang [2 ,3 ,4 ,5 ]
机构
[1] Lingang Lab, Shanghai 200031, Peoples R China
[2] Hong Kong Baptist Univ, Dept Phys, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Inst Computat & Theoret Studies, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Inst Syst Med & Hlth Sci, Hong Kong, Peoples R China
[5] Hong Kong Baptist Univ, State Key Lab Environm & Biol Anal, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative model; Visual system; Sketch generation; RECEPTIVE-FIELDS; WORKING-MEMORY; DYNAMICS; INHIBITION; MECHANISMS; CORTEX;
D O I
10.1016/j.neunet.2023.12.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Perception or imagination requires top-down signals from high-level cortex to primary visual cortex (V1) to reconstruct or simulate the representations bottom-up stimulated by the seen images. Interestingly, top-down signals in V1 have lower spatial resolution than bottom-up representations. It is unclear why the brain uses low-resolution signals to reconstruct or simulate high-resolution representations. By modeling the top-down pathway of the visual system using the decoder of a variational auto-encoder (VAE), we reveal that low resolution top-down signals can better reconstruct or simulate the information contained in the sparse activities of V1 simple cells, which facilitates perception and imagination. This advantage of low-resolution generation is related to facilitating high-level cortex to form geometry-respecting representations observed in experiments. Furthermore, we present two findings regarding this phenomenon in the context of AI-generated sketches, a style of drawings made of lines. First, we found that the quality of the generated sketches critically depends on the thickness of the lines in the sketches: thin-line sketches are harder to generate than thick-line sketches. Second, we propose a technique to generate high-quality thin-line sketches: instead of directly using original thin-line sketches, we use blurred sketches to train VAE or GAN (generative adversarial network), and then infer the thin-line sketches from the VAE-or GAN-generated blurred sketches. Collectively, our work suggests that low-resolution top-down generation is a strategy the brain uses to improve visual perception and imagination, which inspires new sketch-generation AI techniques.
引用
收藏
页码:440 / 456
页数:17
相关论文
共 50 条
  • [41] Top-down modulation of visual cortical processing after transient congenital blindness
    Guerreiro, Maria J. S.
    Kekunnaya, Ramesh
    Roeder, Brigitte
    NEUROPSYCHOLOGIA, 2022, 174
  • [42] Bottom-up and top-down brain functional connectivity underlying comprehension of everyday visual action
    Hanson, S. J.
    Hanson, C.
    Halchenko, Y.
    Matsuka, T.
    Zaimi, A.
    BRAIN STRUCTURE & FUNCTION, 2007, 212 (3-4) : 231 - 244
  • [43] Unseeing the White Bear: Negative Search Criteria Guide Visual Attention Through Top-Down Suppression
    Forstinger, Marlene
    Gruener, Markus
    Ansorge, Ulrich
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2022, 48 (06) : 613 - 638
  • [44] Learning selective top-down control enhances performance in a visual categorization task
    Pannunzi, Mario
    Gigante, Guido
    Mattia, Maurizio
    Deco, Gustavo
    Fusi, Stefano
    Del Giudice, Paolo
    JOURNAL OF NEUROPHYSIOLOGY, 2012, 108 (11) : 3124 - 3137
  • [45] Deficits in multi-scale top-down processes distorting auditory perception in schizophrenia
    Yang, Fuyin
    Zhu, Hao
    Yu, Lingfang
    Lu, Weihong
    Zhang, Chen
    Tian, Xing
    BEHAVIOURAL BRAIN RESEARCH, 2021, 412
  • [46] Bottom-up and top-down emotion generation: implications for emotion regulation
    McRae, Kateri
    Misra, Supriya
    Prasad, Aditya K.
    Pereira, Sean C.
    Gross, James J.
    SOCIAL COGNITIVE AND AFFECTIVE NEUROSCIENCE, 2012, 7 (03) : 253 - 262
  • [47] Functional interactions between prefrontal and visual association cortex contribute to top-down modulation of visual processing
    Gazzaley, Adam
    Rissman, Jesse
    Cooney, Jeffrev
    Rutman, Aaron
    Seibert, Tyler
    Clapp, Wesley
    D'Esposito, Mark
    CEREBRAL CORTEX, 2007, 17 : I125 - I135
  • [48] From the heart to the mind: cardiac vagal tone modulates top-down and bottom-up visual perception and attention to emotional stimuli
    Park, Gewnhi
    Thayer, Julian F.
    FRONTIERS IN PSYCHOLOGY, 2014, 5
  • [49] What Top-Down Task Sets Do For Us: An ERP Study on The Benefits of Advance Preparation in Visual Search
    Eimer, Martin
    Kiss, Monika
    Nicholas, Susan
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2011, 37 (06) : 1758 - 1766
  • [50] Top-Down Control of Visual Alpha Oscillations: Sources of Control Signals and Their Mechanisms of Action
    Wang, Chao
    Rajagovindan, Rajasimhan
    Han, Sahng-Min
    Ding, Mingzhou
    FRONTIERS IN HUMAN NEUROSCIENCE, 2016, 10