Top-down generation of low-resolution representations improves visual perception and imagination

被引:3
|
作者
Bi, Zedong [1 ]
Li, Haoran [2 ]
Tian, Liang [2 ,3 ,4 ,5 ]
机构
[1] Lingang Lab, Shanghai 200031, Peoples R China
[2] Hong Kong Baptist Univ, Dept Phys, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Inst Computat & Theoret Studies, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Inst Syst Med & Hlth Sci, Hong Kong, Peoples R China
[5] Hong Kong Baptist Univ, State Key Lab Environm & Biol Anal, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative model; Visual system; Sketch generation; RECEPTIVE-FIELDS; WORKING-MEMORY; DYNAMICS; INHIBITION; MECHANISMS; CORTEX;
D O I
10.1016/j.neunet.2023.12.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Perception or imagination requires top-down signals from high-level cortex to primary visual cortex (V1) to reconstruct or simulate the representations bottom-up stimulated by the seen images. Interestingly, top-down signals in V1 have lower spatial resolution than bottom-up representations. It is unclear why the brain uses low-resolution signals to reconstruct or simulate high-resolution representations. By modeling the top-down pathway of the visual system using the decoder of a variational auto-encoder (VAE), we reveal that low resolution top-down signals can better reconstruct or simulate the information contained in the sparse activities of V1 simple cells, which facilitates perception and imagination. This advantage of low-resolution generation is related to facilitating high-level cortex to form geometry-respecting representations observed in experiments. Furthermore, we present two findings regarding this phenomenon in the context of AI-generated sketches, a style of drawings made of lines. First, we found that the quality of the generated sketches critically depends on the thickness of the lines in the sketches: thin-line sketches are harder to generate than thick-line sketches. Second, we propose a technique to generate high-quality thin-line sketches: instead of directly using original thin-line sketches, we use blurred sketches to train VAE or GAN (generative adversarial network), and then infer the thin-line sketches from the VAE-or GAN-generated blurred sketches. Collectively, our work suggests that low-resolution top-down generation is a strategy the brain uses to improve visual perception and imagination, which inspires new sketch-generation AI techniques.
引用
收藏
页码:440 / 456
页数:17
相关论文
共 50 条
  • [1] How to (and how not to) think about top-down influences on visual perception
    Teufel, Christoph
    Nanay, Bence
    CONSCIOUSNESS AND COGNITION, 2017, 47 : 17 - 25
  • [2] Review of Theories and Data on the Role of Top-Down Pathways in Visual Perception
    Maric, Mateja
    PSIHOLOGIJSKE TEME, 2024, 33 (02): : 333 - 354
  • [3] Distinct Top-down and Bottom-up Brain Connectivity During Visual Perception and Imagery
    Dijkstra, N.
    Zeidman, P.
    Ondobaka, S.
    van Gerven, M. A. J.
    Friston, K.
    SCIENTIFIC REPORTS, 2017, 7
  • [4] Top-down control of visual cortex in migraine populations
    Mickleborough, Marla J. S.
    Truong, Grace
    Handy, Todd C.
    NEUROPSYCHOLOGIA, 2011, 49 (05) : 1006 - 1015
  • [5] Top-down modulation of visual and auditory cortical processing in aging
    Guerreiro, Maria J. S.
    Eck, Judith
    Moerel, Michelle
    Evers, Elisabeth A. T.
    Van Gerven, Pascal W. M.
    BEHAVIOURAL BRAIN RESEARCH, 2015, 278 : 226 - 234
  • [6] Functional connectivity during top-down modulation of visual short-term memory representations
    Kuo, Bo-Cheng
    Yeh, Yei-Yu
    Chen, Anthony J. -W.
    D'Esposito, Mark
    NEUROPSYCHOLOGIA, 2011, 49 (06) : 1589 - 1596
  • [7] Top-Down Control of Rhythm Perception Modulates Early Auditory Responses
    Iversen, John R.
    Repp, Bruno H.
    Patel, Aniruddh D.
    NEUROSCIENCES AND MUSIC III: DISORDERS AND PLASTICITY, 2009, 1169 : 58 - 73
  • [8] Prefrontal cortex modulates posterior alpha oscillations during top-down guided visual perception
    Helfrich, Randolph F.
    Huang, Melody
    Wilson, Guy
    Knight, Robert T.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (35) : 9457 - 9462
  • [9] Top-Down Control of Visual Responses to Fear by the Amygdala
    Furl, Nicholas
    Henson, Richard N.
    Friston, Karl J.
    Calder, Andrew J.
    JOURNAL OF NEUROSCIENCE, 2013, 33 (44) : 17435 - 17443
  • [10] Top-Down Modulation of Lateral Interactions in Visual Cortex
    Ramalingam, Nirmala
    McManus, Justin N. J.
    Li, Wu
    Gilbert, Charles D.
    JOURNAL OF NEUROSCIENCE, 2013, 33 (05) : 1773 - 1789