Top-down generation of low-resolution representations improves visual perception and imagination

被引:3
|
作者
Bi, Zedong [1 ]
Li, Haoran [2 ]
Tian, Liang [2 ,3 ,4 ,5 ]
机构
[1] Lingang Lab, Shanghai 200031, Peoples R China
[2] Hong Kong Baptist Univ, Dept Phys, Hong Kong, Peoples R China
[3] Hong Kong Baptist Univ, Inst Computat & Theoret Studies, Hong Kong, Peoples R China
[4] Hong Kong Baptist Univ, Inst Syst Med & Hlth Sci, Hong Kong, Peoples R China
[5] Hong Kong Baptist Univ, State Key Lab Environm & Biol Anal, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Generative model; Visual system; Sketch generation; RECEPTIVE-FIELDS; WORKING-MEMORY; DYNAMICS; INHIBITION; MECHANISMS; CORTEX;
D O I
10.1016/j.neunet.2023.12.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Perception or imagination requires top-down signals from high-level cortex to primary visual cortex (V1) to reconstruct or simulate the representations bottom-up stimulated by the seen images. Interestingly, top-down signals in V1 have lower spatial resolution than bottom-up representations. It is unclear why the brain uses low-resolution signals to reconstruct or simulate high-resolution representations. By modeling the top-down pathway of the visual system using the decoder of a variational auto-encoder (VAE), we reveal that low resolution top-down signals can better reconstruct or simulate the information contained in the sparse activities of V1 simple cells, which facilitates perception and imagination. This advantage of low-resolution generation is related to facilitating high-level cortex to form geometry-respecting representations observed in experiments. Furthermore, we present two findings regarding this phenomenon in the context of AI-generated sketches, a style of drawings made of lines. First, we found that the quality of the generated sketches critically depends on the thickness of the lines in the sketches: thin-line sketches are harder to generate than thick-line sketches. Second, we propose a technique to generate high-quality thin-line sketches: instead of directly using original thin-line sketches, we use blurred sketches to train VAE or GAN (generative adversarial network), and then infer the thin-line sketches from the VAE-or GAN-generated blurred sketches. Collectively, our work suggests that low-resolution top-down generation is a strategy the brain uses to improve visual perception and imagination, which inspires new sketch-generation AI techniques.
引用
收藏
页码:440 / 456
页数:17
相关论文
共 50 条
  • [31] Top-down Visual Selective Attention Model Combined with Bottom-up Saliency Map for Incremental Object Perception
    Ban, Sang-Woo
    Kim, Bumhwi
    Lee, Minho
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [33] Top-Down Feedback Controls the Cortical Representation of Illusory Contours in Mouse Primary Visual Cortex
    Pak, Alexandr
    Ryu, Esther
    Li, Claudia
    Chubykin, Alexander A.
    JOURNAL OF NEUROSCIENCE, 2020, 40 (03) : 648 - 660
  • [34] Low-Resolution Place and Response Learning Capacities in Down Syndrome
    Bostelmann, Mathilde
    Costanzo, Floriana
    Martorana, Lorelay
    Menghini, Deny
    Vicari, Stefano
    Lavenex, Pamela Banta
    Lavenex, Pierre
    FRONTIERS IN PSYCHOLOGY, 2018, 9
  • [35] Auditory hallucinations, top-down processing and language perception: a general population study
    de Boer, J. N.
    Linszen, M. M. J.
    de Vries, J.
    Schutte, M. J. L.
    Begemann, M. J. H.
    Heringa, S. M.
    Bohlken, M. M.
    Hugdahl, K.
    Aleman, A.
    Wijnen, F. N. K.
    Sommer, I. E. C.
    PSYCHOLOGICAL MEDICINE, 2019, 49 (16) : 2772 - 2780
  • [36] Visual crowding involves delayed frontoparietal response and enhanced top-down modulation
    Han, Qiming
    Luo, Huan
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2019, 50 (06) : 2931 - 2941
  • [37] Modeling the Interactions of Bottom-Up and Top-Down Guidance in Visual Attention
    Henderickx, David
    Maetens, Kathleen
    Geerinck, Thomas
    Soetens, Eric
    ATTENTION IN COGNITIVE SYSTEMS, 2009, 5395 : 197 - +
  • [38] Top-down modulation in human visual cortex predicts the stability of a perceptual illusion
    Kloosterman, Niels A.
    Meindertsma, Thomas
    Hillebrand, Arjan
    van Dijk, Bob W.
    Lamme, Victor A. F.
    Donner, Tobias H.
    JOURNAL OF NEUROPHYSIOLOGY, 2015, 113 (04) : 1063 - 1076
  • [39] Diminished Top-Down Control Underlies a Visual Imagery Deficit in Normal Aging
    Kalkstein, Jonathan
    Checksfield, Kristen
    Bollinger, Jacob
    Gazzaley, Adam
    JOURNAL OF NEUROSCIENCE, 2011, 31 (44) : 15768 - 15774
  • [40] Top-down modulation of DLPFC in visual search: a study based on fMRI and TMS
    Tian, Yin
    Tan, Congming
    Tan, Jianling
    Yang, Li
    Tang, Yi
    CEREBRAL CORTEX, 2024, 34 (02)