Deep multimodal representations and classification of first-episode psychosis via live face processing

被引:0
作者
Singh, Rahul [1 ,2 ,3 ]
Zhang, Yanlei [4 ]
Bhaskar, Dhananjay [1 ,5 ]
Srihari, Vinod [6 ]
Tek, Cenk [6 ]
Zhang, Xian [3 ,6 ]
Noah, J. Adam [3 ,6 ]
Krishnaswamy, Smita [1 ,2 ,5 ]
Hirsch, Joy [1 ,3 ,6 ,7 ,8 ,9 ]
机构
[1] Yale Univ, Wu Tsai Inst, New Haven, CT 06520 USA
[2] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
[3] Yale Univ, Dept Psychiat, Brain Funct Lab, New Haven, CT 06520 USA
[4] Mila Quebec Inst, Montreal, PQ, Canada
[5] Yale Sch Med, Dept Genet, New Haven, CT 06510 USA
[6] Yale Univ, Dept Psychiat, New Haven, CT 06520 USA
[7] Yale Univ, Dept Comparat Med, New Haven, CT 06520 USA
[8] UCL, Dept Med Phys & Biomed Engn, London, England
[9] Yale Univ, Dept Neurosci, New Haven, CT 06520 USA
来源
FRONTIERS IN PSYCHIATRY | 2025年 / 16卷
关键词
RNN - recurrent neural network; face processing; multimodal representation; path signature feature; representation learning; first episode psychosis (FEP); GLOBAL ASSESSMENT; RELIABILITY; SCALE; PANSS; MODEL;
D O I
10.3389/fpsyt.2025.1518762
中图分类号
R749 [精神病学];
学科分类号
100205 ;
摘要
Schizophrenia is a severe psychiatric disorder associated with a wide range of cognitive and neurophysiological dysfunctions and long-term social difficulties. Early detection is expected to reduce the burden of disease by initiating early treatment. In this paper, we test the hypothesis that integration of multiple simultaneous acquisitions of neuroimaging, behavioral, and clinical information will be better for prediction of early psychosis than unimodal recordings. We propose a novel framework to investigate the neural underpinnings of the early psychosis symptoms (that can develop into Schizophrenia with age) using multimodal acquisitions of neural and behavioral recordings including functional near-infrared spectroscopy (fNIRS) and electroencephalography (EEG), and facial features. Our data acquisition paradigm is based on live face-toface interaction in order to study the neural correlates of social cognition in first-episode psychosis (FEP). We propose a novel deep representation learning framework, Neural-PRISM, for learning joint multimodal compressed representations combining neural as well as behavioral recordings. These learned representations are subsequently used to describe, classify, and predict the severity of early psychosis in patients, as measured by the Positive and Negative Syndrome Scale (PANSS) and Global Assessment of Functioning (GAF) scores to evaluate the impact of symptomatology. We found that incorporating joint multimodal representations from fNIRS and EEG along with behavioral recordings enhances classification between typical controls and FEP individuals (significant improvements between 10 - 20%). Additionally, our results suggest that geometric and topological features such as curvatures and path signatures of the embedded trajectories of brain activity enable detection of discriminatory neural characteristics in early psychosis.
引用
收藏
页数:12
相关论文
共 60 条
[1]   Global Assessment of Functioning (GAF): properties and frontier of current knowledge [J].
Aas, I. H. Monrad .
ANNALS OF GENERAL PSYCHIATRY, 2010, 9
[2]  
altrusaitis T., 2016, 2016 IEEE WINTER C A, P110
[3]   Analysis of functional connectivity using machine learning and deep learning in different data modalities from individuals with schizophrenia [J].
Alves, Caroline L. ;
Toutain, Thaise G. L. de O. ;
Porto, Joel Augusto Moura ;
Aguiar, Patricia Maria de Carvalho ;
de Sena, Eduardo Ponde ;
Rodrigues, Francisco A. ;
Pineda, Aruane M. ;
Thielemann, Christiane .
JOURNAL OF NEURAL ENGINEERING, 2023, 20 (05)
[4]   Multi-view manifold learning of human brain-state trajectories [J].
Busch, Erica L. ;
Huang, Jessie ;
Benz, Andrew ;
Wallenstein, Tom ;
Lajoie, Guillaume ;
Wolf, Guy ;
Krishnaswamy, Smita ;
Turk-Browne, Nicholas B. .
NATURE COMPUTATIONAL SCIENCE, 2023, 3 (03) :240-253
[5]   Generalizability of machine learning for classification of schizophrenia based on resting-state functional MRI data [J].
Cai, Xin-Lu ;
Xie, Dong-Jie ;
Madsen, Kristoffer H. ;
Wang, Yong-Ming ;
Bogemann, Sophie Alida ;
Cheung, Eric F. C. ;
Moller, Arne ;
Chan, Raymond C. K. .
HUMAN BRAIN MAPPING, 2020, 41 (01) :172-184
[6]  
Calhas D., 2023, medRxiv, P2023, DOI [10.1101/2023.08.07.23293748, DOI 10.1101/2023.08.07.23293748]
[7]   A nexus model of the temporal-parietal junction [J].
Carter, R. McKell ;
Huettel, Scott A. .
TRENDS IN COGNITIVE SCIENCES, 2013, 17 (07) :328-336
[8]   Distinguishing schizophrenia and bipolar disorder through a Multiclass Classification model based on multimodal neuroimaging data [J].
Chen, Ming ;
Xia, Xiaowei ;
Kang, Zhuang ;
Li, Zhinan ;
Dai, Jiamin ;
Wu, Junyan ;
Chen, Cai ;
Qiu, Yong ;
Liu, Tong ;
Liu, Yanxi ;
Zhang, Ziyi ;
Shen, Qingni ;
Tao, Sichu ;
Deng, Zixin ;
Lin, Ying ;
Wei, Qinling .
JOURNAL OF PSYCHIATRIC RESEARCH, 2024, 172 :119-128
[9]  
Chevyrev I, 2025, Arxiv, DOI [arXiv:1603.03788, DOI 10.48550/ARXIV.1603.03788]
[10]  
Coeurjolly D., 2001, Visual Form 2001. 4th International Workshop on Visual Form IWVF4. Proceedings (Lecture Notes in Computer Science Vol.2059), P303