Molecular Cavity Topological Representation for Pattern Analysis: A NLP Analogy-Based Word2Vec Method

被引:7
作者
Guo, Dongliang [1 ,2 ]
Wang, Qiaoqiao [1 ]
Liang, Meng [1 ]
Liu, Wei [3 ]
Nie, Junlan [1 ,4 ]
机构
[1] Yanshan Univ, Sch Informat Sci & Engn, Qinhuangdao 066004, Hebei, Peoples R China
[2] Key Lab Software Engn Hebei Prov, Qinhuangdao 066004, Hebei, Peoples R China
[3] Univ Technol Sydney, Fac Engn & IT, Adv Analyt Inst, Ultimo, NSW 2007, Australia
[4] Key Lab Comp Virtual Technol & Syst Integrat Hebe, Qinhuangdao 066004, Hebei, Peoples R China
基金
美国国家科学基金会;
关键词
molecular cavity; topological representation; Word2Vec model; analogy-based methods; ALGORITHMS; EFFICIENT; CHANNELS; DYNAMICS;
D O I
10.3390/ijms20236019
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Cavity analysis in molecular dynamics is important for understanding molecular function. However, analyzing the dynamic pattern of molecular cavities remains a difficult task. In this paper, we propose a novel method to topologically represent molecular cavities by vectorization. First, a characterization of cavities is established through Word2Vec model, based on an analogy between the cavities and natural language processing (NLP) terms. Then, we use some techniques such as dimension reduction and clustering to conduct an exploratory analysis of the vectorized molecular cavity. On a real data set, we demonstrate that our approach is applicable to maintain the topological characteristics of the cavity and can find the change patterns from a large number of cavities.
引用
收藏
页数:14
相关论文
共 35 条
[1]   Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics [J].
Asgari, Ehsaneddin ;
Mofrad, Mohammad R. K. .
PLOS ONE, 2015, 10 (11)
[2]   cite2vec: Citation-Driven Document Exploration via Word Embeddings [J].
Berger, Matthew ;
McDonough, Katherine ;
Seversky, Lee M. .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2017, 23 (01) :691-700
[3]   Structural insight into the role of the ribosomal tunnel in cellular regulation [J].
Berisio, R ;
Schluenzen, F ;
Harms, J ;
Bashan, A ;
Auerbach, T ;
Baram, D ;
Yonath, A .
NATURE STRUCTURAL BIOLOGY, 2003, 10 (05) :366-370
[4]   CHEXVIS: a tool for molecular channel extraction and visualization [J].
Bin Masood, Talha ;
Sandhya, Sankaran ;
Chandra, Nagasuma ;
Natarajan, Vijay .
BMC BIOINFORMATICS, 2015, 16
[5]  
Burley SK, 2017, METHODS MOL BIOL, V1606, P627, DOI 10.1007/978-1-4939-7000-1_26
[6]   CAVER 3.0: A Tool for the Analysis of Transport Pathways in Dynamic Protein Structures [J].
Chovancova, Eva ;
Pavelka, Antonin ;
Benes, Petr ;
Strnad, Ondrej ;
Brezovsky, Jan ;
Kozlikova, Barbora ;
Gora, Artur ;
Sustr, Vilem ;
Klvana, Martin ;
Medek, Petr ;
Biedermannova, Lada ;
Sochor, Jiri ;
Damborsky, Jiri .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (10)
[7]  
Hinton G.E., 1986, Parallel Distributed Processing: Explorations in the Microstructure of Cognition, P77
[8]   CAVER Analyst 2.0: analysis and visualization of channels and tunnels in protein structures and molecular dynamics trajectories [J].
Jurcik, Adam ;
Bednar, David ;
Byska, Jan ;
Marques, Sergio M. ;
Furmanova, Katarina ;
Daniel, Lukas ;
Kokkonen, Piia ;
Brezovsky, Jan ;
Strnad, Ondrej ;
Stourac, Jan ;
Pavelka, Antonin ;
Manak, Martin ;
Damborsky, Jiri ;
Kozlikova, Barbora .
BIOINFORMATICS, 2018, 34 (20) :3586-3588
[9]   Visibility-Based Approach to Surface Detection of Tunnels in Proteins [J].
Jurcik, Adam ;
Byska, Jan ;
Sochor, Jiri ;
Kozlikova, Barbora .
PROCEEDINGS SCCG: 2015 31ST SPRING CONFERENCE ON COMPUTER GRAPHICS, 2015, :64-71
[10]   BetaCavityWeb: a webserver for molecular voids and channels [J].
Kim, Jae-Kwan ;
Cho, Youngsong ;
Lee, Mokwon ;
Laskowski, Roman A. ;
Ryu, Seong Eon ;
Sugihara, Kokichi ;
Kim, Deok-Soo .
NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) :W413-W418