Discovery of Self-Assembling π-Conjugated Peptides by Active Learning-Directed Coarse-Grained Molecular Simulation

被引:87
作者
Shmilovich, Kirill [1 ]
Mansbach, Rachael A. [2 ]
Sidky, Hythem [1 ]
Dunne, Olivia E. [1 ]
Panda, Sayak Subhra [3 ,4 ]
Tovar, John D. [5 ,6 ]
Ferguson, Andrew L. [1 ]
机构
[1] Univ Chicago, Pritzker Sch Mol Engn, Chicago, IL 60637 USA
[2] Los Alamos Natl Lab, Theoret Biol & Biophys Grp, Los Alamos, NM 87545 USA
[3] Johns Hopkins Univ, Dept Chem, Baltimore, MD 21218 USA
[4] Johns Hopkins Univ, Inst NanoBioTechnol, Baltimore, MD 21218 USA
[5] Johns Hopkins Univ, Inst NanoBioTechnol, Dept Chem, Baltimore, MD 21218 USA
[6] Johns Hopkins Univ, Dept Mat Sci & Engn, Baltimore, MD 21218 USA
基金
美国国家科学基金会;
关键词
MESOSCALE SIMULATION; EXPERIMENTAL-DESIGN; FORCE-FIELD; NANOSTRUCTURES; POLYMERS; MODEL; OLIGOPEPTIDES; OPTIMIZATION; AGGREGATION; PARAMETERS;
D O I
10.1021/acs.jpcb.0c00708
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Electronically active organic molecules have demonstrated great promise as novel soft materials for energy harvesting and transport. Self-assembled nanoaggregates formed from pi-conjugated oligopeptides composed of an aromatic core flanked by oligopeptide wings offer emergent optoelectronic properties within a water-soluble and biocompatible substrate. Nanoaggregate properties can be controlled by tuning core chemistry and peptide composition, but the sequence-structure-function relations remain poorly characterized. In this work, we employ coarse-grained molecular dynamics simulations within an active learning protocol employing deep representational learning and Bayesian optimization to efficiently identify molecules capable of assembling pseudo-1D nanoaggregates with good stacking of the electronically active pi-cores. We consider the DXXX-OPV3-XXXD oligopeptide family, where D is an Asp residue and OPV3 is an oligophenylenevinylene oligomer (1,4-distyrylbenzene), to identify the top performing XXX tripeptides within all 20(3) = 8000 possible sequences. By direct simulation of only 2.3% of this space, we identify molecules predicted to exhibit superior assembly relative to those reported in prior work. Spectral clustering of the top candidates reveals new design rules governing assembly. This work establishes new understanding of DXXX-OPV3-XXXD assembly, identifies promising new candidates for experimental testing, and presents a computational design platform that can be generically extended to other peptide-based and peptide-like systems.
引用
收藏
页码:3873 / 3891
页数:19
相关论文
共 105 条
[1]   Gromacs: High performance molecular simulations through multi-level parallelism from laptops to supercomputers [J].
Abraham, Mark James ;
Murtola, Teemu ;
Schulz, Roland ;
Páll, Szilárd ;
Smith, Jeremy C. ;
Hess, Berk ;
Lindah, Erik .
SoftwareX, 2015, 1-2 :19-25
[2]  
[Anonymous], ARXIV150406329
[3]  
[Anonymous], 2006, Data_analysis:_a_Bayesian_tutorial
[4]  
[Anonymous], STRUCTURE PROPERTIES
[5]  
[Anonymous], ARXIV170704572
[6]  
[Anonymous], 2014, ARXIV14095165
[7]  
[Anonymous], 2010, COMPUT SCI
[8]  
[Anonymous], 2000, LESS IS MOREZ ACTIVE
[9]  
[Anonymous], 2006, NIPS
[10]  
[Anonymous], THESIS