Machine learning deciphers structural features of RNA duplexes measured with solution X-ray scattering

被引:9
作者
Chen, Yen-Lin [1 ]
Pollack, Lois [1 ]
机构
[1] Cornell Univ, Sch Appl & Engn Phys, Ithaca, NY 14853 USA
基金
美国国家科学基金会;
关键词
ribonucleic acids; machine learning; solution X-ray scattering; wide-angle X-ray scattering; computational modelling; SMALL-ANGLE SCATTERING; MOLECULAR-DYNAMICS; WEB SERVER; MACROMOLECULES;
D O I
10.1107/S2052252520008830
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Macromolecular structures can be determined from solution X-ray scattering. Small-angle X-ray scattering (SAXS) provides global structural information on length scales of 10s to 100s of Angstroms, and many algorithms are available to convert SAXS data into low-resolution structural envelopes. Extension of measurements to wider scattering angles (WAXS or wide-angle X-ray scattering) can sharpen the resolution to below 10 angstrom, filling in structural details that can be critical for biological function. These WAXS profiles are especially challenging to interpret because of the significant contribution of solvent in addition to solute on these smaller length scales. Based on training with molecular dynamics generated models, the application of extreme gradient boosting (XGBoost) is discussed, which is a supervised machine learning (ML) approach to interpret features in solution scattering profiles. These ML methods are applied to predict key structural parameters of double-stranded ribonucleic acid (dsRNA) duplexes. Duplex conformations vary with salt and sequence and directly impact the foldability of functional RNA molecules. The strong structural periodicities in these duplexes yield scattering profiles with rich sets of features at intermediate-to-wide scattering angles. In the ML models, these profiles are treated as 1D images or features. These ML models identify specific scattering angles, or regions of scattering angles, which correspond with and successfully predict distinct structural parameters. Thus, this work demonstrates that ML strategies can integrate theoretical molecular models with experimental solution scattering data, providing a new framework for extracting highly relevant structural information from solution experiments on biological macromolecules.
引用
收藏
页码:870 / 880
页数:11
相关论文
共 43 条
[1]   SoftWAXS: a computational tool for modeling wide-angle X-ray solution scattering from biomolecules [J].
Bardhan, Jaydeep ;
Park, Sanghyun ;
Makowski, Lee .
JOURNAL OF APPLIED CRYSTALLOGRAPHY, 2009, 42 :932-943
[2]   Structural characterization of flexible proteins using small-angle X-ray scattering [J].
Bernado, Pau ;
Mylonas, Efstratios ;
Petoukhov, Maxim V. ;
Blackledge, Martin ;
Svergun, Dmitri I. .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2007, 129 (17) :5656-5664
[3]  
Bezanson J., 2012, SIAM REV, V59, P1
[4]   CURVES plus web server for analyzing and visualizing the helical, backbone and groove parameters of nucleic acid structures [J].
Blanchet, Christophe ;
Pasi, Marco ;
Zakrzewska, Krystyna ;
Lavery, Richard .
NUCLEIC ACIDS RESEARCH, 2011, 39 :W68-W73
[5]   Small-Angle X-Ray Scattering on Biological Macromolecules and Nanocomposites in Solution [J].
Blanchet, Clement E. ;
Svergun, Dmitri I. .
ANNUAL REVIEW OF PHYSICAL CHEMISTRY, VOL 64, 2013, 64 :37-54
[6]   INVITRO SPLICING OF THE RIBOSOMAL-RNA PRECURSOR OF TETRAHYMENA - INVOLVEMENT OF A GUANOSINE NUCLEOTIDE IN THE EXCISION OF THE INTERVENING SEQUENCE [J].
CECH, TR ;
ZAUG, AJ ;
GRABOWSKI, PJ .
CELL, 1981, 27 (03) :487-496
[7]   Interpretation of Solution X-Ray Scattering by Explicit-Solvent Molecular Dynamics [J].
Chen, Po-chia ;
Hub, Jochen S. .
BIOPHYSICAL JOURNAL, 2015, 108 (10) :2573-2584
[8]   Validating Solution Ensembles from Molecular Dynamics Simulation by Wide-Angle X-ray Scattering Data [J].
Chen, Po-chia ;
Hub, Jochen S. .
BIOPHYSICAL JOURNAL, 2014, 107 (02) :435-447
[9]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[10]   Salt Dependence of A-Form RNA Duplexes: Structures and Implications [J].
Chen, Yen-Lin ;
Pollack, Lois .
JOURNAL OF PHYSICAL CHEMISTRY B, 2019, 123 (46) :9773-9785