Convolutional neural networks as an alternative to Bayesian retrievals for interpreting exoplanet transmission spectra

被引:13
|
作者
Martinez, F. Ardevol [1 ,2 ,3 ,4 ]
Min, M. [2 ]
Kamp, I [1 ]
Palmer, P., I [3 ,4 ]
机构
[1] Univ Groningen, Kapteyn Astron Inst, Groningen, Netherlands
[2] Netherlands Space Res Inst SRON, Leiden, Netherlands
[3] Univ Edinburgh, Ctr Exoplanet Sci, Edinburgh, Midlothian, Scotland
[4] Univ Edinburgh, Sch GeoSci, Edinburgh, Midlothian, Scotland
关键词
planets and satellites: atmospheres; planets and satellites: gaseous planets; planets and satellites: composition; ATMOSPHERES; RESOLUTION; INFERENCE; CHEMISTRY; IMPACT; HOT;
D O I
10.1051/0004-6361/202142976
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
Context. Exoplanet observations are currently analysed with Bayesian retrieval techniques to constrain physical and chemical properties of their atmospheres. Due to the computational load of the models used to analyse said observations, a compromise is usually needed between model complexity and computing time. Analyses of observational data from future facilities, such as the James Webb Space Telescope (JWST), will require more complex models, and this will increase the computational load of retrievals, prompting the search for a faster approach for interpreting exoplanet observations. Aims. Our goal is to compare machine learning retrievals of exoplanet transmission spectra with nested sampling (Bayesian retrieval) and to understand if machine learning can be as reliable as a Bayesian retrieval for a statistically significant sample of spectra while being orders of magnitude faster. Methods. We generated grids of synthetic transmission spectra and their corresponding planetary and atmospheric parameters, with one using free chemistry models and the other using equilibrium chemistry models. Each grid was subsequently rebinned to simulate both Hubble Space Telescope, Wide Field Camera 3 (WFC3), and JWST Near-InfraRed Spectrograph observations, yielding four datasets in total. Convolutional neural networks (CNNs) were trained with each of the datasets. We performed retrievals for a set of 1000 simulated observations for each combination of model type and instrument with nested sampling and machine learning. We also used both methods to perform retrievals for real WFC3 transmission spectra of 48 exoplanets. Additionally, we carried out experiments to test how robust machine learning and nested sampling are against incorrect assumptions in our models. Results. Convolutional neural networks reached a lower coefficient of determination between predicted and true values of the parameters. Neither CNNs nor nested sampling systematically reached a lower bias for all parameters. Nested sampling underestimated the uncertainty in similar to 8% of retrievals, whereas CNNs correctly estimated the uncertainties. When performing retrievals for real WFC3 observations, nested sampling and machine learning agreed within 2 sigma for similar to 86% of spectra. When doing retrievals with incorrect assumptions, nested sampling underestimated the uncertainty in similar to 12% to similar to 41% of cases, whereas for the CNNs this fraction always remained below similar to 10%.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] The Importance of Optical Wavelength Data on Atmospheric Retrievals of Exoplanet Transmission Spectra
    Fairman, Charlotte
    Wakeford, Hannah R.
    Macdonald, Ryan J.
    ASTRONOMICAL JOURNAL, 2024, 167 (05):
  • [2] Exoplanet cartography using convolutional neural networks
    Meinke, K.
    Stam, D. M.
    Visser, P. M.
    ASTRONOMY & ASTROPHYSICS, 2022, 664
  • [3] INTERPRETING CONVOLUTIONAL NEURAL NETWORKS BY EXPLAINING THEIR PREDICTIONS
    Meynen, Toon
    Behzadi-Khormouji, Hamed
    Oramas, Jose
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1685 - 1689
  • [4] Interpreting Adversarially Trained Convolutional Neural Networks
    Zhang, Tianyuan
    Zhu, Zhanxing
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [5] Bayesian Hierarchical Convolutional Neural Networks
    Bensen, Alexis
    Kahana, Adam
    Woods, Zerotti
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [6] Computing Transiting Exoplanet Parameters with 1D Convolutional Neural Networks
    Iglesias Alvarez, Santiago
    Diez Alonso, Enrique
    Sanchez Rodriguez, Maria Luisa
    Rodriguez Rodriguez, Javier
    Perez Fernandez, Saul
    de Cos Juez, Francisco Javier
    AXIOMS, 2024, 13 (02)
  • [7] Interpreting wde-band neural activity using convolutional neural networks
    Frey, Markus
    Tanni, Sander
    Perrodin, Catherine
    O'Leary, Alice
    Nau, Matthias
    Kelly, Jack
    Banino, Andrea
    Bendor, Daniel
    Lefort, Julie
    Doeller, Christian F.
    Barry, Caswell
    ELIFE, 2021, 10
  • [8] Bayesian Convolutional Neural Networks for Seismic Facies Classification
    Feng, Runhai
    Balling, Niels
    Grana, Dario
    Dramsch, Jesper Soren
    Hansen, Thomas Mejer
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (10): : 8933 - 8940
  • [9] Application of convolutional neural networks based on Bayesian optimization to landslide susceptibility mapping of transmission tower foundation
    Mansheng Lin
    Shuai Teng
    Gongfa Chen
    Bo Hu
    Bulletin of Engineering Geology and the Environment, 2023, 82
  • [10] Application of convolutional neural networks based on Bayesian optimization to landslide susceptibility mapping of transmission tower foundation
    Lin, Mansheng
    Teng, Shuai
    Chen, Gongfa
    Hu, Bo
    BULLETIN OF ENGINEERING GEOLOGY AND THE ENVIRONMENT, 2023, 82 (02)