Convolutional neural networks as an alternative to Bayesian retrievals for interpreting exoplanet transmission spectra

被引:13
|
作者
Martinez, F. Ardevol [1 ,2 ,3 ,4 ]
Min, M. [2 ]
Kamp, I [1 ]
Palmer, P., I [3 ,4 ]
机构
[1] Univ Groningen, Kapteyn Astron Inst, Groningen, Netherlands
[2] Netherlands Space Res Inst SRON, Leiden, Netherlands
[3] Univ Edinburgh, Ctr Exoplanet Sci, Edinburgh, Midlothian, Scotland
[4] Univ Edinburgh, Sch GeoSci, Edinburgh, Midlothian, Scotland
关键词
planets and satellites: atmospheres; planets and satellites: gaseous planets; planets and satellites: composition; ATMOSPHERES; RESOLUTION; INFERENCE; CHEMISTRY; IMPACT; HOT;
D O I
10.1051/0004-6361/202142976
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
Context. Exoplanet observations are currently analysed with Bayesian retrieval techniques to constrain physical and chemical properties of their atmospheres. Due to the computational load of the models used to analyse said observations, a compromise is usually needed between model complexity and computing time. Analyses of observational data from future facilities, such as the James Webb Space Telescope (JWST), will require more complex models, and this will increase the computational load of retrievals, prompting the search for a faster approach for interpreting exoplanet observations. Aims. Our goal is to compare machine learning retrievals of exoplanet transmission spectra with nested sampling (Bayesian retrieval) and to understand if machine learning can be as reliable as a Bayesian retrieval for a statistically significant sample of spectra while being orders of magnitude faster. Methods. We generated grids of synthetic transmission spectra and their corresponding planetary and atmospheric parameters, with one using free chemistry models and the other using equilibrium chemistry models. Each grid was subsequently rebinned to simulate both Hubble Space Telescope, Wide Field Camera 3 (WFC3), and JWST Near-InfraRed Spectrograph observations, yielding four datasets in total. Convolutional neural networks (CNNs) were trained with each of the datasets. We performed retrievals for a set of 1000 simulated observations for each combination of model type and instrument with nested sampling and machine learning. We also used both methods to perform retrievals for real WFC3 transmission spectra of 48 exoplanets. Additionally, we carried out experiments to test how robust machine learning and nested sampling are against incorrect assumptions in our models. Results. Convolutional neural networks reached a lower coefficient of determination between predicted and true values of the parameters. Neither CNNs nor nested sampling systematically reached a lower bias for all parameters. Nested sampling underestimated the uncertainty in similar to 8% of retrievals, whereas CNNs correctly estimated the uncertainties. When performing retrievals for real WFC3 observations, nested sampling and machine learning agreed within 2 sigma for similar to 86% of spectra. When doing retrievals with incorrect assumptions, nested sampling underestimated the uncertainty in similar to 12% to similar to 41% of cases, whereas for the CNNs this fraction always remained below similar to 10%.
引用
收藏
页数:23
相关论文
共 50 条
  • [31] UNCERTAINTY ESTIMATION IN BAYESIAN CONVOLUTIONAL NEURAL NETWORKS FOR SAR SHIP CLASSIFICATION
    Al Hinai, Al Adil
    Guida, Rafaella
    2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2024), 2024, : 7604 - 7608
  • [32] Uplink NOMA signal transmission with convolutional neural networks approach
    Lin Chuan
    Chang Qing
    Li Xianxu
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2020, 31 (05) : 890 - 898
  • [33] Bayesian Weight Decay on Bounded Approximation for Deep Convolutional Neural Networks
    Park, Jung-Guk
    Jo, Sungho
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (09) : 2866 - 2875
  • [34] Uncertainty Quantification in Inverse Scattering Problems With Bayesian Convolutional Neural Networks
    Wei, Zhun
    Chen, Xudong
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2021, 69 (06) : 3409 - 3418
  • [35] Interpreting convolutional neural networks' low-dimensional approximation to quantum spin systems
    Ju, Yilong
    Alam, Shah Saad
    Minoff, Jonathan
    Anselmi, Fabio
    Pu, Han
    Patel, Ankit
    Physical Review Research, 7 (01):
  • [36] Pixel-wise confidence estimation for segmentation in Bayesian Convolutional Neural Networks
    Martin, Remi
    Duong, Luc
    MACHINE VISION AND APPLICATIONS, 2023, 34 (01)
  • [37] Fast-BCNN: Massive Neuron Skipping in Bayesian Convolutional Neural Networks
    Wan, Qiyu
    Fu, Xin
    2020 53RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 2020), 2020, : 229 - 240
  • [38] Pixel-wise confidence estimation for segmentation in Bayesian Convolutional Neural Networks
    Rémi Martin
    Luc Duong
    Machine Vision and Applications, 2023, 34
  • [39] Application of convolutional neural networks featuring Bayesian optimization for landslide susceptibility assessment
    Sameen, Maher Ibrahim
    Pradhan, Biswajeet
    Lee, Saro
    CATENA, 2020, 186
  • [40] Fast Bayesian gravitational wave parameter estimation using convolutional neural networks
    Andres-Carcasona, M.
    Martinez, M.
    Mir, Ll M.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2024, 527 (02) : 2887 - 2894