Convolutional neural networks as an alternative to Bayesian retrievals for interpreting exoplanet transmission spectra

被引:13
|
作者
Martinez, F. Ardevol [1 ,2 ,3 ,4 ]
Min, M. [2 ]
Kamp, I [1 ]
Palmer, P., I [3 ,4 ]
机构
[1] Univ Groningen, Kapteyn Astron Inst, Groningen, Netherlands
[2] Netherlands Space Res Inst SRON, Leiden, Netherlands
[3] Univ Edinburgh, Ctr Exoplanet Sci, Edinburgh, Midlothian, Scotland
[4] Univ Edinburgh, Sch GeoSci, Edinburgh, Midlothian, Scotland
关键词
planets and satellites: atmospheres; planets and satellites: gaseous planets; planets and satellites: composition; ATMOSPHERES; RESOLUTION; INFERENCE; CHEMISTRY; IMPACT; HOT;
D O I
10.1051/0004-6361/202142976
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
Context. Exoplanet observations are currently analysed with Bayesian retrieval techniques to constrain physical and chemical properties of their atmospheres. Due to the computational load of the models used to analyse said observations, a compromise is usually needed between model complexity and computing time. Analyses of observational data from future facilities, such as the James Webb Space Telescope (JWST), will require more complex models, and this will increase the computational load of retrievals, prompting the search for a faster approach for interpreting exoplanet observations. Aims. Our goal is to compare machine learning retrievals of exoplanet transmission spectra with nested sampling (Bayesian retrieval) and to understand if machine learning can be as reliable as a Bayesian retrieval for a statistically significant sample of spectra while being orders of magnitude faster. Methods. We generated grids of synthetic transmission spectra and their corresponding planetary and atmospheric parameters, with one using free chemistry models and the other using equilibrium chemistry models. Each grid was subsequently rebinned to simulate both Hubble Space Telescope, Wide Field Camera 3 (WFC3), and JWST Near-InfraRed Spectrograph observations, yielding four datasets in total. Convolutional neural networks (CNNs) were trained with each of the datasets. We performed retrievals for a set of 1000 simulated observations for each combination of model type and instrument with nested sampling and machine learning. We also used both methods to perform retrievals for real WFC3 transmission spectra of 48 exoplanets. Additionally, we carried out experiments to test how robust machine learning and nested sampling are against incorrect assumptions in our models. Results. Convolutional neural networks reached a lower coefficient of determination between predicted and true values of the parameters. Neither CNNs nor nested sampling systematically reached a lower bias for all parameters. Nested sampling underestimated the uncertainty in similar to 8% of retrievals, whereas CNNs correctly estimated the uncertainties. When performing retrievals for real WFC3 observations, nested sampling and machine learning agreed within 2 sigma for similar to 86% of spectra. When doing retrievals with incorrect assumptions, nested sampling underestimated the uncertainty in similar to 12% to similar to 41% of cases, whereas for the CNNs this fraction always remained below similar to 10%.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] A Multilayer Network- Based Approach for Interpreting and Compressing Convolutional Neural Networks
    Virgili, Luca
    Amelio, Alessia
    ERCIM NEWS, 2023, (134):
  • [22] Interpreting Convolutional Neural Networks via Layer-Wise Relevance Propagation
    Jia, Wohuan
    Zhang, Shaoshuai
    Jiang, Yue
    Xu, Li
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 457 - 467
  • [23] Convolutional neural networks for the reconstruction of spectra in compressive sensing spectrometers
    Kim, Cheolsun
    Park, Dongju
    Lee, Heung-No
    OPTICAL DATA SCIENCE II, 2019, 10937
  • [24] Prediction of urban flood inundation using Bayesian convolutional neural networks
    Zheng, Xiang
    Zheng, Minling
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2024, 38 (11) : 4485 - 4500
  • [25] Hyperparameter Optimization for Convolutional Neural Networks with Genetic Algorithms and Bayesian Optimization
    Puentes G, David E.
    Barrios H, Carlos J.
    Navaux, Philippe O. A.
    2022 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2022, : 131 - 135
  • [26] Bayesian Graph Convolutional Neural Networks for Semi-Supervised Classification
    Zhang, Yingxue
    Pal, Soumyasundar
    Coates, Mark
    Ustebay, Deniz
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5829 - 5836
  • [27] Where to model the epistemic uncertainty of Bayesian convolutional neural networks for classification
    Tai, Yuan
    Tan, Yihua
    Zou, Erbo
    Lei, Bo
    Fan, Qiang
    He, Yizhou
    NEUROCOMPUTING, 2024, 583
  • [28] Uncertainty Assisted Robust Tuberculosis Identification With Bayesian Convolutional Neural Networks
    Ul Abideen, Zain
    Ghafoor, Mubeen
    Munir, Kamran
    Saqib, Madeeha
    Ullah, Ata
    Zia, Tehseen
    Tariq, Syed Ali
    Ahmed, Ghufran
    Zahra, Asma
    IEEE ACCESS, 2020, 8 (08): : 22812 - 22825
  • [29] Uplink NOMA signal transmission with convolutional neural networks approach
    LIN Chuan
    CHANG Qing
    LI Xianxu
    JournalofSystemsEngineeringandElectronics, 2020, 31 (05) : 890 - 898
  • [30] Slope reliability analysis using Bayesian optimized convolutional neural networks
    Lin, Houlai
    Li, Liang
    Meng, Kaiqi
    Li, Chunli
    Xu, Liang
    Liu, Zhiliang
    Lu, Shibao
    ENGINEERING COMPUTATIONS, 2022, 39 (08) : 3012 - 3037