Fast and Scalable Gaussian Process Modeling with Applications to Astronomical Time Series

被引:717
作者
Foreman-Mackey, Daniel [1 ,2 ]
Agol, Eric [1 ]
Ambikasaran, Sivaram [3 ]
Angus, Ruth [4 ]
机构
[1] Univ Washington, Dept Astron, Seattle, WA 98195 USA
[2] Flatiron Inst, Ctr Computat Astrophys, 162 5th Ave,6th Floor, New York, NY 10010 USA
[3] Indian Inst Sci, Dept Computat & Data Sci, Bangalore, Karnataka, India
[4] Columbia Univ, Dept Astron, 550 W 120th St, New York, NY 10027 USA
基金
美国国家科学基金会;
关键词
asteroseismology; methods: data analysis; methods: statistical; planetary systems; stars: rotation; PROCESS FRAMEWORK; LIGHT CURVES; KEPLER; VARIABILITY; EFFICIENT; ASTEROSEISMOLOGY; OSCILLATIONS; ALGORITHM; BINARIES; SPECTRA;
D O I
10.3847/1538-3881/aa9332
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
The growing field of large-scale time domain astronomy requires methods for probabilistic data analysis that are computationally tractable, even with large data sets. Gaussian processes (GPs) are a popular class of models used for this purpose, but since the computational cost scales, in general, as the cube of the number of data points, their application has been limited to small data sets. In this paper, we present a novel method for GPs modeling in one dimension where the computational requirements scale linearly with the size of the data set. We demonstrate the method by applying it to simulated and real astronomical time series data sets. These demonstrations are examples of probabilistic inference of stellar rotation periods, asteroseismic oscillation spectra, and transiting planet parameters. The method exploits structure in the problem when the covariance function is expressed as a mixture of complex exponentials, without requiring evenly spaced observations or uniform noise. This form of covariance arises naturally when the process is a mixture of stochastically driven damped harmonic oscillators-providing a physical motivation for and interpretation of this choice-but we also demonstrate that it can be a useful effective model in some other cases. We present a mathematical description of the method and compare it to existing scalable GP methods. The method is fast and interpretable, with a range of potential applications within astronomical data analysis and beyond. We provide well-tested and documented open-source implementations of this method in C++, Python, and Julia.
引用
收藏
页数:21
相关论文
共 103 条
[1]   On detecting terrestrial planets with timing of giant planet transits [J].
Agol, E ;
Steffen, J ;
Sari, R ;
Clarkson, W .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2005, 359 (02) :567-579
[2]   K2SC: flexible systematics correction and detrending of K2 light curves using Gaussian process regression [J].
Aigrain, S. ;
Parviainen, H. ;
Pope, B. J. S. .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2016, 459 (03) :2408-2419
[3]   Testing the recovery of stellar rotation signals from Kepler light curves using a blind hare-and-hounds exercise [J].
Aigrain, S. ;
Llama, J. ;
Ceillier, T. ;
das Chagas, M. L. ;
Davenport, J. R. A. ;
Garcia, R. A. ;
Hay, K. L. ;
Lanza, A. F. ;
McQuillan, A. ;
Mazeh, T. ;
de Medeiros, J. R. ;
Nielsen, M. B. ;
Reinhold, T. .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2015, 450 (03) :3211-3226
[4]   Generalized Rybicki Press algorithm [J].
Ambikasaran, Sivaram .
NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2015, 22 (06) :1102-1114
[5]   Fast Direct Methods for Gaussian Processes [J].
Ambikasaran, Sivaram ;
Foreman-Mackey, Daniel ;
Greengard, Leslie ;
Hogg, David W. ;
O'Neil, Michael .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) :252-265
[6]  
ANDERSON E., 1999, LAPACK USERSGUIDE, V3rd
[7]   MODELING OF SOLAR OSCILLATION POWER SPECTRA [J].
ANDERSON, ER ;
DUVALL, TL ;
JEFFERIES, SM .
ASTROPHYSICAL JOURNAL, 1990, 364 (02) :699-705
[8]  
Angus R., 2017, ARXIV170605459
[9]  
[Anonymous], 2013, INT C MACHINE LEARNI
[10]  
[Anonymous], 2015, ARXIV150303757