共 50 条
AUTOMATIC CLASSIFICATION OF VARIABLE STARS IN CATALOGS WITH MISSING DATA
被引:33
|作者:
Pichara, Karim
[1
,2
,3
]
Protopapas, Pavlos
[2
,4
]
机构:
[1] Pontificia Univ Catolica Chile, Dept Comp Sci, Santiago, Chile
[2] Harvard Univ, Inst Appl Computat Sci, Cambridge, MA 02138 USA
[3] Milky Way Millennium Nucleus, Santiago 7820436, Chile
[4] Harvard Smithsonian Ctr Astrophys, Cambridge, MA 02138 USA
关键词:
methods: data analysis;
stars: statistics;
stars: variables: general;
LARGE-MAGELLANIC-CLOUD;
TIME-SERIES;
CANDIDATES;
SELECTION;
D O I:
10.1088/0004-637X/777/2/83
中图分类号:
P1 [天文学];
学科分类号:
0704 ;
摘要:
We present an automatic classification method for astronomical catalogs with missing data. We use Bayesian networks and a probabilistic graphical model that allows us to perform inference to predict missing values given observed data and dependency relationships between variables. To learn a Bayesian network from incomplete data, we use an iterative algorithm that utilizes sampling methods and expectation maximization to estimate the distributions and probabilistic dependencies of variables from data with missing values. To test our model, we use three catalogs with missing data (SAGE, Two Micron All Sky Survey, and UBVI) and one complete catalog (MACHO). We examine how classification accuracy changes when information from missing data catalogs is included, how our method compares to traditional missing data approaches, and at what computational cost. Integrating these catalogs with missing data, we find that classification of variable objects improves by a few percent and by 15% for quasar detection while keeping the computational cost the same.
引用
收藏
页数:10
相关论文