Automatic Survey-invariant Classification of Variable Stars

被引:11
作者
Benavente, Patricio [1 ]
Protopapas, Pavlos [2 ]
Pichara, Karim [1 ,2 ,3 ]
机构
[1] Pontificia Univ Catolica Chile, Sch Engn, Comp Sci Dept, Santiago, Chile
[2] Harvard Univ, Inst Appl Computat Sci, Cambridge, MA 02138 USA
[3] Millennium Inst Astrophys, Santiago, Chile
关键词
methods: data analysis; methods: statistical; stars: statistics; stars: variables: general; SPECTRAL CLASSIFICATION; DOMAIN ADAPTATION; GALAXIES;
D O I
10.3847/1538-4357/aa7f2d
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
Machine learning techniques have been successfully used to classify variable stars on widely studied astronomical surveys. These data sets have been available to astronomers long enough, thus allowing them to perform deep analysis over several variable sources and generating useful catalogs with identified variable stars. The products of these studies are labeled data that enable supervised learning models to be trained successfully. However, when these models are blindly applied to data from new sky surveys, their performance drops significantly. Furthermore, unlabeled data become available at a much higher rate than their labeled counterpart, since labeling is a manual and time-consuming effort. Domain adaptation techniques aim to learn from a domain where labeled data are available, the source domain, and through some adaptation perform well on a different domain, the target domain. We propose a full probabilistic model that represents the joint distribution of features from two surveys, as well as a probabilistic transformation of the features from one survey to the other. This allows us to transfer labeled data to a study where they are not available and to effectively run a variable star classification model in a new survey. Our model represents the features of each domain as a Gaussian mixture and models the transformation as a translation, rotation, and scaling of each separate component. We perform tests using three different variability catalogs, EROS, MACHO, and HiTS, presenting differences among them, such as the number of observations per star, cadence, observational time, and optical bands observed, among others.
引用
收藏
页数:18
相关论文
共 58 条
  • [11] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [12] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [13] PHOTOMETRIC REDSHIFTS FOR QUASARS IN MULTI-BAND SURVEYS
    Brescia, M.
    Cavuoti, S.
    D'Abrusco, R.
    Longo, G.
    Mercurio, A.
    [J]. ASTROPHYSICAL JOURNAL, 2013, 772 (02)
  • [14] Spectral classification and luminosity function of galaxies in the Las Campanas Redshift Survey
    Bromley, BC
    Press, WH
    Lin, H
    Kirshner, RP
    [J]. ASTROPHYSICAL JOURNAL, 1998, 505 (01) : 25 - 36
  • [15] Photometric classification of emission line galaxies with machine-learning methods
    Cavuoti, Stefano
    Brescia, Massimo
    D'Abrusco, Raffaele
    Longo, Giuseppe
    Paolillo, Maurizio
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2014, 437 (01) : 968 - 975
  • [16] Chan YS, 2005, 19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), P1010
  • [17] Statistical methods of automatic spectral classification and their application to the Hamburg/ESO Survey
    Christlieb, N
    Wisotzki, L
    Grasshoff, G
    [J]. ASTRONOMY & ASTROPHYSICS, 2002, 391 (01) : 397 - 406
  • [18] Automated Solar Activity Prediction: A hybrid computer platform using machine learning and solar imaging for automated prediction of solar flares
    Colak, T.
    Qahwaji, R.
    [J]. SPACE WEATHER-THE INTERNATIONAL JOURNAL OF RESEARCH AND APPLICATIONS, 2009, 7
  • [19] ANNz:: Estimating photometric redshifts using artificial neural networks
    Collister, AA
    Lahav, O
    [J]. PUBLICATIONS OF THE ASTRONOMICAL SOCIETY OF THE PACIFIC, 2004, 116 (818) : 345 - 351
  • [20] Cook KH, 1995, IAU SYMP, P357