Machine and Deep Learning applied to galaxy morphology - A comparative study

Cited by: 78
Authors
Barchi, P. H. [1, 2]
de Carvalho, R. R. [3, 4]
Rosa, R. R. [1]
Sautter, R. A. [1]
Soares-Santos, M. [2]
Marques, B. A. D. [5]
Clua, E. [5]
Goncalves, T. S. [6]
de Sa-Freitas, C. [6]
Moura, T. C. [7]
Affiliations
[1] Natl Inst Space Res INPE, Lab Comp & Appl Math, Av Astronautas 1-758, BR-12227010 Sao Jose Dos Campos, SP, Brazil
[2] Brandeis Univ, Phys Dept, Waltham, MA 02254 USA
[3] Univ Cidade Sao Paulo, NAT Univ Cruzeiro Sul, Sao Paulo, Brazil
[4] Natl Inst Space Res INPE, Astrophys Div, Sao Jose Dos Campos, Brazil
[5] Fed Fluminense Univ UFF, Inst Comp, Niteroi, RJ, Brazil
[6] Fed Univ Rio De Janeiro UFRJ, Valongo Observ, Rio De Janeiro, Brazil
[7] Sao Paulo Univ USP, Inst Astron Geofis & Ciencias Atmosfer IAG, Sao Paulo, Brazil
Funding
São Paulo Research Foundation (FAPESP), Brazil;
Keywords
Galaxies: photometry; Methods: data analysis; Machine learning; Techniques: image processing; Galaxies: General; Catalogs; GRADIENT PATTERN-ANALYSIS; STAR-FORMATION; CLASSIFICATIONS; ZOO; PARAMETERS; NEARBY;
DOI
10.1016/j.ascom.2019.100334
Chinese Library Classification (CLC) number
P1 [Astronomy];
Discipline classification code
0704;
Abstract
Morphological classification is a key piece of information for defining samples of galaxies used to study the large-scale structure of the universe. In essence, the challenge is to build a robust methodology that yields a reliable morphological estimate from galaxy images. Here, we investigate how to substantially improve galaxy classification within large datasets by mimicking human classification. We combine accurate visual classifications from the Galaxy Zoo project with machine and deep learning methodologies. We propose two distinct approaches for galaxy morphology: one based on non-parametric morphology and traditional machine learning algorithms, and another based on Deep Learning. To measure the input features for the traditional machine learning methodology, we have developed a system called CyMorph, with a novel non-parametric approach to study galaxy morphology. The main dataset employed comes from the Sloan Digital Sky Survey Data Release 7 (SDSS-DR7). We also discuss the class imbalance problem considering three classes. The performance of each model is mainly measured by Overall Accuracy (OA). A spectroscopic validation with astrophysical parameters is also provided for the Decision Tree models to assess the quality of our morphological classification. In all of our samples, both the Deep and Traditional Machine Learning approaches achieve over 94.5% OA when classifying galaxies into two classes (elliptical and spiral). We compare our classification with state-of-the-art morphological classifications from the literature. Considering only the two-class separation, we achieve 99% overall accuracy on average when using our deep learning models, and 82% when using three classes. We provide a catalog of 670,560 galaxies containing our best results, including morphological metrics and classification. © 2019 Elsevier B.V. All rights reserved.
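As a reading aid for the Overall Accuracy (OA) metric named in the abstract, the following minimal sketch computes OA as the fraction of correctly classified objects in a confusion matrix, alongside per-class recall, which is the kind of quantity one would inspect when classes are imbalanced. The function names and the confusion-matrix counts are hypothetical and purely illustrative; they are not taken from the paper or from CyMorph.

```python
def overall_accuracy(confusion):
    """Fraction of objects on the diagonal of a square confusion matrix."""
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    total = sum(sum(row) for row in confusion)
    return correct / total


def per_class_recall(confusion):
    """Recall of each class: diagonal count divided by the row (true-class) total."""
    return [row[i] / sum(row) for i, row in enumerate(confusion)]


# Hypothetical counts (rows: true class, columns: predicted class).
# The sample is deliberately imbalanced: 100 ellipticals vs. 900 spirals.
confusion = [
    [90, 10],    # true elliptical
    [20, 880],   # true spiral
]

oa = overall_accuracy(confusion)
recall = per_class_recall(confusion)
print(f"OA = {oa:.3f}")
print("per-class recall =", [round(r, 3) for r in recall])
```

With imbalanced samples, OA is dominated by the majority class, so reporting per-class recall next to it can reveal a weak minority-class result that a single OA figure would hide.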
Pages: 17