Galaxy Zoo: reproducing galaxy morphologies via machine learning☆

被引:153
作者
Banerji, Manda [1 ,2 ]
Lahav, Ofer [1 ]
Lintott, Chris J. [3 ]
Abdalla, Filipe B. [1 ]
Schawinski, Kevin [4 ,5 ]
Bamford, Steven P. [6 ]
Andreescu, Dan [7 ]
Murray, Phil [8 ]
Raddick, M. Jordan [9 ]
Slosar, Anze [10 ,11 ]
Szalay, Alex [9 ]
Thomas, Daniel [12 ]
Vandenberg, Jan [9 ]
机构
[1] UCL, Dept Phys & Astron, London WC1E 6BT, England
[2] Univ Cambridge, Inst Astron, Cambridge CB3 0HA, England
[3] Univ Oxford, Dept Phys, Oxford OX1 3RH, England
[4] Yale Univ, Dept Phys, New Haven, CT 06511 USA
[5] Yale Univ, Yale Ctr Astron & Astrophys, New Haven, CT 06520 USA
[6] Univ Nottingham, Sch Phys & Astron, Ctr Astron & Particle Theory, Nottingham NG7 2RD, England
[7] LinkLab, Bronx, NY 10471 USA
[8] Fingerprint Digital Media, Newtownards BT23 7GY, Co Down, North Ireland
[9] Johns Hopkins Univ, Dept Phys & Astron, Baltimore, MD 21218 USA
[10] Univ Calif Berkeley, Lawrence Berkeley Natl Lab, Berkeley Ctr Cosmol Phys, Berkeley, CA 94720 USA
[11] Univ Calif Berkeley, Dept Phys, Berkeley, CA 94720 USA
[12] Univ Portsmouth, Inst Cosmol & Gravitat, Portsmouth PO1 2EG, Hants, England
关键词
methods: data analysis; galaxies: general; DIGITAL-SKY-SURVEY; ARTIFICIAL NEURAL-NETWORKS; ESTIMATING PHOTOMETRIC REDSHIFTS; AUTOMATED CLASSIFICATION; STELLAR SPECTRA; COLOR;
D O I
10.1111/j.1365-2966.2010.16713.x
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
We present morphological classifications obtained using machine learning for objects in the Sloan Digital Sky Survey DR6 that have been classified by Galaxy Zoo into three classes, namely early types, spirals and point sources/artefacts. An artificial neural network is trained on a subset of objects classified by the human eye, and we test whether the machine-learning algorithm can reproduce the human classifications for the rest of the sample. We find that the success of the neural network in matching the human classifications depends crucially on the set of input parameters chosen for the machine-learning algorithm. The colours and parameters associated with profile fitting are reasonable in separating the objects into three classes. However, these results are considerably improved when adding adaptive shape parameters as well as concentration and texture. The adaptive moments, concentration and texture parameters alone cannot distinguish between early type galaxies and the point sources/artefacts. Using a set of 12 parameters, the neural network is able to reproduce the human classifications to better than 90 per cent for all three morphological classes. We find that using a training set that is incomplete in magnitude does not degrade our results given our particular choice of the input parameters to the network. We conclude that it is promising to use machine-learning algorithms to perform morphological classification for the next generation of wide-field imaging surveys and that the Galaxy Zoo catalogue provides an invaluable training set for such purposes.
引用
收藏
页码:342 / 353
页数:12
相关论文
共 29 条
[11]   Estimating photometric redshifts with artificial neural networks [J].
Firth, AE ;
Lahav, O ;
Somerville, RS .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2003, 339 (04) :1195-1202
[12]   An artificial neural network approach to the classification of galaxy spectra [J].
Folkes, SR ;
Lahav, O ;
Maddox, SJ .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1996, 283 (02) :651-665
[13]   A catalog of morphologically classified galaxies from the Sloan Digital Sky Survey: North equatorial region [J].
Fukugita, Masataka ;
Nakamura, Osamu ;
Okamura, Sadanori ;
Yasuda, Naoki ;
Barentine, John C. ;
Brinkmann, Jon ;
Gunn, James E. ;
Harvanek, Mike ;
Ichikawa, Takashi ;
Lupton, Robert H. ;
Schneider, Donald P. ;
Strauss, Michael A. ;
York, Donald G. .
ASTRONOMICAL JOURNAL, 2007, 134 (02) :579-593
[14]   Neural computation as a tool for galaxy classification: Methods and examples [J].
Lahav, O ;
Naim, A ;
Sodre, L ;
Storrie-Lombardi, MC .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1996, 283 (01) :207-221
[15]   GALAXIES, HUMAN EYES, AND ARTIFICIAL NEURAL NETWORKS [J].
LAHAV, O ;
NAIM, A ;
BUTA, RJ ;
CORWIN, HG ;
DEVAUCOULEURS, G ;
DRESSLER, A ;
HUCHRA, JP ;
VANDENBERGH, S ;
RAYCHAUDHURY, S ;
SODRE, L ;
STORRIE-LOMBARDI, MC .
SCIENCE, 1995, 267 (5199) :859-862
[16]   Galaxy Zoo: the large-scale spin statistics of spiral galaxies in the Sloan Digital Sky Survey [J].
Land, Kate ;
Slosar, Anze ;
Lintott, Chris ;
Andreescu, Dan ;
Bamford, Steven ;
Murray, Phil ;
Nichol, Robert ;
Raddick, M. Jordan ;
Schawinski, Kevin ;
Szalay, Alex ;
Thomas, Daniel ;
Vandenberg, Jan .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2008, 388 (04) :1686-1692
[17]   Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey [J].
Lintott, Chris J. ;
Schawinski, Kevin ;
Slosar, Anze ;
Land, Kate ;
Bamford, Steven ;
Thomas, Daniel ;
Raddick, M. Jordan ;
Nichol, Robert C. ;
Szalay, Alex ;
Andreescu, Dan ;
Murray, Phil ;
Vandenberg, Jan .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2008, 389 (03) :1179-1189
[18]   AUTOMATED MORPHOLOGICAL CLASSIFICATION OF APM GALAXIES BY SUPERVISED ARTIFICIAL NEURAL NETWORKS [J].
NAIM, A ;
LAHAV, O ;
SODRE, L ;
STORRIE-LOMBARDI, MC .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1995, 275 (03) :567-590
[19]  
Ripley B. D., 1988, STAT INFERENCE SPATI, V2nd, DOI DOI 10.1017/CBO9780511624131
[20]  
Ripley B. D., 1981, Spatial Statistics