Random forest algorithm for classification of multiwavelength data

被引:45
作者
Gao, Dan [1 ,2 ]
Zhang, Yan-Xia [1 ]
Zhao, Yong-Heng [1 ]
机构
[1] Chinese Acad Sci, Natl Astron Observ, Beijing 100012, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
classification; astronomical databases: miscellaneous; catalogs; methods: data analysis; methods: statistical; SKY SURVEY;
D O I
10.1088/1674-4527/9/2/011
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
We introduced a decision tree method called Random Forests for multiwavelength data classification. The data were adopted from different databases, including the Sloan Digital Sky Survey (SDSS) Data Release five, USNO, FIRST and ROSAT. We then studied the discrimination of quasars from stars and the classification of quasars, stars and galaxies with the sample from optical and radio bands and with that from optical and X-ray bands. Moreover, feature selection and feature weighting based on Random Forests were investigated. The performances based on different input patterns were compared. The experimental results show that the random forest method is an effective method for astronomical object classification and can be applied to other classification problems faced in astronomy. In addition, Random Forests will show its superiorities due to its own merits, e.g. classification, feature selection, feature weighting as well as outlier detection.
引用
收藏
页码:220 / 226
页数:7
相关论文
共 19 条
[11]  
Voges W, 1999, ASTRON ASTROPHYS, V349, P389
[12]   A catalog of 1.4 GHz radio sources from the first survey [J].
White, RL ;
Becker, RH ;
Helfand, DJ ;
Gregg, MD .
ASTROPHYSICAL JOURNAL, 1997, 475 (02) :479-493
[13]   The Sloan Digital Sky Survey: Technical summary [J].
York, DG ;
Adelman, J ;
Anderson, JE ;
Anderson, SF ;
Annis, J ;
Bahcall, NA ;
Bakken, JA ;
Barkhouser, R ;
Bastian, S ;
Berman, E ;
Boroski, WN ;
Bracker, S ;
Briegel, C ;
Briggs, JW ;
Brinkmann, J ;
Brunner, R ;
Burles, S ;
Carey, L ;
Carr, MA ;
Castander, FJ ;
Chen, B ;
Colestock, PL ;
Connolly, AJ ;
Crocker, JH ;
Csabai, I ;
Czarapata, PC ;
Davis, JE ;
Doi, M ;
Dombeck, T ;
Eisenstein, D ;
Ellman, N ;
Elms, BR ;
Evans, ML ;
Fan, XH ;
Federwitz, GR ;
Fiscelli, L ;
Friedman, S ;
Frieman, JA ;
Fukugita, M ;
Gillespie, B ;
Gunn, JE ;
Gurbani, VK ;
de Haas, E ;
Haldeman, M ;
Harris, FH ;
Hayes, J ;
Heckman, TM ;
Hennessy, GS ;
Hindsley, RB ;
Holm, S .
ASTRONOMICAL JOURNAL, 2000, 120 (03) :1579-1587
[14]   Automated clustering algorithms for classification of astronomical objects [J].
Zhang, Y ;
Zhao, Y .
ASTRONOMY & ASTROPHYSICS, 2004, 422 (03) :1113-1121
[15]  
ZHANG Y, 2008, ADASS IN PRESS
[16]   A comparison of BBN, ADTree and MLP in separating quasars from large survey catalogues [J].
Zhang, Yan-Xia ;
Zhao, Yong-Heng .
CHINESE JOURNAL OF ASTRONOMY AND ASTROPHYSICS, 2007, 7 (02) :289-296
[17]   Classification in multidimensional parameter space: Methods and examples [J].
Zhang, YX ;
Zhao, YH .
PUBLICATIONS OF THE ASTRONOMICAL SOCIETY OF THE PACIFIC, 2003, 115 (810) :1006-1018
[18]   Learning vector quantization for classifying astronomical objects [J].
Zhang, YX ;
Zhao, YH .
CHINESE JOURNAL OF ASTRONOMY AND ASTROPHYSICS, 2003, 3 (02) :183-190
[19]   Comparison of decision tree methods for finding active objects [J].
Zhao, Yongheng ;
Zhang, Yanxia .
ADVANCES IN SPACE RESEARCH, 2008, 41 (12) :1955-1959