Nonparametric inference for interval data using kernel methods

被引:0
作者
Park, Hoyoung [1 ]
Loh, Ji Meng [2 ]
Jang, Woncheol [3 ]
机构
[1] Sookmyung Womens Univ, Seoul, South Korea
[2] New Jersey Inst Technol, Newark, NJ USA
[3] Seoul Natl Univ, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Cross validation; kernel density estimation; Nadaraya-Watson estimator; symbolic data; BANDWIDTH SELECTION; DENSITY-ESTIMATION;
D O I
10.1080/10485252.2022.2160980
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Symbolic data have become increasingly popular in the era of big data. In this paper, we consider density estimation and regression for interval-valued data, a special type of symbolic data, common in astronomy and official statistics. We propose kernel estimators with adaptive bandwidths to account for variability of each interval. Specifically, we derive cross-validation bandwidth selectors for density estimation and extend the Nadaraya-Watson estimator for regression with interval data. We assess the performance of the proposed methods in comparison with existing kernel methods by extensive simulation studies and real data analysis.
引用
收藏
页码:455 / 473
页数:19
相关论文
共 13 条
[1]  
[Anonymous], 1964, Sankhya: The Indian Journal of Statistics, Series A
[2]  
Billard Lynne., 2007, SELECTED CONTRIBUTIO, P3
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   A DEFINITION FOR GIANT PLANETS BASED ON THE MASS-DENSITY RELATIONSHIP [J].
Hatzes, Artie P. ;
Rauer, Heike .
ASTROPHYSICAL JOURNAL LETTERS, 2015, 810 (02)
[5]   KEPLER MISSION DESIGN, REALIZED PHOTOMETRIC PERFORMANCE, AND EARLY SCIENCE [J].
Koch, David G. ;
Borucki, William J. ;
Basri, Gibor ;
Batalha, Natalie M. ;
Brown, Timothy M. ;
Caldwell, Douglas ;
Christensen-Dalsgaard, Jorgen ;
Cochran, William D. ;
DeVore, Edna ;
Dunham, Edward W. ;
Gautier, Thomas N., III ;
Geary, John C. ;
Gilliland, Ronald L. ;
Gould, Alan ;
Jenkins, Jon ;
Kondo, Yoji ;
Latham, David W. ;
Lissauer, Jack J. ;
Marcy, Geoffrey ;
Monet, David ;
Sasselov, Dimitar ;
Boss, Alan ;
Brownlee, Donald ;
Caldwell, John ;
Dupree, Andrea K. ;
Howell, Steve B. ;
Kjeldsen, Hans ;
Meibom, Soren ;
Morrison, David ;
Owen, Tobias ;
Reitsema, Harold ;
Tarter, Jill ;
Bryson, Stephen T. ;
Dotson, Jessie L. ;
Gazis, Paul ;
Haas, Michael R. ;
Kolodziejczak, Jeffrey ;
Rowe, Jason F. ;
Van Cleve, Jeffrey E. ;
Allen, Christopher ;
Chandrasekaran, Hema ;
Clarke, Bruce D. ;
Li, Jie ;
Quintana, Elisa V. ;
Tenenbaum, Peter ;
Twicken, Joseph D. ;
Wu, Hayley .
ASTROPHYSICAL JOURNAL LETTERS, 2010, 713 (02) :L79-L86
[6]   Bandwidth selection: Classical or plug-in? [J].
Loader, CR .
ANNALS OF STATISTICS, 1999, 27 (02) :415-438
[7]  
Nadaraya E. A., 1964, Theory of Probability Its Applications, V9, P141, DOI DOI 10.1137/1109020
[8]   ESTIMATION OF A PROBABILITY DENSITY-FUNCTION AND MODE [J].
PARZEN, E .
ANNALS OF MATHEMATICAL STATISTICS, 1962, 33 (03) :1065-&
[9]   Bandwidth selection in kernel density estimation for interval-grouped data [J].
Reyes, Miguel ;
Francisco-Fernandez, Mario ;
Cao, Ricardo .
TEST, 2017, 26 (03) :527-545
[10]   Nonparametric kernel density estimation for general groupeddata [J].
Reyes, Miguel ;
Francisco-Fernandez, Mario ;
Cao, Ricardo .
JOURNAL OF NONPARAMETRIC STATISTICS, 2016, 28 (02) :235-249