Multi-sample means comparisons for imprecise interval data

Authors
Sun, Yan [1]
Rios, Zac [1,2]
Bean, Brennan [1]
Affiliations
[1] Utah State Univ, Dept Math & Stat, 3900 Old Main Hill, Logan, UT 84322 USA
[2] Ent Credit Union, 11550 Ent Pkwy, Colorado Springs, CO 80921 USA
Keywords
Interval-valued data; Hypothesis test; ANOVA; Random sets; Uncertainty; Asymptotics; LINEAR-REGRESSION MODELS; FUZZY; VARIABLES;
DOI
10.1016/j.ijar.2024.109322
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In recent years, interval data have become an increasingly popular tool for solving modern data problems. Intervals are now often used for dimensionality reduction, data aggregation, privacy censorship, and quantifying awareness of various uncertainties. Among the many statistical methods being studied and developed for interval data, significance tests are of particular importance due to their fundamental value in both theory and practice. The difficulty in developing such tests lies mainly in the fact that the concept of normality does not extend naturally to intervals, making exact tests hard to formulate. As a result, most existing works have relied on bootstrap methods to approximate null distributions. However, this is not always feasible given limited sample sizes or other intrinsic characteristics of the data. In this paper, we propose a novel asymptotic test for comparing multi-sample means with interval data as a generalization of the classic ANOVA. Based on random set theory, we construct the test statistic as a ratio of between-group interval variance to within-group interval variance. The limiting null distribution is derived under the usual assumptions and mild regularity conditions. Simulation studies with various data configurations validate the asymptotic result and show promising small-sample performance. Finally, an ANOVA analysis of real interval data is presented that showcases the applicability of our method.
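The ratio described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the choice of squared distance between intervals (half the sum of squared endpoint differences), and the classical ANOVA-style degrees-of-freedom scaling are all assumptions made here for illustration. The sketch returns only the ratio; the abstract states that the null distribution is a limiting one derived in the paper, so the value should not be read against a standard F table on the basis of this example.

```python
import numpy as np


def interval_dist_sq(a, b):
    """Squared distance between intervals stored as (lower, upper).

    Assumption: half the sum of squared endpoint differences, which equals
    (center difference)^2 + (radius difference)^2.
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return 0.5 * np.sum((a - b) ** 2, axis=-1)


def interval_anova_ratio(groups):
    """Ratio of between-group to within-group interval variation.

    groups: list of arrays of shape (n_i, 2), one row per interval.
    The scaling by (k - 1) and (n - k) is borrowed from classical ANOVA
    and is an assumption, not taken from the paper.
    """
    groups = [np.asarray(g, dtype=float) for g in groups]
    k = len(groups)
    n = sum(len(g) for g in groups)

    # The Minkowski average of intervals is the endpoint-wise mean.
    grand_mean = np.vstack(groups).mean(axis=0)
    group_means = [g.mean(axis=0) for g in groups]

    between = sum(
        len(g) * interval_dist_sq(m, grand_mean)
        for g, m in zip(groups, group_means)
    )
    within = sum(
        interval_dist_sq(g, m).sum() for g, m in zip(groups, group_means)
    )
    return (between / (k - 1)) / (within / (n - k))


if __name__ == "__main__":
    # Two small hypothetical groups of intervals.
    g1 = [[0.0, 1.0], [2.0, 3.0]]
    g2 = [[4.0, 5.0], [6.0, 7.0]]
    print(interval_anova_ratio([g1, g2]))
```

Storing each interval as a (lower, upper) pair keeps the group means simple to compute; a (center, radius) representation would give the same distance under the assumption stated above.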
Pages: 20