Multi-sample means comparisons for imprecise interval data

被引:0
|
作者
Sun, Yan [1 ]
Rios, Zac [1 ,2 ]
Bean, Brennan [1 ]
机构
[1] Utah State Univ, Dept Math & Stat, 3900 Old Main Hill, Logan, UT 84322 USA
[2] Ent Credit Union, 11550 Ent Pkwy, Colorado Springs, CO 80921 USA
关键词
Interval-valued data; Hypothesis test; ANOVA; Random sets; Uncertainty; Asymptotics; LINEAR-REGRESSION MODELS; FUZZY; VARIABLES;
D O I
10.1016/j.ijar.2024.109322
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, interval data have become an increasingly popular tool to solving modern data problems. Intervals are now often used for dimensionality reduction, data aggregation, privacy censorship, and quantifying awareness of various uncertainties. Among many statistical methods that are being studied and developed for interval data, significance tests are of particular importance due to their fundamental value both in theory and practice. The difficulty in developing such tests mainly lies in the fact that the concept of normality does not extend naturally to intervals, making the exact tests hard to formulate. As a result, most existing works have relied on bootstrap methods to approximate null distributions. However, this is not always feasible given limited sample sizes or other intrinsic characteristics of the data. In this paper, we propose a novel asymptotic test for comparing multi-sample means with interval data as a generalization of the classic ANOVA. Based on the random sets theory, we construct the test statistic in the form of a ratio of between-group interval variance and within-group interval variance. The limiting null distribution is derived under usual assumptions and mild regularity conditions. Simulation studies with various data configurations validate the asymptotic result, and show promising small sample performances. Finally, a real interval data ANOVA analysis is presented that showcases the applicability of our method.
引用
收藏
页数:20
相关论文
共 26 条
  • [1] The Multi-Sample Independence Test
    Marques, Filipe J.
    Coelho, Carlos A.
    NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM 2012), VOLS A AND B, 2012, 1479 : 1129 - 1132
  • [2] Classification of Imprecise Data Using Interval Fisher Discriminator
    Mansouri, Jafar
    Yazdi, Hadi Sadoghi
    Khademi, Morteza
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2011, 26 (08) : 718 - 730
  • [3] Efficient training of interval Neural Networks for imprecise training data
    Sadeghi, Jonathan
    de Angelis, Marco
    Patelli, Edoardo
    NEURAL NETWORKS, 2019, 118 : 338 - 351
  • [4] Two-Sample Dispersion Tests for Interval-Valued Data
    Grzegorzewski, Przemyslaw
    INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: APPLICATIONS, IPMU 2018, PT III, 2018, 855 : 40 - 51
  • [5] Integrating imprecise data in generative models using interval-valued Variational Autoencoders
    Sanchez, Luciano
    Costa, Nahuel
    Couso, Ines
    Strauss, Olivier
    INFORMATION FUSION, 2025, 114
  • [6] A robust fuzzy k-means clustering model for interval valued data
    Pierpaolo D’Urso
    Paolo Giordani
    Computational Statistics, 2006, 21 : 251 - 269
  • [7] Asymptotic algorithm for computing the sample variance of interval data
    Kolacz, A.
    Grzegorzewski, P.
    IRANIAN JOURNAL OF FUZZY SYSTEMS, 2019, 16 (04): : 83 - 96
  • [8] INCM: neutrosophic c-means clustering algorithm for interval-valued data
    Qiu, Haoye
    Liu, Zhe
    Letchmunan, Sukumar
    GRANULAR COMPUTING, 2024, 9 (02)
  • [9] Two-sample tests for interval-valued data
    Hyejeong Choi
    Johan Lim
    Donghyeon Yu
    Minjung Kwak
    Journal of the Korean Statistical Society, 2021, 50 : 233 - 271
  • [10] Two-sample tests for interval-valued data
    Choi, Hyejeong
    Lim, Johan
    Yu, Donghyeon
    Kwak, Minjung
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2021, 50 (01) : 233 - 271