Multi-sample means comparisons for imprecise interval data

被引：0

作者：

Sun, Yan ^{[1
]}

Rios, Zac ^{[1
,2
]}

Bean, Brennan ^{[1
]}

机构：

[1] Utah State Univ, Dept Math & Stat, 3900 Old Main Hill, Logan, UT 84322 USA

[2] Ent Credit Union, 11550 Ent Pkwy, Colorado Springs, CO 80921 USA

来源：

INTERNATIONAL JOURNAL OF APPROXIMATE REASONING | 2025年 / 176卷

关键词：

Interval-valued data; Hypothesis test; ANOVA; Random sets; Uncertainty; Asymptotics; LINEAR-REGRESSION MODELS; FUZZY; VARIABLES;

D O I：

10.1016/j.ijar.2024.109322

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, interval data have become an increasingly popular tool to solving modern data problems. Intervals are now often used for dimensionality reduction, data aggregation, privacy censorship, and quantifying awareness of various uncertainties. Among many statistical methods that are being studied and developed for interval data, significance tests are of particular importance due to their fundamental value both in theory and practice. The difficulty in developing such tests mainly lies in the fact that the concept of normality does not extend naturally to intervals, making the exact tests hard to formulate. As a result, most existing works have relied on bootstrap methods to approximate null distributions. However, this is not always feasible given limited sample sizes or other intrinsic characteristics of the data. In this paper, we propose a novel asymptotic test for comparing multi-sample means with interval data as a generalization of the classic ANOVA. Based on the random sets theory, we construct the test statistic in the form of a ratio of between-group interval variance and within-group interval variance. The limiting null distribution is derived under usual assumptions and mild regularity conditions. Simulation studies with various data configurations validate the asymptotic result, and show promising small sample performances. Finally, a real interval data ANOVA analysis is presented that showcases the applicability of our method.

引用

页数：20

共 26 条

[1] The Multi-Sample Independence Test
Marques, Filipe J.
Coelho, Carlos A.
NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM 2012), VOLS A AND B, 2012, 1479 : 1129 - 1132
[2] Classification of Imprecise Data Using Interval Fisher Discriminator
Mansouri, Jafar
Yazdi, Hadi Sadoghi
Khademi, Morteza
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2011, 26 (08) : 718 - 730
[3] Efficient training of interval Neural Networks for imprecise training data
Sadeghi, Jonathan
de Angelis, Marco
Patelli, Edoardo
NEURAL NETWORKS, 2019, 118 : 338 - 351
[4] Two-Sample Dispersion Tests for Interval-Valued Data
Grzegorzewski, Przemyslaw
INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: APPLICATIONS, IPMU 2018, PT III, 2018, 855 : 40 - 51
[5] Integrating imprecise data in generative models using interval-valued Variational Autoencoders
Sanchez, Luciano
Costa, Nahuel
Couso, Ines
Strauss, Olivier
INFORMATION FUSION, 2025, 114
[6] A robust fuzzy k-means clustering model for interval valued data
Pierpaolo D’Urso
Paolo Giordani
Computational Statistics, 2006, 21 : 251 - 269
[7] Asymptotic algorithm for computing the sample variance of interval data
Kolacz, A.
Grzegorzewski, P.
IRANIAN JOURNAL OF FUZZY SYSTEMS, 2019, 16 (04): : 83 - 96
[8] INCM: neutrosophic c-means clustering algorithm for interval-valued data
Qiu, Haoye
Liu, Zhe
Letchmunan, Sukumar
GRANULAR COMPUTING, 2024, 9 (02)
[9] Two-sample tests for interval-valued data
Hyejeong Choi
Johan Lim
Donghyeon Yu
Minjung Kwak
Journal of the Korean Statistical Society, 2021, 50 : 233 - 271
[10] Two-sample tests for interval-valued data
Choi, Hyejeong
Lim, Johan
Yu, Donghyeon
Kwak, Minjung
JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2021, 50 (01) : 233 - 271

← 1 2 3 →