Discriminant analysis of distributional data via fractional programming

Cited by: 6
Authors
Dias, Sonia [1, 2]
Brito, Paula [2, 3]
Amaral, Paula [4, 5]
Affiliations
[1] Inst Politecn Viana Castelo, Viana Do Castelo, Portugal
[2] INESC TEC, LIAAD, Porto, Portugal
[3] Univ Porto, Fac Econ, Porto, Portugal
[4] Univ Nova Lisboa, CMA, Lisbon, Portugal
[5] Univ Nova Lisboa, Fac Sci & Technol, Lisbon, Portugal
Keywords
Classification; Data science; Histogram data; Multivariate statistics; Symbolic data analysis; Interval data; Model
DOI
10.1016/j.ejor.2021.01.025
Chinese Library Classification (CLC) number
C93 [Management science]
Subject classification codes
12; 1201; 1202; 120202
Abstract
We address the classification of distributional data, where units are described by histogram-valued or interval-valued variables. The proposed approach uses a linear discriminant function in which distributions or intervals are represented by quantile functions, under specific assumptions. This discriminant function defines a score for each unit, in the form of a quantile function, which is used to classify the units into two a priori groups using the Mallows distance. The proposed linear discriminant method applies to a variety of areas. In this work we classify the airline companies operating in NY airports based on air time and arrival/departure delays, using a full year of flights. (c) 2021 Elsevier B.V. All rights reserved.
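The abstract compares quantile functions with the Mallows distance, which for two quantile functions is the square root of the integral over [0, 1] of their squared difference. A minimal sketch of that computation for histogram data follows. It is not the authors' implementation: the function names `quantile_function` and `mallows_distance`, the assumption that values are uniformly distributed within each bin with strictly positive bin probabilities, and the numerical midpoint-rule integration are all illustrative choices.

```python
import numpy as np

def quantile_function(edges, probs):
    """Piecewise-linear quantile function of a histogram.

    Assumption: values are uniform within each bin and every bin has
    strictly positive probability. `edges` has one more entry than `probs`.
    """
    edges = np.asarray(edges, dtype=float)
    cum = np.concatenate(([0.0], np.cumsum(probs)))
    cum = cum / cum[-1]  # normalise so the cumulative probability ends at 1
    return lambda t: np.interp(t, cum, edges)

def mallows_distance(q1, q2, n_grid=10_000):
    """Approximate sqrt( integral_0^1 (q1(t) - q2(t))^2 dt ) by the midpoint rule."""
    t = (np.arange(n_grid) + 0.5) / n_grid  # midpoints of a uniform grid on [0, 1]
    return float(np.sqrt(np.mean((q1(t) - q2(t)) ** 2)))

# Usage: an interval is treated here as a one-bin histogram.
q_a = quantile_function([0.0, 1.0], [1.0])  # interval [0, 1]
q_b = quantile_function([1.0, 2.0], [1.0])  # interval [1, 2]
print(mallows_distance(q_a, q_b))  # close to 1, since the quantile functions differ by 1 everywhere
```

In a two-group setting, one plausible use of such a distance is to assign a unit to the group whose reference quantile function is nearest to the unit's score; how the paper builds the score and the group references is not stated in this record.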
Pages: 206-218
Number of pages: 13