On discriminating between lognormal and Pareto tail: an unsupervised mixture-based approach

被引:6
作者
Bee, Marco [1 ]
机构
[1] Univ Trento, Dept Econ & Management, Via Inama 5, I-38122 Trento, Italy
关键词
Mixture distributions; EM algorithm; Profile likelihood; Unsupervised tail estimation; GIBRATS LAW; MAXIMUM-LIKELIHOOD; POWER LAWS; CITIES; SIZE; ZIPF; DISTRIBUTIONS;
D O I
10.1007/s11634-022-00497-4
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Many stochastic models in economics and finance are described by distributions with a lognormal body. Testing for a possible Pareto tail and estimating the parameters of the Pareto distribution in these models is an important topic. Although the problem has been extensively studied in the literature, most applications are characterized by some weaknesses. We propose a method that exploits all the available information by taking into account the data generating process of the whole population. After estimating a lognormal-Pareto mixture with a known threshold via the EM algorithm, we exploit this result to develop an unsupervised tail estimation approach based on the maximization of the profile likelihood function. Monte Carlo experiments and two empirical applications to the size of US metropolitan areas and of firms in an Italian district confirm that the proposed method works well and outperforms two commonly used techniques. Simulation results are available in an online supplementary appendix.
引用
收藏
页码:251 / 269
页数:19
相关论文
共 37 条
[1]   Modeling loss data using composite models [J].
Abu Bakar, S. A. ;
Hamzah, N. A. ;
Maghsoudi, M. ;
Nadarajah, S. .
INSURANCE MATHEMATICS & ECONOMICS, 2015, 61 :146-154
[2]   Zipf distribution of US firm sizes [J].
Axtell, RL .
SCIENCE, 2001, 293 (5536) :1818-1820
[3]   Where Gibrat meets Zipf: Scale and scope of French firms [J].
Bee, Marco ;
Riccaboni, Massimo ;
Schiavo, Stefano .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2017, 481 :265-275
[4]   A Bayesian look at American academic wages: From wage dispersion to wage compression [J].
Benzidia, Majda ;
Lubrano, Michel .
JOURNAL OF ECONOMIC INEQUALITY, 2020, 18 (02) :213-238
[5]   The city size distribution debate: Resolution for US urban regions and megalopolitan areas [J].
Berry, Brian J. L. ;
Okulicz-Kozaryn, Adam .
CITIES, 2012, 29 :S17-S23
[6]   Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 41 (3-4) :561-575
[7]   Power-Law Distributions in Empirical Data [J].
Clauset, Aaron ;
Shalizi, Cosma Rohilla ;
Newman, M. E. J. .
SIAM REVIEW, 2009, 51 (04) :661-703
[8]  
D'Acci L., 2019, MATH URBAN MORPHOLOG, DOI DOI 10.1007/978-3-030-12381-9
[9]   The best test of exponentiality against singly truncated normal alternatives [J].
del Castillo, J ;
Puig, P .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1999, 94 (446) :529-532
[10]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38