Effective signal reconstruction from multiple ranked lists via convex optimization

Cited by: 0
Authors
Michael G. Schimek
Luca Vitale
Bastian Pfeifer
Michele La Rocca
Affiliations
[1] Medical University of Graz, Medical Informatics, Statistics and Documentation
[2] University of Salerno, Department of Economics and Statistics
Source
Data Mining and Knowledge Discovery | 2024 / Vol. 38
Keywords
Ranking data; Rank centrality; Signal estimation; Convex optimization; Poisson bootstrap
DOI
Not available
Abstract
The ranking of objects is widely used to rate their relative quality or relevance across multiple assessments. Beyond classical rank aggregation, it is of interest to estimate the usually unobservable latent signals that inform a consensus ranking. Under the sole assumption of independent assessments, which can be incomplete, we introduce indirect inference via convex optimization in combination with computationally efficient Poisson Bootstrap. Two different objective functions are suggested, one linear and the other quadratic. The mathematical formulation of the signal estimation problem is based on pairwise comparisons of all objects with respect to their rank positions. Sets of constraints represent the order relations. The transitivity property of rank scales allows us to reduce substantially the number of constraints associated with the full set of object comparisons. The key idea is to globally reduce the errors induced by the rankers until optimal latent signals can be obtained. Its main advantage is low computational costs, even when handling $n \ll p$ data problems. Exploratory tools can be developed based on the bootstrap signal estimates and standard errors. Simulation evidence, a comparison with the state-of-the-art rank centrality method, and two applications, one in higher education evaluation and the other in molecular cancer research, are presented.
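The abstract's remark on transitivity can be illustrated with a small sketch. This is not the authors' formulation: it assumes that each ranker's list contributes order constraints of the form (better, worse), and that keeping only neighbouring pairs is one way to exploit transitivity. The names `all_pair_constraints`, `adjacent_constraints`, `transitive_closure` and the example list are hypothetical.

```python
from itertools import combinations


def all_pair_constraints(ranking):
    """Every (better, worse) pair implied by one ranked list (best first)."""
    return list(combinations(ranking, 2))


def adjacent_constraints(ranking):
    """Only neighbouring pairs; the remaining pairs follow by transitivity."""
    return list(zip(ranking, ranking[1:]))


def transitive_closure(pairs):
    """Expand a set of (better, worse) pairs to every pair they imply."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        for a, b in list(closure):
            for c, d in list(closure):
                if b == c and (a, d) not in closure:
                    closure.add((a, d))
                    changed = True
    return closure


# Hypothetical ranker who orders five objects, best first.
ranking = ["A", "C", "B", "E", "D"]
full = all_pair_constraints(ranking)      # p(p-1)/2 constraints
reduced = adjacent_constraints(ranking)   # p-1 constraints

print(len(full), len(reduced))
print(transitive_closure(reduced) == set(full))
```

Under these assumptions, a list of p objects needs p-1 constraints rather than p(p-1)/2, and the reduced set still implies every pairwise order relation. An incomplete list is handled the same way, since only the objects a ranker actually assessed generate constraints.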
Pages: 1125-1169
Page count: 44