Efficient Algorithms for Range Mode Queries in the Big Data Era

被引:3
作者
Karras, Christos [1 ]
Theodorakopoulos, Leonidas [2 ]
Karras, Aristeidis [1 ]
Krimpas, George A. [1 ]
机构
[1] Univ Patras, Comp Engn & Informat Dept, Rion 26504, Greece
[2] Univ Patras, Dept Management Sci & Technol, Patras 26334, Greece
关键词
data structures; algorithms; RAM; range mode queries; big data; internal audit; DATA ANALYTICS;
D O I
10.3390/info15080450
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The mode is a fundamental descriptive statistic in data analysis, signifying the most frequent element within a dataset. The range mode query (RMQ) problem expands upon this concept by preprocessing an array A containing n natural numbers. This allows for the swift determination of the mode within any subarray A[a..b], thus optimizing the computation of the mode for a multitude of range queries. The efficacy of this process bears considerable importance in data analytics and retrieval across diverse platforms, including but not limited to online shopping experiences and financial auditing systems. This study is dedicated to exploring and benchmarking different algorithms and data structures designed to tackle the RMQ problem. The goal is to not only address the theoretical aspects of RMQ but also to provide practical solutions that can be applied in real-world scenarios, such as the optimization of an online shopping platform's understanding of customer preferences, enhancing the efficiency and effectiveness of data retrieval in large datasets.
引用
收藏
页数:37
相关论文
共 50 条
  • [1] Improvising Range Aggregate Queries in Big Data Environment
    Arbad, Ganesh R.
    Kulkarni, P. V.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1896 - 1901
  • [2] The Business Mode Innovation in Big Data Era
    Zhang Yongsheng
    PROCEEDINGS OF INTERNATIONAL SYMPOSIUM - MANAGEMENT, INNOVATION & DEVELOPMENT (MID2014), 2014, : 274 - 277
  • [3] Big data, algorithms and politics: the social sciences in the era of social media
    Gonzalez, Felipe
    CINTA DE MOEBIO, 2019, 65 : 267 - 280
  • [4] Range queries on uncertain data
    Li, Jian
    Wang, Haitao
    THEORETICAL COMPUTER SCIENCE, 2016, 609 : 32 - 48
  • [5] Data Factory: An Efficient Data Analysis Solution in the Era of Big Data
    Wang, Yaojun
    Li, Yangyang
    Sui, Jingyan
    Gao, Yang
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), 2020, : 28 - 32
  • [6] Efficient Data Structures for Range Shortest Unique Substring Queries
    Abedin, Paniz
    Ganguly, Arnab
    Pissis, Solon P.
    Thankachan, Sharma V.
    ALGORITHMS, 2020, 13 (11) : 1 - 9
  • [7] Analysis on the Construction of Computer Data Processing Mode in the Era of Big Data
    Guo, Shi-Qi
    Zhai, Lei
    AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (01): : 798 - 802
  • [8] Sufficiency Revisited: Rethinking Statistical Algorithms in the Big Data Era
    Lee, Jarod Y. L.
    Brown, James J.
    Ryan, Louise M.
    AMERICAN STATISTICIAN, 2017, 71 (03) : 202 - 208
  • [9] FastRAQ: A Fast Approach to Range-Aggregate Queries in Big Data Environments
    Yun, Xiaochun
    Wu, Guangjun
    Zhang, Guangyan
    Li, Keqin
    Wang, Shupeng
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2015, 3 (02) : 206 - 218
  • [10] Research on Innovation of Enterprise Management Mode in the Era of Big Data
    Jin, Wu
    2018 5TH INTERNATIONAL CONFERENCE ON BUSINESS, ECONOMICS AND MANAGEMENT (BUSEM 2018), 2018, : 330 - 332