ParIS plus : Data Series Indexing on Multi-Core Architectures

被引:8
|
作者
Peng, Botao [1 ]
Fatourou, Panagiota [2 ,3 ]
Palpanas, Themis [1 ]
机构
[1] Univ Paris, LIPADE, F-75006 Paris, France
[2] FORTH, ICS, Iraklion, Greece
[3] Univ Crete, Dept Comp Sci, Iraklion 70013, Greece
关键词
Indexing; Task analysis; Parallel processing; Hardware; Aggregates; Multicore processing; Data series; time series; indexing; similarity search; query answering; multi-core architectures; parallelization; DATABASES;
D O I
10.1109/TKDE.2020.2975180
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data series similarity search is a core operation for several data series analysis applications across many different domains. Nevertheless, even state-of-the-art techniques cannot provide the time performance required for large data series collections. We propose ParIS and ParIS+, the first disk-based data series indices carefully designed to inherently take advantage of multi-core architectures, in order to accelerate similarity search processing times. Our experiments demonstrate that ParIS+ completely removes the CPU latency during index construction for disk-resident data, and for exact query answering is up to 1 order of magnitude faster than the current state of the art index scan method, and up to 3 orders of magnitude faster than the optimized serial scan method. ParIS+ (which is an evolution of the ADS+ index) owes its efficiency to the effective use of multi-core and multi-socket architectures, in order to distribute and execute in parallel both index construction and query answering, and to the exploitation of the Single Instruction Multiple Data (SIMD) capabilities of modern CPUs, in order to further parallelize the execution of instructions inside each core.
引用
收藏
页码:2151 / 2164
页数:14
相关论文
共 50 条
  • [1] Fast data series indexing for in-memory data
    Peng, Botao
    Fatourou, Panagiota
    Palpanas, Themis
    VLDB JOURNAL, 2021, 30 (06): : 1041 - 1067
  • [2] Performance optimization of the MGB hydrological model for multi-core and GPU architectures
    Freitas, Henrique R. A.
    Mendes, Celso L.
    Ilic, Aleksandar
    ENVIRONMENTAL MODELLING & SOFTWARE, 2022, 148
  • [3] Performance issues in emerging homogeneous multi-core architectures
    Kayi, Abdullah
    El-Ghazawi, Tarek
    Newby, Gregory B.
    SIMULATION MODELLING PRACTICE AND THEORY, 2009, 17 (09) : 1485 - 1499
  • [4] Multi-Core (CPU and GPU) for Permutation-Based Indexing
    Mohamed, Hisham
    Osipyan, Hasmik
    Marchand-Maillet, Stephane
    SIMILARITY SEARCH AND APPLICATIONS, 2014, 8821 : 277 - 288
  • [5] Synthesis of Pareto Efficient Technical Architectures for Multi-Core Systems
    Zverlov, Sergey
    Voss, Sebastian
    2014 38TH ANNUAL IEEE INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSACW 2014), 2014, : 366 - 371
  • [6] Nearest Neighbor Affinity Scheduling In Heterogeneous Multi-Core Architectures
    Sibai, Fadi N.
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2008, 8 (03): : 144 - 150
  • [7] Parallel Subspace Clustering Using Multi-core and Many-core Architectures
    Datta, Amitava
    Kaur, Amardeep
    Lauer, Tobias
    Chabbouh, Sami
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2017, 2017, 767 : 213 - 223
  • [8] ParaMiner: a generic pattern mining algorithm for multi-core architectures
    Benjamin Negrevergne
    Alexandre Termier
    Marie-Christine Rousset
    Jean-François Méhaut
    Data Mining and Knowledge Discovery, 2014, 28 : 593 - 633
  • [9] PARAMINER: a generic pattern mining algorithm for multi-core architectures
    Negrevergne, Benjamin
    Termier, Alexandre
    Rousset, Marie-Christine
    Mehaut, Jean-Francois
    DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (03) : 593 - 633
  • [10] Data Series Indexing Gone Parallel
    Peng, Botao
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 2059 - 2063