A distributed approach for accelerating sparse matrix arithmetic operations for high-dimensional feature selection

被引:2
作者
Tommasel, Antonela [1 ]
Godoy, Daniela [1 ]
Zunino, Alejandro [1 ]
Mateos, Cristian [1 ]
机构
[1] UNICEN CONICET, ISISTAN, Campus Univ, Tandil, Buenos Aires, Argentina
关键词
Sparse matrix; Matrix arithmetic operation; Feature selection; Distributed computing; PARALLEL; ALGORITHM; LIBRARY; FACTORIZATION;
D O I
10.1007/s10115-016-0981-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Matrix computations are both fundamental and ubiquitous in computational science, and as a result, they are frequently used in numerous disciplines of scientific computing and engineering. Due to the high computational complexity of matrix operations, which makes them critical to the performance of a large number of applications, their efficient execution in distributed environments becomes a crucial issue. This work proposes a novel approach for distributing sparse matrix arithmetic operations on computer clusters aiming at speeding-up the processing of high-dimensional matrices. The approach focuses on how to split such operations into independent parallel tasks by considering the intrinsic characteristics that distinguish each type of operation and the particular matrices involved. The approach was applied to the most commonly used arithmetic operations between matrices. The performance of the presented approach was evaluated considering a high-dimensional text feature selection approach and two real-world datasets. Experimental evaluation showed that the proposed approach helped to significantly reduce the computing times of big-scale matrix operations, when compared to serial and multi-thread implementations as well as several linear algebra software libraries.
引用
收藏
页码:459 / 497
页数:39
相关论文
共 50 条
  • [21] A filter feature selection for high-dimensional data
    Janane, Fatima Zahra
    Ouaderhman, Tayeb
    Chamlal, Hasna
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17
  • [22] Embedded feature selection approach based on TSK fuzzy system with sparse rule base for high-dimensional classification problems
    Gong, Xiaoling
    Wang, Jian
    Ren, Qilin
    Zhang, Kai
    El-Alfy, El-Sayed M.
    Mandziuk, Jacek
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [23] High-dimensional feature selection for genomic datasets
    Afshar, Majid
    Usefi, Hamid
    KNOWLEDGE-BASED SYSTEMS, 2020, 206
  • [24] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    ECTA 2011/FCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EVOLUTIONARY COMPUTATION THEORY AND APPLICATIONS AND INTERNATIONAL CONFERENCE ON FUZZY COMPUTATION THEORY AND APPLICATIONS, 2011,
  • [25] Filter Feature Selection Performance Comparison in High-dimensional Data
    Huertas, Carlos
    Juarez-Ramirez, Reyes
    2014 17TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2014,
  • [26] On online high-dimensional spherical data clustering and feature selection
    Amayri, Ola
    Bouguila, Nizar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (04) : 1386 - 1398
  • [27] Feature selection in high-dimensional microarray cancer datasets using an improved equilibrium optimization approach
    Balakrishnan, Kulanthaivel
    Dhanalakshmi, Ramasamy
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (28)
  • [28] A Space Transformation-Based Multiform Approach for Multiobjective Feature Selection in High-Dimensional Classification
    Yu, Kunjie
    Sun, Shaoru
    Liang, Jing
    Chen, Ke
    Qu, Boyang
    Yue, Caitong
    Suganthan, Ponnuthurai Nagaratnam
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (12): : 7305 - 7317
  • [29] Feature selection for high-dimensional regression via sparse LSSVR based on Lp-norm
    Li, Chun-Na
    Shao, Yuan-Hai
    Zhao, Da
    Guo, Yan-Ru
    Hua, Xiang-Yu
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (02) : 1108 - 1130
  • [30] Hybrid binary arithmetic optimization algorithm with simulated annealing for feature selection in high-dimensional biomedical data
    Pashaei, Elham
    Pashaei, Elnaz
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (13) : 15598 - 15637