A distributed approach for accelerating sparse matrix arithmetic operations for high-dimensional feature selection

被引:2
作者
Tommasel, Antonela [1 ]
Godoy, Daniela [1 ]
Zunino, Alejandro [1 ]
Mateos, Cristian [1 ]
机构
[1] UNICEN CONICET, ISISTAN, Campus Univ, Tandil, Buenos Aires, Argentina
关键词
Sparse matrix; Matrix arithmetic operation; Feature selection; Distributed computing; PARALLEL; ALGORITHM; LIBRARY; FACTORIZATION;
D O I
10.1007/s10115-016-0981-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Matrix computations are both fundamental and ubiquitous in computational science, and as a result, they are frequently used in numerous disciplines of scientific computing and engineering. Due to the high computational complexity of matrix operations, which makes them critical to the performance of a large number of applications, their efficient execution in distributed environments becomes a crucial issue. This work proposes a novel approach for distributing sparse matrix arithmetic operations on computer clusters aiming at speeding-up the processing of high-dimensional matrices. The approach focuses on how to split such operations into independent parallel tasks by considering the intrinsic characteristics that distinguish each type of operation and the particular matrices involved. The approach was applied to the most commonly used arithmetic operations between matrices. The performance of the presented approach was evaluated considering a high-dimensional text feature selection approach and two real-world datasets. Experimental evaluation showed that the proposed approach helped to significantly reduce the computing times of big-scale matrix operations, when compared to serial and multi-thread implementations as well as several linear algebra software libraries.
引用
收藏
页码:459 / 497
页数:39
相关论文
共 50 条
  • [1] A distributed approach for accelerating sparse matrix arithmetic operations for high-dimensional feature selection
    Antonela Tommasel
    Daniela Godoy
    Alejandro Zunino
    Cristian Mateos
    Knowledge and Information Systems, 2017, 51 : 459 - 497
  • [2] SMArtOp: A Java']Java library for distributing high-dimensional sparse-matrix arithmetic operations
    Tommasel, Antonela
    Godoy, Daniela
    Zunino, Alejandro
    SCIENCE OF COMPUTER PROGRAMMING, 2017, 150 : 26 - 30
  • [3] Minimax Sparse Logistic Regression for Very High-Dimensional Feature Selection
    Tan, Mingkui
    Tsang, Ivor W.
    Wang, Li
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (10) : 1609 - 1622
  • [4] Multistage feature selection approach for high-dimensional cancer data
    Alkuhlani, Alhasan
    Nassef, Mohammad
    Farag, Ibrahim
    SOFT COMPUTING, 2017, 21 (22) : 6895 - 6906
  • [5] High-dimensional feature selection via feature grouping: A Variable Neighborhood Search approach
    Garcia-Torres, Miguel
    Gomez-Vela, Francisco
    Melian-Batista, Belen
    Marcos Moreno-Vega, J.
    INFORMATION SCIENCES, 2016, 326 : 102 - 118
  • [6] Synergistic feature selection and distributed classification framework for high-dimensional medical data analysis
    Dhinakaran, D.
    Srinivasan, L.
    Raja, S. Edwin
    Valarmathi, K.
    Nayagam, M. Gomathy
    METHODSX, 2025, 14
  • [7] Multistage feature selection approach for high-dimensional cancer data
    Alhasan Alkuhlani
    Mohammad Nassef
    Ibrahim Farag
    Soft Computing, 2017, 21 : 6895 - 6906
  • [8] A sequential approach to feature selection in high-dimensional additive models
    Gong, Yuan
    Chen, Zehua
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2021, 215 : 289 - 298
  • [9] A hybrid feature selection approach based on ensemble method for high-dimensional data
    Rouhi, Amirreza
    Nezamabadi-pour, Hossein
    2017 2ND CONFERENCE ON SWARM INTELLIGENCE AND EVOLUTIONARY COMPUTATION (CSIEC), 2017, : 16 - 20
  • [10] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75