Runtime Data Layout Scheduling for Machine Learning Dataset

被引:5
|
作者
You, Yang [1 ]
Demmel, James [1 ]
机构
[1] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
来源
2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP) | 2017年
关键词
parallel auto-tuning; machine learning;
D O I
10.1109/ICPP.2017.54
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Machine Learning (ML) approaches are widely-used classification/regression methods for data mining applications. However, the time-consuming training process greatly limits the efficiency of ML approaches. We use the example of SVM (traditional ML algorithm) and DNN (state-of-the-art ML algorithm) to illustrate the idea in this paper. For SVM, a major performance bottleneck of current tools is that they use a unified data storage format because the data formats can have a significant influence on the complexity of storage and computation, memory bandwidth, and the efficiency of parallel processing. To address the problem above, we study the factors influencing the algorithm's performance and conduct auto-tuning to speed up SVM training. DNN training is even slower than SVM. For example, using a 8-core CPUs to train AlexNet model by CIFAR-10 dataset costs 8.2 hours. CIFAR-10 is only 170 MB, which is not efficient for distributed processing. Moreover, due to the algorithm limitation, only a small batch of data can be processed at each iteration. We focus on finding the right algorithmic parameters and using auto-tuning techniques to make the algorithm run faster. For SVM training, our implementation achieves 1.7-16.3x speedup (6.8x on average) against the non-adaptive case (using the worst data format) for various datasets. For DNN training on CIFAR-10 dataset, we reduce the time from 8.2 hours to only roughly 1 minute. We use the benchmark of dollars per speedup to help the users to select the right deep learning hardware.
引用
收藏
页码:452 / 461
页数:10
相关论文
共 50 条
  • [21] A runtime execution environment for machine-learning laboratory work
    Tollefsen, Mikael
    Osen, Ottar L.
    PROCEEDINGS OF THE 2021 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON), 2021, : 664 - 669
  • [22] A Machine Learning Approach Towards Runtime Optimisation of Matrix Multiplication
    Xia, Yufan
    De La Pierre, Marco
    Barnard, Amanda S.
    Barca, Giuseppe Maria Junior
    2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, IPDPS, 2023, : 524 - 534
  • [23] Dataset of cannabis seeds for machine learning applications
    Chumchu, Prawit
    Patil, Kailas
    DATA IN BRIEF, 2023, 47
  • [24] SCHEDULING THE ALLOCATION OF DATA FRAGMENTS IN A DISTRIBUTED DATABASE ENVIRONMENT - A MACHINE LEARNING APPROACH
    CHATURVEDI, AR
    CHOUBEY, AK
    ROAN, JS
    IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 1994, 41 (02) : 194 - 207
  • [25] Characterizing Machine Learning-Based Runtime Prefetcher Selection
    Alcorta, Erika S.
    Madhav, Mahesh
    Afoakwa, Richard
    Tetrick, Scott
    Yadwadkar, Neeraja J.
    Gerstlauer, Andreas
    IEEE COMPUTER ARCHITECTURE LETTERS, 2024, 23 (02) : 146 - 149
  • [26] Machine Learning Predictions for Underestimation of Job Runtime on HPC System
    Guo, Jian
    Nomura, Akihiro
    Barton, Ryan
    Zhang, Haoyu
    Matsuoka, Satoshi
    SUPERCOMPUTING FRONTIERS, SCFA 2018, 2018, 10776 : 179 - 198
  • [27] Surgical scheduling via optimization and machine learning with long-tailed data
    Shi, Yuan
    Mahdian, Saied
    Blanchet, Jose
    Glynn, Peter
    Shin, Andrew Y.
    Scheinker, David
    HEALTH CARE MANAGEMENT SCIENCE, 2023, 26 (04) : 692 - 718
  • [28] DescribeML: A dataset description tool for machine learning
    Giner-Miguelez, Joan
    Gomez, Abel
    Cabot, Jordi
    SCIENCE OF COMPUTER PROGRAMMING, 2024, 231
  • [29] Runtime and memory consumption analyses for machine learning R programs
    Kotthaus, Helena
    Korb, Ingo
    Lang, Michel
    Bischl, Bernd
    Rahnenfuehrer, Joerg
    Marwedel, Peter
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (01) : 14 - 29
  • [30] Specifics of Data Collection and Data Processing during Formation of RailVista Dataset for Machine Learning- and Deep Learning-Based Applications
    Abisheva, Gulsipat
    Goranin, Nikolaj
    Razakhova, Bibigul
    Aidynov, Tolegen
    Satybaldina, Dina
    SENSORS, 2024, 24 (16)