Machine Learning-Based Real-time Task Scheduling for Apache Storm

被引:0
|
作者
Wu, Cheng-Ying [2 ]
Zhao, Qi [1 ]
Cheng, Cheng-Yu [2 ]
Yang, Yuchen [1 ]
Qureshi, Muhammad A. [3 ]
Liu, Hang [2 ]
Chen, Genshe [1 ]
机构
[1] Intelligent Fus Technol Inc, 20410 Century Blvd,Suite 230, Germantown, MD 20874 USA
[2] Catholic Univ Amer, 620 Michigan Ave NE, Washington, DC USA
[3] US Army, Ctr C5ISR, 6662 Gunner Circle, Aberdeen Proving Ground, MD USA
来源
SENSORS AND SYSTEMS FOR SPACE APPLICATIONS XVII | 2024年 / 13062卷
关键词
Machine Learning; Task Scheduling; Apache Storm; Long Short-Term Memory; Convolutional Neural Networks; Deep Belief Networks;
D O I
10.1117/12.3021842
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Apache Storm is a popular open-source distributed computing platform for real-time big-data processing. However, the existing task scheduling algorithms for Apache Storm do not adequately take into account the heterogeneity and dynamics of node computing resources and task demands, leading to high processing latency and suboptimal performance. In this thesis, we propose an innovative machine learning-based task scheduling scheme tailored for Apache Storm. The scheme leverages machine learning models to predict task performance and assigns a task to the computation node with the lowest predicted processing latency. In our design, each node operates a machine learning-based monitoring mechanism. When the master node schedules a new task, it queries the computation nodes obtains their available resources, and processes latency predictions to make the optimal assignment decision. We explored three machine learning models, including Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNN), and Deep Belief Networks (DBN). Our experiments showed that LSTM achieved the most accurate latency predictions. The evaluation results demonstrate that Apache Storm with the proposed LSTM-based scheduling scheme significantly improves the task processing delay and resource utilization, compared to the existing algorithms.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] A machine learning-based real-time tumor tracking system for fluoroscopic gating of lung radiotherapy
    Sakata, Yukinobu
    Hirai, Ryusuke
    Kobuna, Kyoka
    Tanizawa, Akiyuki
    Mori, Shinichiro
    PHYSICS IN MEDICINE AND BIOLOGY, 2020, 65 (08)
  • [32] Machine Learning-Based Real-Time Anomaly Detection for Unmanned Aerial Vehicles with a Cloud Server
    Jeong, Hyeok-June
    Lee, Myung-Jae
    Lee, Chang Eun
    Kim, Sung-Noon
    Ha, Young-Guk
    JOURNAL OF INTERNET TECHNOLOGY, 2017, 18 (04): : 823 - 832
  • [33] Machine Learning-based Product Recommendation using Apache Spark
    Chen, Lin
    Li, Rui
    Liu, Yige
    Zhang, Ruixuan
    Woodbridge, Diane Myung-kyung
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [34] Development of machine learning-based real time scheduling systems: using ensemble based on wrapper feature selection approach
    Shiue, Yeou-Ren
    Guh, Ruey-Shiang
    Lee, Ken-Chun
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2012, 50 (20) : 5887 - 5905
  • [35] Real-time Learning-based Monitoring System for Water Contamination
    Chen, Qi
    Cheng, Guanghua
    Fang, Yajun
    Liu, Yang
    Zhang, Zejun
    Gao, Yiyang
    Horn, Berthold K. P.
    2018 4TH INTERNATIONAL CONFERENCE ON UNIVERSAL VILLAGE (IEEE UV 2018): HUMANKIND IN HARMONY WITH NATURE THROUGH WISE USE OF TECHNOLOGY, 2018,
  • [36] A Task Scheduling Approach for Real-Time Stream Processing
    Chen Meng-meng
    Zhuang Chuang
    Li Zhao
    Xu Ke-fu
    2014 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2014, : 160 - 167
  • [37] ERTSim: An Embedded Real-time Task Simulator for Scheduling
    Pillai, Anju S.
    Isha, T. B.
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2013, : 724 - 727
  • [38] AREP: an adaptive, machine learning-based algorithm for real-time anomaly detection on network telemetry data
    Farkas, Karoly
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (08) : 6079 - 6094
  • [39] Machine learning-based real-time monitoring system for smart connected worker to improve energy efficiency
    Bian, Shijie
    Li, Chen
    Fu, Yongwei
    Ren, Yutian
    Wu, Tongzi
    Li, Guann-Pyng
    Li, Bingbing
    JOURNAL OF MANUFACTURING SYSTEMS, 2021, 61 : 66 - 76
  • [40] A Machine Learning-Based Tropospheric Prediction Approach for High-Precision Real-Time GNSS Positioning
    Chen, Jianping
    Gao, Yang
    SENSORS, 2024, 24 (10)