Balancing Performance and Energy Consumption of Bagging Ensembles for the Classification of Data Streams in Edge Computing

被引:2
作者
Cassales, Guilherme [1 ]
Gomes, Heitor Murilo [2 ]
Bifet, Albert [1 ]
Pfahringer, Bernhard [1 ]
Senger, Hermes [3 ]
机构
[1] Univ Waikato, Dept Comp Sci, Hamilton 3240, New Zealand
[2] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington 6140, New Zealand
[3] Univ Fed Sao Carlos, Dept Comp Sci, BR-13565905 Sao Carlos, Brazil
来源
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2023年 / 20卷 / 03期
基金
巴西圣保罗研究基金会;
关键词
Edge computing; machine learning; data stream classification; ensembles; energy consumption;
D O I
10.1109/TNSM.2022.3226505
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, the Edge Computing (EC) paradigm has emerged as an enabling factor for developing technologies like the Internet of Things (IoT) and 5G networks, bridging the gap between Cloud Computing services and end-users, supporting low latency, mobility, and location awareness to delay-sensitive applications. An increasing number of solutions in EC have employed machine learning (ML) methods to perform data classification and other information processing tasks on continuous and evolving data streams. Usually, such solutions have to cope with vast amounts of data that come as data streams while balancing energy consumption, latency, and the predictive performance of the algorithms. Ensemble methods achieve remarkable predictive performance when applied to evolving data streams due to several models and the possibility of selective resets. This work investigates a strategy that introduces short intervals to defer the processing of mini-batches. Well balanced, our strategy can improve the performance (i.e., delay, throughput) and reduce the energy consumption of bagging ensembles to classify data streams. The experimental evaluation involved six state-of-art ensemble algorithms (OzaBag, OzaBag Adaptive Size Hoeffding Tree, Online Bagging ADWIN, Leveraging Bagging, Adaptive RandomForest, and Streaming Random Patches) applying five widely used machine learning benchmark datasets with varied characteristics on three computer platforms. As a result, our strategy can significantly reduce energy consumption in 96% of the experimental scenarios evaluated. Despite the trade-offs, it is possible to balance them to avoid significant loss in predictive performance.
引用
收藏
页码:3038 / 3054
页数:17
相关论文
共 37 条
[1]  
Amezzane I., 2019, P 4 WORLD C COMPL SY, P1
[2]   The case for energy-proportional computing [J].
Barroso, Luiz Andre ;
Hoelzle, Urs .
COMPUTER, 2007, 40 (12) :33-+
[3]   A survey of design techniques for system-level dynamic power management [J].
Benini, L ;
Bogliolo, A ;
De Micheli, G .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2000, 8 (03) :299-316
[4]  
Bifet A, 2010, JMLR WORKSH CONF PRO, V11, P44
[5]  
Bifet A, 2010, LECT NOTES ARTIF INT, V6321, P135, DOI 10.1007/978-3-642-15880-3_15
[6]  
Bifet A, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P139
[7]  
Bifet A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P443
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[10]  
Cassales Guilherme, 2020, 2020 IEEE 22nd International Conference on High Performance Computing and Communications