Big Data Transfer Optimization Based on Offline Knowledge Discovery and Adaptive Sampling

被引:0
|
作者
Nine, Md S. Q. Zulkar [1 ]
Guner, Kemal [1 ]
Huang, Ziyun [1 ]
Wang, Xiangyu [1 ]
Xu, Jinhui [1 ]
Kosar, Tevfik [1 ]
机构
[1] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The amount of data moved over dedicated and non-dedicated network links increases much faster than the increase in the network capacity, but the current solutions fail to guarantee even the promised achievable transfer throughputs. In this paper, we propose a novel dynamic throughput optimization model based on mathematical modeling with offline knowledge discovery/analysis and adaptive online decision making. In offline analysis, we mine historical transfer logs to perform knowledge discovery about the transfer characteristics. Online phase uses the discovered knowledge from the offline analysis along with real-time investigation of the network condition to optimize the protocol parameters. As real-time investigation is expensive and provides partial knowledge about the current network status, our model uses historical knowledge about the network and data to reduce the real-time investigation overhead while ensuring near optimal throughput for each transfer. Our novel approach is tested over different networks with different datasets and outperformed its closest competitor by 1.7x and the default case by 5x. It also achieved up to 93% accuracy compared with the optimal achievable throughput possible on those networks.
引用
收藏
页码:465 / 472
页数:8
相关论文
共 50 条
  • [1] Sampling and Evaluating the Big Data for Knowledge Discovery
    Sung, Andrew H.
    Ribeiro, Bernardete
    Liu, Qingzhong
    IOTBD: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS AND BIG DATA, 2016, : 378 - 382
  • [2] Intelligent Method for Adaptive In Silico Knowledge Discovery Based on Big Genomic Data Analytics
    Borovska, Plamenka
    Ivanova, Desislava
    PROCEEDINGS OF THE 44TH INTERNATIONAL CONFERENCE "APPLICATIONS OF MATHEMATICS IN ENGINEERING AND ECONOMICS", 2018, 2048
  • [3] Platform for Adaptive Knowledge Discovery and Decision Making Based on Big Genomics Data Analytics
    Borovska, Plamenka
    Gancheva, Veska
    Georgiev, Ivailo
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2019), PT II, 2019, 11466 : 297 - 308
  • [4] Big Data knowledge discovery
    Xhafa, Fatos
    Taniar, David
    KNOWLEDGE-BASED SYSTEMS, 2015, 79 : 1 - 2
  • [5] Big data transfer optimization through adaptive parameter tuning
    Arslan, Engin
    Pehlivan, Bahadir A.
    Kosar, Tevfik
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 120 : 89 - 100
  • [6] A Scalable Adaptive Sampling Based Approach for Big Data Classification
    Djouzi, Kheyreddine
    Beghdad-Bey, Kadda
    Amamra, Abdenour
    ADVANCES IN COMPUTING SYSTEMS AND APPLICATIONS, 2022, 513 : 73 - 83
  • [7] Big data analytics and knowledge discovery
    Bellatreche, Ladjel
    Mohania, Mukesh
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (15): : 3945 - 3947
  • [8] Towards Knowledge Discovery in Big Data
    Lomotey, Richard K.
    Deters, Ralph
    2014 IEEE 8TH INTERNATIONAL SYMPOSIUM ON SERVICE ORIENTED SYSTEM ENGINEERING (SOSE), 2014, : 181 - 191
  • [9] Crop Knowledge Discovery Based on Agricultural Big Data Integration
    Ngo, Vuong M.
    Kechadi, M-Tahar
    ICMLSC 2020: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, 2020, : 46 - 50
  • [10] Big Data Analytics and Knowledge Discovery
    Golfarelli, Matteo
    Wrembel, Robert
    DATA & KNOWLEDGE ENGINEERING, 2023, 146