A graph-based big data optimization approach using hidden Markov model and constraint satisfaction problem

被引:0
作者
Imad Sassi
Samir Anter
Abdelkrim Bekkhoucha
机构
[1] Computer Science Laboratory (LIM),
[2] FSTM,undefined
[3] Hassan II University,undefined
来源
Journal of Big Data | / 8卷
关键词
Machine learning; Big data analytics; Optimization; Metaheuristics; Time series forecasting; Graphical modeling;
D O I
暂无
中图分类号
学科分类号
摘要
To address the challenges of big data analytics, several works have focused on big data optimization using metaheuristics. The constraint satisfaction problem (CSP) is a fundamental concept of metaheuristics that has shown great efficiency in several fields. Hidden Markov models (HMMs) are powerful machine learning algorithms that are applied especially frequently in time series analysis. However, one issue in forecasting time series using HMMs is how to reduce the search space (state and observation space). To address this issue, we propose a graph-based big data optimization approach using a CSP to enhance the results of learning and prediction tasks of HMMs. This approach takes full advantage of both HMMs, with the richness of their algorithms, and CSPs, with their many powerful and efficient solver algorithms. To verify the validity of the model, the proposed approach is evaluated on real-world data using the mean absolute percentage error (MAPE) and other metrics as measures of the prediction accuracy. The conducted experiments show that the proposed model outperforms the conventional model. It reduces the MAPE by 0.71% and offers a particularly good trade-off between computational costs and the quality of results for large datasets. It is also competitive with benchmark models in terms of the running time and prediction accuracy. Further comparisons substantiate these experimental findings.
引用
收藏
相关论文
共 150 条
  • [1] Tsai CW(2015)Big data analytics: a survey J Big Data. 2 1-31
  • [2] Lai CF(2020)A review of machine learning for big data analytics: bibliometric approach Technol Anal Strateg Manag. 32 984-1005
  • [3] Chao HC(2019)Uncertainty in big data analytics: survey, opportunities, and challenges J Big Data. 6 1-16
  • [4] Vasilakos AV(2017)Big data: Dimensions, evolution, impacts, and challenges Business Horizons. 60 293-303
  • [5] El-Alfy ESM(2018)An overview of big data and machine learning paradigms Int Conf Adv Intell Syst Sustain Dev 915 237-251
  • [6] Mohammed SA(2020)Hybrid gradient descent spider monkey optimization (HGDSMO) algorithm for efficient resource scheduling for big data processing in heterogenous environment J Big Data. 7 1-25
  • [7] Hariri RH(2020)Anomaly detection optimization using big data and deep learning to reduce false-positive J Big Data. 7 1-12
  • [8] Fredericks EM(2021)The arithmetic optimization algorithm Comput Methods Appl Mech Eng. 376 113609-20
  • [9] Bowers KM(2020)A systematic review of hidden markov models and their applications Arch Comput Methods Eng 28 1-11
  • [10] Lee I(2020)A novel approach for modeling positive vectors with inverted dirichlet-based hidden markov models Knowl Based Syst. 192 105335-33