Big data grace: Implementations of the feature engineering and data science algorithms for environmental protection law

被引:0
|
作者
Wu, Wenyue [1 ]
Zhao, Yiming [2 ]
机构
[1] Anhui Med Univ, Coll Law, Hefei 230032, Anhui, Peoples R China
[2] Anhui Univ, Sch Law, Hefei 230601, Anhui, Peoples R China
关键词
Big data; Feature engineering; Data science algorithms; Environmental protection law; Machine learning;
D O I
10.1016/j.aej.2025.03.121
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This study is intended to predict CO2 emissions using a set of features. With this aim, three machine learning (ML) algorithms have been used, namely, support vector regression (SVR), Long Short Term Memory (LSTM), and multilayer perceptron (MLP). First of all, correlation analysis was performed which revealed a low level of multicollinearity among the set of features. Hereafter, moving towards the modeling and compared the ML models, the findings showed that SVR (Linear) is the most reliable one, showing superiority to the rest by having the least Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE) values. On the contrary, the Dynamic LSTM model demonstrated the worst performance across all evaluated metrics. Specifically, it showed the highest values for MSE, RMSE, MAE, and MAPE. Static LSTM and SVR (RBF) models performed moderately, with Static LSTM marginally outperforming SVR (RBF) on the evaluation metrics like MAE and MSE. This will provide insight into guiding policy decisions in the future regarding strategies on environmental management and demographics development. This study highlights ML's role in environmental monitoring, aiding policymakers with data-driven strategies to reduce CO2 emissions and shape sustainable policies.
引用
收藏
页码:256 / 264
页数:9
相关论文
共 50 条
  • [21] Data science, big data and statistics
    Galeano, Pedro
    Pena, Daniel
    TEST, 2019, 28 (02) : 289 - 329
  • [22] Statistics, data science, and big data
    Kauermann G.
    Küchenhoff H.
    AStA Wirtschafts- und Sozialstatistisches Archiv, 2016, 10 (2-3) : 141 - 150
  • [23] Data Science vs Big Data @ UTM Big Data Centre
    Shamsuddin, Siti Mariyam
    Hasan, Shafaatunnur
    2015 International Conference on Science in Information Technology (ICSITech), 2015, : 1 - 4
  • [24] A Data Science and Engineering Solution for Fast k-Means Clustering of Big Data
    Dierckens, Karl E.
    Harrison, Adrian B.
    Leung, Carson K.
    Pind, Adrienne V.
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS / 11TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING / 14TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, 2017, : 925 - 932
  • [25] The Challenge of Big Data and Data Science
    Brady, Henry E.
    ANNUAL REVIEW OF POLITICAL SCIENCE, VOL 22, 2019, 22 : 297 - 323
  • [26] Big Data Science
    Morik, Katharina
    Bockermann, Christian
    Buschjaeger, Sebastian
    KUNSTLICHE INTELLIGENZ, 2018, 32 (01): : 27 - 36
  • [27] Big data in forensic science and medicine
    Lefevre, Thomas
    JOURNAL OF FORENSIC AND LEGAL MEDICINE, 2018, 57 : 1 - 6
  • [28] Big genetic data and its big data protection challenges
    Quinn, Paul
    Quinn, Liam
    COMPUTER LAW & SECURITY REVIEW, 2018, 34 (05) : 1000 - 1018
  • [29] Big Data, Data Science, and Causal Inference: A Primer for Clinicians
    Raita, Yoshihiko
    Camargo, Carlos A.
    Liang, Liming
    Hasegawa, Kohei
    FRONTIERS IN MEDICINE, 2021, 8
  • [30] Data protection in heterogeneous big data systems
    Poltavtseva, M. A.
    Aleksandrova, E. B.
    Shmatov, V. S.
    Zegzhda, P. D.
    JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2023, 19 (03) : 451 - 458