Study on the Performance Optimization and Application of Big Model in Big Data Processing

被引:0
作者
Wen, Zebin [1 ]
Wang, Ping [1 ]
Zhang, Jiuyang [1 ]
Xiong, Ping [1 ]
机构
[1] Guangdong Univ Sci & Technol, Dongguan, Guangdong, Peoples R China
来源
2024 6TH INTERNATIONAL CONFERENCE ON DATA-DRIVEN OPTIMIZATION OF COMPLEX SYSTEMS, DOCS 2024 | 2024年
关键词
Big Data Processing; Data Mining; Parallel Computing; Feature Engineering; Data Preprocessing;
D O I
10.1109/DOCS63458.2024.10704388
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the surge in data volume, big data processing faces unprecedented challenges, among which large models have become a hot research topic due to their powerful data processing capabilities. This paper delves into the performance bottlenecks of large models in big data processing and proposes a series of performance optimization strategies. Through a review of existing data processing technologies and large model architectures, combined with optimization theory and practice, this study introduces a comprehensive optimization mechanism that includes resource scheduling, model computational efficiency, and storage and IO. Experiments were conducted in a cloud computing environment to validate these strategies. The results indicate that the optimization strategies significantly enhanced performance when processing different scales of data, improved load balancing and resource utilization, and increased system stability. This research enriches the theoretical study of big data processing and provides effective optimization avenues for the practical application of large models in fields such as data mining and parallel computing. It offers guidance for feature engineering and data preprocessing and paves the way for future research directions.
引用
收藏
页码:650 / 657
页数:8
相关论文
共 15 条
  • [1] Search for the Standard Model Higgs boson in the H → WW(*) → lvlv decay mode with 4.7 fb-1 of ATLAS data at √s=7 TeV
    Aad, G.
    Abbott, B.
    Abdallah, J.
    Khalek, S. Abdel
    Abdelalim, A. A.
    Abdinov, O.
    Abi, B.
    Abolins, M.
    AbouZeid, O. S.
    Abramowicz, H.
    Abreu, H.
    Acerbi, E.
    Acharya, B. S.
    Adamczyk, L.
    Adams, D. L.
    Addy, T. N.
    Adelman, J.
    Adomeit, S.
    Adragna, P.
    Adye, T.
    Aefsky, S.
    Aguilar-Saavedra, J. A.
    Agustoni, M.
    Aharrouche, M.
    Ahlen, S. P.
    Ahles, F.
    Ahmad, A.
    Ahsan, M.
    Aielli, G.
    Akdogan, T.
    Akesson, T. P. A.
    Akimoto, G.
    Akimov, A. V.
    Alam, M. S.
    Alam, M. A.
    Albert, J.
    Albrand, S.
    Aleksa, M.
    Aleksandrov, I. N.
    Alessandria, F.
    Alexa, C.
    Alexander, G.
    Alexandre, G.
    Alexopoulos, T.
    Alhroob, M.
    Aliev, M.
    Alimonti, G.
    Alison, J.
    Allbrooke, B. M. M.
    Allport, P. P.
    [J]. PHYSICS LETTERS B, 2012, 716 (01) : 62 - 81
  • [2] [冯兴杰 Feng Xingjie], 2019, [计算机应用研究, Application Research of Computers], V36, P18
  • [3] Huang QF, 2020, 2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2020), P149, DOI [10.1109/icaibd49809.2020.9137455, 10.1109/ICAIBD49809.2020.9137455]
  • [4] Jiang Weiwei, 2020, Journal of Southwest University for Nationalities (Humanities and Social Sciences edition), V41, P234
  • [5] Li Xinyao, Journal of Computer Science
  • [6] PAUL P K, 2018, J. Social Science Electronic Publishing, V10, P96
  • [7] An application of the dynamic knowledge creation model in big data
    Philip, Jestine
    [J]. TECHNOLOGY IN SOCIETY, 2018, 54 : 120 - 127
  • [8] Adaptive Computing-Plus-Communication Optimization Framework for Multimedia Processing in Cloud Systems
    Shojafar, Mohammad
    Canali, Claudia
    Lancellotti, Riccardo
    Abawajy, Jemal
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (04) : 1162 - 1175
  • [9] Sun Xiaohua, Journal of Dalian University of Technology
  • [10] [吴江 Wu Jiang], 2019, [数据分析与知识发现, Data Analysis and Knowledge Discovery], V3, P2