Foundation and large language models: fundamentals, challenges, opportunities, and social impacts

Authors
Devon Myers
Rami Mohawesh
Venkata Ishwarya Chellaboina
Anantha Lakshmi Sathvik
Praveen Venkatesh
Yi-Hui Ho
Hanna Henshaw
Muna Alhawawreh
David Berdik
Yaser Jararweh
Affiliations
[1] Duquesne University
[2] Al Ain University
[3] Deakin University
Source
Cluster Computing | 2024 / Vol. 27
Keywords
Natural language processing; Foundation models; Large language models; Advanced pre-trained models; Artificial intelligence; Machine learning
DOI
Not available
Abstract
Foundation and Large Language Models (FLLMs) are models trained on massive amounts of data with the intent of performing a variety of downstream tasks. FLLMs are promising drivers for different domains, such as Natural Language Processing (NLP) and other AI-related applications. These models emerged from a paradigm shift in AI that involves using pre-trained language models (PLMs) and extensive data to train transformer models. FLLMs have also demonstrated impressive proficiency across a wide range of NLP applications, including language generation, summarization, comprehension, complex reasoning, and question answering, among others. In recent years, there has been unprecedented interest in FLLM-related research, driven by contributions from both academic institutions and industry players. Notably, the development of ChatGPT, a highly capable AI chatbot built around FLLM concepts, has garnered considerable interest from various segments of society. The technological advancement of large language models (LLMs) has had a significant influence on the broader artificial intelligence (AI) community, potentially transforming how AI systems are developed and used. Our study provides a comprehensive survey of existing resources related to the development of FLLMs and addresses current concerns, challenges, and social impacts. Moreover, we highlight current research gaps and potential future directions in this emerging and promising field.
Pages: 1 / 26
Page count: 25