Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Citations: 0
Authors
Chen, Zixiang [1 ]
Deng, Yihe [1 ]
Yuan, Huizhuo [1 ]
Ji, Kaixuan [1 ]
Gu, Quanquan [1 ]
Affiliations
[1] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90095 USA
Source
INTERNATIONAL CONFERENCE ON MACHINE LEARNING | 2024 / Vol. 235
Keywords
GAME;
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104 ; 0812 ; 0835 ; 1405
Abstract
Harnessing the power of human-annotated data through Supervised Fine-Tuning (SFT) is pivotal for advancing Large Language Models (LLMs). In this paper, we delve into the prospect of growing a strong LLM out of a weak one without the need for acquiring additional human-annotated data. We propose a new fine-tuning method called Self-Play fIne-tuNing (SPIN), which starts from a supervised fine-tuned model. At the heart of SPIN lies a self-play mechanism, where the LLM refines its capability by playing against instances of itself. More specifically, the LLM generates its own training data from its previous iterations, refining its policy by discerning these self-generated responses from those obtained from human-annotated data. Our method progressively elevates the LLM from a nascent model to a formidable one, unlocking the full potential of human-annotated demonstration data for SFT. Theoretically, we prove that the global optimum to the training objective function of our method is achieved only when the LLM policy aligns with the target data distribution. Empirically, we evaluate our method on several benchmark datasets including the HuggingFace Open LLM Leaderboard, MT-Bench, and datasets from Big-Bench. Our results show that SPIN can significantly improve the LLM's performance across a variety of benchmarks and even outperform models trained through direct preference optimization (DPO) supplemented with extra GPT-4 preference data. This sheds light on the promise of self-play, enabling the achievement of human-level performance in LLMs without the need for expert opponents. Codes are available at https://github.com/uclaml/SPIN.
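The abstract describes SPIN's core step: the model is trained to distinguish human-annotated responses from responses generated by its own previous iterate. A minimal sketch of such a pairwise objective is below — a logistic loss on the log-probability margin between the two response types, in the style of DPO-like objectives. The function name `spin_loss`, the argument names, and the scaling parameter `lam` are illustrative assumptions, not the paper's exact formulation.

```python
import math

def spin_loss(logp_new_human, logp_old_human,
              logp_new_synth, logp_old_synth, lam=1.0):
    """Pairwise logistic loss for one (human, self-generated) response pair.

    logp_new_*: log-probability of the response under the model being trained.
    logp_old_*: log-probability under the previous iterate (the "opponent").
    The loss pushes the new model to raise the likelihood ratio on the
    human response and lower it on the self-generated one.
    """
    margin = lam * ((logp_new_human - logp_old_human)
                    - (logp_new_synth - logp_old_synth))
    # Logistic loss: small when the human response is clearly preferred.
    return math.log(1.0 + math.exp(-margin))

# At initialization the new model equals the old one, so every margin is 0
# and the loss sits at log(2); training drives it downward.
```

Iterating this — regenerating the synthetic responses from the freshly trained model and repeating — is the self-play loop the abstract refers to.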
Pages: 22