Could ChatGPT Imagine: Content Control for Artistic Painting Generation Via Large Language Models

被引:7
作者
Lu, Yue [1 ]
Guo, Chao [2 ]
Dou, Yong [3 ]
Dai, Xingyuan [2 ]
Wang, Fei-Yue [2 ]
机构
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[2] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China
[3] Macao Univ Sci & Technol, Macao Inst Syst Engn, Macau 999078, Peoples R China
关键词
Intelligent systems; Human-machine interactions; Artistic painting generation; Large language model; ChatGPT; Linguistic intelligence; PARALLEL; METAVERSES;
D O I
10.1007/s10846-023-01956-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Intelligent systems and human-machine interactions have consistently provided convenience in both work and daily life. Artificial Intelligence Generated Content (AIGC) can assist humans in artistic creation by generating painting images based on textual descriptions. However, the quality of generated painting images depends heavily on well-designed prompts, which are labor-intensive and time-consuming in painting creation. Large Language Models (LLMs) like ChatGPT have shown impressive performance in linguistic tasks such as question answering and logical inference, demonstrating strong linguistic intelligence. This paper proposes an assistant painting creation approach to provide precise content control for painting generation by combining LLMs with text-to-image generative models and evaluates the performance of the proposed approach on painting content generation and painting element arrangement. The experimental results show that our approach can provide clear guidance on rich painting content and reasonable arrangements of painting elements, demonstrating its ability of text-based painting scene imagination. In painting generation tasks, LLMs like ChatGPT can help the text-to-image models with precise control over the painting content and improve the overall painting results.
引用
收藏
页数:15
相关论文
共 68 条
[1]  
Antaki F., 2023, medRxiv, P2023
[2]  
Bang Y, 2023, Arxiv, DOI [arXiv:2302.04023, DOI 10.48550/ARXIV.2302.04023]
[3]  
Bubeck S, 2023, Arxiv, DOI [arXiv:2303.12712, 10.48550/arXiv.2303.12712, DOI 10.48550/ARXIV.2303.12712]
[4]   Research on Navigation Line Extraction of Garden Mobile Robot Based on Edge Detection [J].
Chen, Jiqing ;
Wang, Zhikui ;
Long, Teng ;
Wu, Jiahua ;
Cai, Ganwei ;
Zhang, Hongdu .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 105 (02)
[5]   Image-based traffic signal control via world models [J].
Dai, Xingyuan ;
Zhao, Chen ;
Wang, Xiao ;
Lv, Yisheng ;
Lin, Yilun ;
Wang, Fei-Yue .
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 23 (12) :1795-1813
[6]  
Ding BS, 2023, Arxiv, DOI [arXiv:2212.10450, DOI 10.48550/ARXIV.2212.10450, 10.48550/arXiv.2212.10450]
[7]   sBotics-Gamified Framework for Educational Robotics [J].
do Nascimento, Lucas Moura ;
Neri, Davi Souto ;
Ferreira, Thiago do Nascimento ;
Pereira, Francinaldo de Almeida ;
Albuquerque, Erika Akemi Yanaguibashi ;
Goncalves, Luiz Marcos Garcia ;
Sa, Sarah Thomaz de Lima .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (01)
[8]   Cognitive-Based Crack Detection for Road Maintenance: An Integrated System in Cyber-Physical-Social Systems [J].
Fan, Lili ;
Cao, Dongpu ;
Zeng, Changxian ;
Li, Bai ;
Li, Yunjie ;
Wang, Fei-Yue .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (06) :3485-3500
[9]  
Frieder S, 2023, Arxiv, DOI [arXiv:2301.13867, DOI 10.48550/ARXIV.2301.13867]
[10]  
Guo BY, 2023, Arxiv, DOI [arXiv:2301.07597, 10.48550/arXiv.2301.07597 2301.07597, 10.48550/arXiv.2301.07597]