ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities

被引:0
作者
Zheng, Chanjin [1 ,2 ]
Yu, Zengyi [2 ,3 ]
Jiang, Yilin [3 ]
Zhang, Mingzi [2 ,3 ]
Lu, Xunuo [4 ]
Jin, Jing [5 ,6 ]
Gao, Liteng [7 ]
机构
[1] East China Normal Univ, Shanghai Inst Artificial Intelligence Educ, Shanghai, Peoples R China
[2] East China Normal Univ, Fac Educ, Shanghai, Peoples R China
[3] Zhejiang Univ Technol, Coll Educ, Hangzhou, Zhejiang, Peoples R China
[4] Zhejiang Univ Technol, Sch Econ, Hangzhou, Zhejiang, Peoples R China
[5] Zhejiang Normal Univ, Sch Educ, Jinhua, Zhejiang, Peoples R China
[6] Tianchang Guanchao Primary Sch, Hangzhou, Zhejiang, Peoples R China
[7] Univ Shanghai Sci & Technol, Sch Artificial Intelligence Sci & Technol, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 2025 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2025 | 2025年
关键词
AI-Assisted Artwork Evaluation; GPT-4o; Multimodal Large Language Models; Human-Computer Interaction Dataset Design; Entity Recognition; Multi-Agent for Iterative Upgrades;
D O I
10.1145/3706598.3713274
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Can Multimodal Large Language Models (MLLMs), with capabilities in perception, recognition, understanding, and reasoning, act as independent assistants in art evaluation dialogues? Current MLLM evaluation methods, reliant on subjective human scoring or costly interviews, lack comprehensive scenario coverage. This paper proposes a process-oriented Human-Computer Interaction (HCI) space design for more accurate MLLM assessment and development. This approach aids teachers in efficient art evaluation and records interactions for MLLM capability assessment. We introduce ArtMentor, a comprehensive space integrating a dataset and three systems for optimized MLLM evaluation. It includes 380 sessions from five art teachers across nine critical dimensions. The modular system features entity recognition, review generation, and suggestion generation agents, enabling iterative upgrades. Machine learning and natural language processing ensure reliable evaluations. Results confirm GPT-4o's effectiveness in assisting teachers in art evaluation dialogues. Our contributions are available at https://artmentor.github.io/.
引用
收藏
页数:18
相关论文
共 70 条
[1]   Cognitive tutors: Lessons learned [J].
Anderson, JR ;
Corbett, AT ;
Koedinger, KR ;
Pelletier, R .
JOURNAL OF THE LEARNING SCIENCES, 1995, 4 (02) :167-207
[2]  
Anil R, 2023, arXiv
[3]  
Awadalla A, 2023, Arxiv, DOI [arXiv:2308.01390, DOI 10.48550/ARXIV.2308.01390, 10.48550/arXiv.2308.01390]
[4]  
Bai JZ, 2023, Arxiv, DOI [arXiv:2308.12966, 10.48550/arXiv.2308.12966]
[5]  
Bai S, 2023, Arxiv, DOI arXiv:2308.16890
[6]   A comprehensive survey on object detection in Visual Art: taxonomy and challenge [J].
Bengamra, Siwar ;
Mzoughi, Olfa ;
Bigand, Andre ;
Zagrouba, Ezzeddine .
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (05) :14637-14670
[7]  
Bergson Henri, 1911, Essai sur les donnees immediates de la conscience
[8]  
Biswas A., 2024, Journal of Artificial Intelligence Research
[9]   Realism [J].
Biswas, Moinak .
BIOSCOPE-SOUTH ASIAN SCREEN STUDIES, 2021, 12 (1-2) :158-161
[10]   USING INTERACTION FRAMEWORK TO GUIDE THE DESIGN OF INTERACTIVE SYSTEMS [J].
BLANDFORD, AE ;
BARNARD, PJ ;
HARRISON, MD .
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 1995, 43 (01) :101-130