ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities

被引:0
作者
Zheng, Chanjin [1 ,2 ]
Yu, Zengyi [2 ,3 ]
Jiang, Yilin [3 ]
Zhang, Mingzi [2 ,3 ]
Lu, Xunuo [4 ]
Jin, Jing [5 ,6 ]
Gao, Liteng [7 ]
机构
[1] East China Normal Univ, Shanghai Inst Artificial Intelligence Educ, Shanghai, Peoples R China
[2] East China Normal Univ, Fac Educ, Shanghai, Peoples R China
[3] Zhejiang Univ Technol, Coll Educ, Hangzhou, Zhejiang, Peoples R China
[4] Zhejiang Univ Technol, Sch Econ, Hangzhou, Zhejiang, Peoples R China
[5] Zhejiang Normal Univ, Sch Educ, Jinhua, Zhejiang, Peoples R China
[6] Tianchang Guanchao Primary Sch, Hangzhou, Zhejiang, Peoples R China
[7] Univ Shanghai Sci & Technol, Sch Artificial Intelligence Sci & Technol, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 2025 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2025 | 2025年
关键词
AI-Assisted Artwork Evaluation; GPT-4o; Multimodal Large Language Models; Human-Computer Interaction Dataset Design; Entity Recognition; Multi-Agent for Iterative Upgrades;
D O I
10.1145/3706598.3713274
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Can Multimodal Large Language Models (MLLMs), with capabilities in perception, recognition, understanding, and reasoning, act as independent assistants in art evaluation dialogues? Current MLLM evaluation methods, reliant on subjective human scoring or costly interviews, lack comprehensive scenario coverage. This paper proposes a process-oriented Human-Computer Interaction (HCI) space design for more accurate MLLM assessment and development. This approach aids teachers in efficient art evaluation and records interactions for MLLM capability assessment. We introduce ArtMentor, a comprehensive space integrating a dataset and three systems for optimized MLLM evaluation. It includes 380 sessions from five art teachers across nine critical dimensions. The modular system features entity recognition, review generation, and suggestion generation agents, enabling iterative upgrades. Machine learning and natural language processing ensure reliable evaluations. Results confirm GPT-4o's effectiveness in assisting teachers in art evaluation dialogues. Our contributions are available at https://artmentor.github.io/.
引用
收藏
页数:18
相关论文
共 70 条
[51]  
Pylypchuk Oksana, 2021, 2021 INT C SOC SCI B, P181
[52]   Initial Images: Using Image Prompts to Improve Subject Representation in Multimodal AI Generated Art [J].
Qiao, Han ;
Liu, Vivian ;
Chilton, Lydia B. .
PROCEEDINGS OF THE 14TH CREATIVITY AND COGNITION, C&C 2022, 2022, :15-28
[53]   Rule-selection and action-selection have a shared neuroanatomical basis in the human prefrontal and parietal cortex [J].
Rowe, J. ;
Hughes, L. ;
Eckstein, D. ;
Owen, A. M. .
CEREBRAL CORTEX, 2008, 18 (10) :2275-2285
[54]   Capturing the Imagination: Arts-Informed Inquiry as a Method in Program Evaluation [J].
Searle, Michelle J. ;
Shulha, Lyn M. .
CANADIAN JOURNAL OF PROGRAM EVALUATION, 2016, 31 (01) :34-60
[55]  
Seo Woosuk, 2022, ECSCW
[56]   Discovering the Defects in Paintings Using Non-destructive Testing (NDT) Techniques and Passing Through Measurements of Deformation [J].
Sfarra, S. ;
Ibarra-Castanedo, C. ;
Ambrosini, D. ;
Paoletti, D. ;
Bendada, A. ;
Maldague, X. .
JOURNAL OF NONDESTRUCTIVE EVALUATION, 2014, 33 (03) :358-383
[57]  
Shi P., 1992, Proceedings of the 1992 American Control Conference (IEEE Cat. No.92CH3072-6), P1976
[58]   The multi-agent data collection in HLA-based simulation system [J].
Song, Hen-Jie ;
Shen, Zhi-Qi ;
Miao, Chun-Yan ;
Tan, Ah-Hwee ;
Zhao, Guo-Peng .
21ST INTERNATIONAL WORKSHOP ON PRINCIPLES OF ADVANCED AND DISTRIBUTED SIMULATION, PROCEEDINGS, 2007, :61-+
[59]  
Talebirad Y, 2023, Arxiv, DOI arXiv:2306.03314
[60]   From Local Patterns to Global Models: Towards Domain Driven Educational Process Mining [J].
Trcka, Nikola ;
Pechenizkiy, Mykola .
2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, :1114-1119