Open-Source Text-to-Image Models: Evaluation using Metrics and Human Perception

被引:0
作者
Yamac, Aylin [1 ]
Genc, Dilan [1 ]
Zaman, Esra [1 ]
Gerschner, Felix [1 ]
Klaiber, Marco [1 ]
Theissler, Andreas [1 ]
机构
[1] Aalen Univ Appl Sci, Aalen, Germany
来源
2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024 | 2024年
关键词
text-to-image; open-source; weaknesses;
D O I
10.1109/COMPSAC61105.2024.00261
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-to-image models, which aim to convert text input into images, have gained popularity partly due to their flexibility and user-friendliness. However, there are still weaknesses in the generation of images intended to display emotions, visual text, multiple objects, relative positioning, and attribute binding. This study analyzes the weaknesses of three open-source models: Stable Diffusion v2-1, Openjourney, and Dreamlike Photoreal 2.0. The models are compared based on scores for quality, alignment, and aesthetics. The evaluation is based on (a) the metrics ClipScore, Frechet Inception Distance (FID), and Large-scale Artificial Intelligence Open Network (LAION) and (b) human perception obtained in user surveys. The evaluation revealed that all models show predominantly unsatisfactory performance, and the identified weaknesses were confirmed.
引用
收藏
页码:1659 / 1664
页数:6
相关论文
共 50 条
  • [41] Predicting Code Hotspots in Open-Source Software from Object-Oriented Metrics Using Machine Learning
    Hilton, Rod
    Gethner, Ellen
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2018, 28 (03) : 311 - 331
  • [42] Automated Generation of Lung Cytological Images from Image Findings Using Text-to-Image Technology
    Teramoto, Atsushi
    Kiriyama, Yuka
    Michiba, Ayano
    Yazawa, Natsuki
    Tsukamoto, Tetsuya
    Imaizumi, Kazuyoshi
    Fujita, Hiroshi
    [J]. COMPUTERS, 2024, 13 (11)
  • [43] LOW COST EDUCATIONAL PLATFORM FOR ROBOTICS, USING OPEN-SOURCE 3D PRINTERS AND OPEN-SOURCE HARDWARE
    Garcia-Saura, Carlos
    Gonzalez-Gomez, Juan
    [J]. 5TH INTERNATIONAL CONFERENCE OF EDUCATION, RESEARCH AND INNOVATION (ICERI 2012), 2012, : 2699 - 2706
  • [44] CustusX: an open-source research platform for image-guided therapy
    Askeland, Christian
    Solberg, Ole Vegard
    Bakeng, Janne Beate Lervik
    Reinertsen, Ingerid
    Tangen, Geir Arne
    Hofstad, Erlend Fagertun
    Iversen, Daniel Hoyer
    Vapenstad, Cecilie
    Selbekk, Tormod
    Lango, Thomas
    Hernes, Toril A. Nagelhus
    Leira, Hakon Olav
    Unsgard, Geirmund
    Lindseth, Frank
    [J]. INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2016, 11 (04) : 505 - 519
  • [45] FLR: an open-source framework for the evaluation and development of management strategies
    Kell, L. T.
    Mosqueira, I.
    Grosjean, P.
    Fromentin, J-M.
    Garcia, D.
    Hillary, R.
    Jardim, E.
    Mardle, S.
    Pastoors, M. A.
    Poos, J. J.
    Scott, F.
    Scott, R. D.
    [J]. ICES JOURNAL OF MARINE SCIENCE, 2007, 64 (04) : 640 - 646
  • [46] SlicerHeart: An open-source computing platform for cardiac image analysis and modeling
    Lasso, Andras
    Herz, Christian
    Nam, Hannah
    Cianciulli, Alana
    Pieper, Steve
    Drouin, Simon
    Pinter, Csaba
    St-Onge, Samuelle
    Vigil, Chad
    Ching, Stephen
    Sunderland, Kyle
    Fichtinger, Gabor
    Kikinis, Ron
    Jolley, Matthew A.
    [J]. FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9
  • [47] CustusX: an open-source research platform for image-guided therapy
    Christian Askeland
    Ole Vegard Solberg
    Janne Beate Lervik Bakeng
    Ingerid Reinertsen
    Geir Arne Tangen
    Erlend Fagertun Hofstad
    Daniel Høyer Iversen
    Cecilie Våpenstad
    Tormod Selbekk
    Thomas Langø
    Toril A. Nagelhus Hernes
    Håkon Olav Leira
    Geirmund Unsgård
    Frank Lindseth
    [J]. International Journal of Computer Assisted Radiology and Surgery, 2016, 11 : 505 - 519
  • [48] An open-source image classifier for characterizing recreational activities across landscapes
    Winder, Samantha G.
    Lee, Heera
    Seo, Bumsuk
    Lia, Emilia H.
    Wood, Spencer A.
    [J]. PEOPLE AND NATURE, 2022, 4 (05) : 1249 - 1262
  • [49] SIMPA: an open-source toolkit for simulation and image processing for photonics and acoustics
    Groehl, Janek
    Dreher, Kris K.
    Schellenberg, Melanie
    Rix, Tom
    Holzwarth, Niklas
    Vieten, Patricia
    Ayala, Leonardo
    Bohndiek, Sarah E.
    Seitel, Alexander
    Maier-Hein, Lena
    [J]. JOURNAL OF BIOMEDICAL OPTICS, 2022, 27 (08)
  • [50] PyRBD: An Open-Source Reliability Block Diagram Evaluation Tool
    Janardhanan, Shakthivelu
    Badnava, Sareh
    Agarwal, Ritanshi
    Mas-Machuca, Carmen
    [J]. 2024 IEEE INTERNATIONAL WORKSHOP TECHNICAL COMMITTEE ON COMMUNICATIONS QUALITY AND RELIABILITY, CQR 2024, 2024, : 19 - 24