Relation-Aware Diffusion Model for Controllable Poster Layout Generation

被引:9
作者
Li, Fengheng [1 ]
Liu, An [2 ]
Feng, Wei [2 ]
Zhu, Honghe [2 ]
Li, Yaoyu [2 ]
Zhang, Zheng [2 ]
Lv, Jingjing [2 ]
Zhu, Xin [2 ]
Shen, Junjie [2 ]
Lin, Zhangang [2 ]
Shao, Jingping [2 ]
机构
[1] Nankai Univ, TKLNDST, CS, Tianjin, Peoples R China
[2] JD, Retail Platform Operat & Mkt Ctr, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023 | 2023年
关键词
Poster layout generation; Diffusion model; Controllable generation; Relation-aware;
D O I
10.1145/3583780.3615028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Poster layout is a crucial aspect of poster design. Prior methods primarily focus on the correlation between visual content and graphic elements. However, a pleasant layout should also consider the relationship between visual and textual contents and the relationship between elements. In this study, we introduce a relation-aware diffusion model for poster layout generation that incorporates these two relationships in the generation process. Firstly, we devise a visual-textual relation-aware module that aligns the visual and textual representations across modalities, thereby enhancing the layout's efficacy in conveying textual information. Subsequently, we propose a geometry relation-aware module that learns the geometry relationship between elements by comprehensively considering contextual information. Additionally, the proposed method can generate diverse layouts based on user constraints. To advance research in this field, we have constructed a poster layout dataset named CGL-Dataset V2. Our proposed method outperforms state-of-the-art methods on CGL-Dataset V2. The data and code will be available at https://github.com/liuan0803/RADM.
引用
收藏
页码:1249 / 1258
页数:10
相关论文
共 34 条
[1]  
Austin J, 2021, ADV NEUR IN
[2]   Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models [J].
Blattmann, Andreas ;
Rombach, Robin ;
Ling, Huan ;
Dockhorn, Tim ;
Kim, Seung Wook ;
Fidler, Sanja ;
Kreis, Karsten .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22563-22575
[3]   Automatic Stylistic Manga Layout [J].
Cao, Ying ;
Chan, Antoni B. ;
Lau, Rynson W. H. .
ACM TRANSACTIONS ON GRAPHICS, 2012, 31 (06)
[4]  
Cao Yunning, 2022, P 30 ACM INT C MULT
[5]  
Chen Shoufa, 2022, ARXIV221109788
[6]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]  
Ho Jonathan., 2020, P 34 INT C NEURAL IN, P6840
[9]   Unifying Layout Generation with a Decoupled Diffusion Model [J].
Hui, Mude ;
Zhang, Zhizheng ;
Zhang, Xiaoyi ;
Xie, Wenxuan ;
Wang, Yuwang ;
Lu, Yan .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :1942-1951
[10]   LayoutDM: Discrete Diffusion Model for Controllable Layout Generation [J].
Inoue, Naoto ;
Kikuchi, Kotaro ;
Simo-Serra, Edgar ;
Otani, Mayu ;
Yamaguchi, Kota .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :10167-10176