Flexible content-aware image synthesis for maritime tasks with diffusion models

被引:0
|
作者
Xue, Zhenfeng [1 ,3 ]
Hu, Yuanqi [2 ,3 ]
Lu, Ankang [2 ]
Chen, Zhuo [4 ]
Zang, Ying [2 ]
Miao, Zhonghua [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, 99 Shangda Rd, Shanghai 200444, Peoples R China
[2] Huzhou Univ, Sch Informat Engn, 759 Second Ring East Rd, Huzhou 313000, Peoples R China
[3] Zhejiang Univ, Res Ctr Marine Robot, Huzhou Inst, 819 Xisaishan Rd, Huzhou 313098, Peoples R China
[4] Zhejiang Univ, Sch Control Sci & Engn, 38 Zheda Rd, Hangzhou 310027, Peoples R China
关键词
Image synthesis; Content-aware; Diffusion model; Maritime environmental perception; OBSTACLE DETECTION; DATASET;
D O I
10.1016/j.apor.2025.104511
中图分类号
P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Maritime environmental perception suffers greatly from data lack due to the high cost of data collection at sea. In this paper, a novel image synthesis method is proposed to automatically generate target images with diverse foreground and background. Specifically, foreground images for various poses are generated using a diffusion model, presenting different modalities of the detected target. The environment conditions of the background images are flexibly adjusted by inputting semantic prompts to another diffusion model. Then a 3D affine diffusion model is proposed for effective fusion of foreground and background. This module calculates the size and position of the foreground image within the background image through affine transformation, and utilizes the excellent image fusion ability of the diffusion model to achieve high-quality image synthesis. As a result, a set of dynamically variable foreground and background images are generated to increase the pose and weather diversity of maritime object detection samples. Extensive experiments are conducted to verify the effectiveness of image synthesis algorithms, and this method can also serve downstream tasks, effectively improving the accuracy of maritime environmental perception algorithms. The code is available at https://github.com/xuezhen2018/flexible_content_aware_image_synthesis.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A Content-Aware Image Prior
    Cho, Taeg Sang
    Joshi, Neel
    Zitnick, C. Lawrence
    Kang, Sing Bing
    Szeliski, Richard
    Freeman, William T.
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 169 - 176
  • [2] Optimized Content-Aware Image Resizing with Merging and Improved Importance Diffusion
    Danfeng Zhao
    Bo Wang
    Journal of Harbin Institute of Technology, 2015, 22 (02) : 67 - 73
  • [3] Optimized content-aware image resizing with merging and improved importance diffusion
    Zhao, Danfeng
    Wang, Bo
    Journal of Harbin Institute of Technology (New Series), 2015, 22 (02) : 67 - 73
  • [4] Content-aware preserving image generation
    Le, Giang H.
    Nguyen, Anh Q.
    Kang, Byeongkeun
    Lee, Yeejin
    NEUROCOMPUTING, 2025, 617
  • [5] Optimized Content-Aware Image Resizing with Merging and Improved Importance Diffusion
    Danfeng Zhao
    Bo Wang
    Journal of Harbin Institute of Technology(New series), 2015, (02) : 67 - 73
  • [6] Multimodal Content-Aware Image Thumbnailing
    Yamamoto, Kohei
    Kobayashi, Hayato
    Tagami, Yukihiro
    Nakayama, Hideki
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 129 - 130
  • [7] CONTENT-AWARE NEURON IMAGE ENHANCEMENT
    Liang, Haoyi
    Acton, Scott T.
    Weller, Daniel S.
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3510 - 3514
  • [8] Content-Aware Warping for View Synthesis
    Guo, Mantang
    Hou, Junhui
    Jin, Jing
    Liu, Hui
    Zeng, Huanqiang
    Lu, Jiwen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 9486 - 9503
  • [9] Content-Aware Image Resizing Based on Aesthetic
    Sheu, Jia-Shing
    Kao, Yi-Ching
    Chu, Hao
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2012), PT I, 2012, 7196 : 136 - 145
  • [10] LOW COMPLEXITY CONTENT-AWARE IMAGE RETARGETING
    Sun, Kairan
    Yan, Bo
    Gao, Yiqi
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 2105 - 2108