Flexible content-aware image synthesis for maritime tasks with diffusion models

被引:0
|
作者
Xue, Zhenfeng [1 ,3 ]
Hu, Yuanqi [2 ,3 ]
Lu, Ankang [2 ]
Chen, Zhuo [4 ]
Zang, Ying [2 ]
Miao, Zhonghua [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, 99 Shangda Rd, Shanghai 200444, Peoples R China
[2] Huzhou Univ, Sch Informat Engn, 759 Second Ring East Rd, Huzhou 313000, Peoples R China
[3] Zhejiang Univ, Res Ctr Marine Robot, Huzhou Inst, 819 Xisaishan Rd, Huzhou 313098, Peoples R China
[4] Zhejiang Univ, Sch Control Sci & Engn, 38 Zheda Rd, Hangzhou 310027, Peoples R China
关键词
Image synthesis; Content-aware; Diffusion model; Maritime environmental perception; OBSTACLE DETECTION; DATASET;
D O I
10.1016/j.apor.2025.104511
中图分类号
P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Maritime environmental perception suffers greatly from data lack due to the high cost of data collection at sea. In this paper, a novel image synthesis method is proposed to automatically generate target images with diverse foreground and background. Specifically, foreground images for various poses are generated using a diffusion model, presenting different modalities of the detected target. The environment conditions of the background images are flexibly adjusted by inputting semantic prompts to another diffusion model. Then a 3D affine diffusion model is proposed for effective fusion of foreground and background. This module calculates the size and position of the foreground image within the background image through affine transformation, and utilizes the excellent image fusion ability of the diffusion model to achieve high-quality image synthesis. As a result, a set of dynamically variable foreground and background images are generated to increase the pose and weather diversity of maritime object detection samples. Extensive experiments are conducted to verify the effectiveness of image synthesis algorithms, and this method can also serve downstream tasks, effectively improving the accuracy of maritime environmental perception algorithms. The code is available at https://github.com/xuezhen2018/flexible_content_aware_image_synthesis.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Patchwise scaling method for content-aware image resizing
    Liang, Yun
    Su, Zhuo
    Luo, Xiaonan
    SIGNAL PROCESSING, 2012, 92 (05) : 1243 - 1257
  • [22] Adaptive Energy Selection for Content-Aware Image Resizing
    Sasaki, Kazuma
    Nagahama, Yuya
    Ze, Zheng
    Iizuka, Satoshi
    Simo-Serra, Edgar
    Mochizuki, Yoshihiko
    Ishikawa, Hiroshi
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 852 - 857
  • [23] Real-time content-aware image resizing
    ROSIN Paul L.
    ScienceinChina(SeriesF:InformationSciences), 2009, 52 (02) : 172 - 182
  • [24] Content-aware compressive magnetic resonance image reconstruction
    Weller, Daniel S.
    Salerno, Michael
    Meyer, Craig H.
    MAGNETIC RESONANCE IMAGING, 2018, 52 : 118 - 130
  • [25] Improved Seam Merging for Content-Aware Image Resizing
    Mishiba, Kazu
    Ikehara, Masaaki
    Yoshitome, Takeshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (02): : 349 - 356
  • [26] Real-time content-aware image resizing
    ROSIN Paul L.
    Science China(Information Sciences), 2009, (02) : 172 - 182
  • [27] Real-time content-aware image resizing
    Hua Huang
    TianNan Fu
    Paul L. Rosin
    Chun Qi
    Science in China Series F: Information Sciences, 2009, 52
  • [28] Content-aware image resizing based on hybrid energy
    Lei L.-X.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (10): : 2015 - 2021
  • [29] Real-time content-aware image resizing
    Huang Hua
    Fu TianNan
    Rosin, Paul L.
    Qi Chun
    SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2009, 52 (02): : 172 - 182
  • [30] Target-preserving Content-aware Image Resizing
    Chuang, Cheng-Hung
    Shi, Zhong-Yan
    2012 SIXTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING (ICGEC), 2012, : 19 - 22