Flexible content-aware image synthesis for maritime tasks with diffusion models

被引:0
|
作者
Xue, Zhenfeng [1 ,3 ]
Hu, Yuanqi [2 ,3 ]
Lu, Ankang [2 ]
Chen, Zhuo [4 ]
Zang, Ying [2 ]
Miao, Zhonghua [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, 99 Shangda Rd, Shanghai 200444, Peoples R China
[2] Huzhou Univ, Sch Informat Engn, 759 Second Ring East Rd, Huzhou 313000, Peoples R China
[3] Zhejiang Univ, Res Ctr Marine Robot, Huzhou Inst, 819 Xisaishan Rd, Huzhou 313098, Peoples R China
[4] Zhejiang Univ, Sch Control Sci & Engn, 38 Zheda Rd, Hangzhou 310027, Peoples R China
关键词
Image synthesis; Content-aware; Diffusion model; Maritime environmental perception; OBSTACLE DETECTION; DATASET;
D O I
10.1016/j.apor.2025.104511
中图分类号
P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Maritime environmental perception suffers greatly from data lack due to the high cost of data collection at sea. In this paper, a novel image synthesis method is proposed to automatically generate target images with diverse foreground and background. Specifically, foreground images for various poses are generated using a diffusion model, presenting different modalities of the detected target. The environment conditions of the background images are flexibly adjusted by inputting semantic prompts to another diffusion model. Then a 3D affine diffusion model is proposed for effective fusion of foreground and background. This module calculates the size and position of the foreground image within the background image through affine transformation, and utilizes the excellent image fusion ability of the diffusion model to achieve high-quality image synthesis. As a result, a set of dynamically variable foreground and background images are generated to increase the pose and weather diversity of maritime object detection samples. Extensive experiments are conducted to verify the effectiveness of image synthesis algorithms, and this method can also serve downstream tasks, effectively improving the accuracy of maritime environmental perception algorithms. The code is available at https://github.com/xuezhen2018/flexible_content_aware_image_synthesis.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Significance-Preserving-Guided Content-Aware Image Retargeting
    Sung, Y.-H. (facetoface9999@gmail.com), 1600, Springer Science and Business Media Deutschland GmbH (21):
  • [42] Content-Aware Image Retargeting with Controlled Distortion for Small Displays
    Tripathi, Prasun Chandra
    Pal, Rajarshi
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 193 - 200
  • [43] Generalised Gradient Vector Flow for Content-Aware Image Resizing
    Rotondo, Tiziana
    Ortis, Alessandro
    Battiato, Sebastiano
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT II, 2019, 11752 : 260 - 270
  • [44] An optimized fast image resizing method based on content-aware
    Lu, Yan
    Gao, Kun
    Wang, Kewang
    Xu, Tingfa
    INTERNATIONAL SYMPOSIUM ON OPTOELECTRONIC TECHNOLOGY AND APPLICATION 2014: IMAGE PROCESSING AND PATTERN RECOGNITION, 2014, 9301
  • [45] Content-aware Facial Image Compression with Deep Learning Method
    Hu, Shuzhan
    Duan, Yiping
    Tao, Xiaoming
    Liu, Yongjia
    Zhang, Xuming
    Lu, Jianhua
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 516 - 521
  • [46] Green Energy and Content-Aware Data Transmissions in Maritime Wireless Communication Networks
    Yang, Tingting
    Zheng, Zhongming
    Liang, Hao
    Deng, Ruilong
    Cheng, Nan
    Shen, Xuemin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (02) : 751 - 762
  • [47] Saliency-based content-aware lifestyle image mosaics
    Guo, Dongyan
    Tang, Jinhui
    Cui, Ying
    Ding, Jundi
    Zhao, Chunxia
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 26 : 192 - 199
  • [48] A Hybrid Nonlinear and Linear Approach for Content-Aware Image Downscaling
    Owada, Takumi
    Kameda, Yusuke
    Matsuda, Ichiro
    Itoh, Susumu
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2020, 2020, 11515
  • [49] WAVELET BASED SEAM CARVING FOR CONTENT-AWARE IMAGE RESIZING
    Han, Jong-Woo
    Choi, Kang-Sun
    Wang, Tae-Shick
    Cheon, Sung-Hyun
    Ko, Sung-Jea
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 345 - 348
  • [50] A New Watermarking Attack Based on Content-Aware Image Resizing
    Taherinia, A. H.
    Jamzad, M.
    2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES, 2009, : 177 - 180