Flexible content-aware image synthesis for maritime tasks with diffusion models

被引：0

作者：

Xue, Zhenfeng ^{[1
,3
]}

Hu, Yuanqi ^{[2
,3
]}

Lu, Ankang ^{[2
]}

Chen, Zhuo ^{[4
]}

Zang, Ying ^{[2
]}

Miao, Zhonghua ^{[1
]}

机构：

[1] Shanghai Univ, Sch Mechatron Engn & Automat, 99 Shangda Rd, Shanghai 200444, Peoples R China

[2] Huzhou Univ, Sch Informat Engn, 759 Second Ring East Rd, Huzhou 313000, Peoples R China

[3] Zhejiang Univ, Res Ctr Marine Robot, Huzhou Inst, 819 Xisaishan Rd, Huzhou 313098, Peoples R China

[4] Zhejiang Univ, Sch Control Sci & Engn, 38 Zheda Rd, Hangzhou 310027, Peoples R China

来源：

APPLIED OCEAN RESEARCH | 2025年 / 158卷

关键词：

Image synthesis; Content-aware; Diffusion model; Maritime environmental perception; OBSTACLE DETECTION; DATASET;

D O I：

10.1016/j.apor.2025.104511

中图分类号：

P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

Maritime environmental perception suffers greatly from data lack due to the high cost of data collection at sea. In this paper, a novel image synthesis method is proposed to automatically generate target images with diverse foreground and background. Specifically, foreground images for various poses are generated using a diffusion model, presenting different modalities of the detected target. The environment conditions of the background images are flexibly adjusted by inputting semantic prompts to another diffusion model. Then a 3D affine diffusion model is proposed for effective fusion of foreground and background. This module calculates the size and position of the foreground image within the background image through affine transformation, and utilizes the excellent image fusion ability of the diffusion model to achieve high-quality image synthesis. As a result, a set of dynamically variable foreground and background images are generated to increase the pose and weather diversity of maritime object detection samples. Extensive experiments are conducted to verify the effectiveness of image synthesis algorithms, and this method can also serve downstream tasks, effectively improving the accuracy of maritime environmental perception algorithms. The code is available at https://github.com/xuezhen2018/flexible_content_aware_image_synthesis.

引用

页数：11

共 50 条

[41] Significance-Preserving-Guided Content-Aware Image Retargeting
Sung, Y.-H. (facetoface9999@gmail.com), 1600, Springer Science and Business Media Deutschland GmbH (21):
[42] Content-Aware Image Retargeting with Controlled Distortion for Small Displays
Tripathi, Prasun Chandra
Pal, Rajarshi
2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 193 - 200
[43] Generalised Gradient Vector Flow for Content-Aware Image Resizing
Rotondo, Tiziana
Ortis, Alessandro
Battiato, Sebastiano
IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT II, 2019, 11752 : 260 - 270
[44] An optimized fast image resizing method based on content-aware
Lu, Yan
Gao, Kun
Wang, Kewang
Xu, Tingfa
INTERNATIONAL SYMPOSIUM ON OPTOELECTRONIC TECHNOLOGY AND APPLICATION 2014: IMAGE PROCESSING AND PATTERN RECOGNITION, 2014, 9301
[45] Content-aware Facial Image Compression with Deep Learning Method
Hu, Shuzhan
Duan, Yiping
Tao, Xiaoming
Liu, Yongjia
Zhang, Xuming
Lu, Jianhua
2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 516 - 521
[46] Green Energy and Content-Aware Data Transmissions in Maritime Wireless Communication Networks
Yang, Tingting
Zheng, Zhongming
Liang, Hao
Deng, Ruilong
Cheng, Nan
Shen, Xuemin
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (02) : 751 - 762
[47] Saliency-based content-aware lifestyle image mosaics
Guo, Dongyan
Tang, Jinhui
Cui, Ying
Ding, Jundi
Zhao, Chunxia
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 26 : 192 - 199
[48] A Hybrid Nonlinear and Linear Approach for Content-Aware Image Downscaling
Owada, Takumi
Kameda, Yusuke
Matsuda, Ichiro
Itoh, Susumu
INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2020, 2020, 11515
[49] WAVELET BASED SEAM CARVING FOR CONTENT-AWARE IMAGE RESIZING
Han, Jong-Woo
Choi, Kang-Sun
Wang, Tae-Shick
Cheon, Sung-Hyun
Ko, Sung-Jea
2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 345 - 348
[50] A New Watermarking Attack Based on Content-Aware Image Resizing
Taherinia, A. H.
Jamzad, M.
2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES, 2009, : 177 - 180

← 1 2 3 4 5 →