USOD10K: A New Benchmark Dataset for Underwater Salient Object Detection

被引:28
作者
Hong, Lin [1 ]
Wang, Xin [1 ,2 ]
Zhang, Gan [1 ]
Zhao, Ming [1 ]
机构
[1] Harbin Inst Technol Shenzhen, Sch Mech Engn & Automat, Shenzhen 518055, Peoples R China
[2] Guangdong Prov Key Lab Intelligent Morphing Mech &, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Visualization; Benchmark testing; Task analysis; Transformers; Object detection; Convolution; Decoding; Underwater salient object detection (USOD); transformer; benchmark; dataset; baseline; NETWORK; DESIGN; MODEL;
D O I
10.1109/TIP.2023.3266163
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Underwater salient object detection (USOD) is an emerging research area that has great potential for various underwater visual tasks. However, USOD research is still in its early stage due to the lack of large-scale datasets within which salient objects are well-defined and pixel-wise annotated. To address this issue, this paper introduces a new dataset named USOD10K. It contains 10,255 underwater images, covering 70 categories of salient objects in 12 different underwater scenes. Moreover, the USOD10K provides salient object boundaries and depth maps of all images. The USOD10K is the first large-scale dataset in the USOD community, making a significant leap in diversity, complexity, and scalability. Secondly, a simple but strong baseline termed TC-USOD is proposed for the USOD10K. The TC-USOD adopts a hybrid architecture based on an encoder-decoder design that leverages transformer and convolution as the basic computational building block of the encoder and decoder, respectively. Thirdly, we make a comprehensive summarization of 35 state-of-the-art SOD/USOD methods and benchmark them on the existing USOD dataset and the USOD10K. The results show that our TC-USOD achieves superior performance on all datasets tested. Finally, several other use cases of the USOD10K are discussed, and future directions of USOD research are pointed out. This work will promote the development of the USOD research and facilitate further research on underwater visual tasks and visually-guided underwater robots. To pave the road in the USOD research field, the dataset, code, and benchmark results are publicly available: https://github.com/Underwater-Robotic-Lab/USOD10K.
引用
收藏
页码:1602 / 1615
页数:14
相关论文
共 104 条
[1]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[2]   Sea-thru: A Method For Removing Water From Underwater Images [J].
Akkaynak, Derya ;
Treibitz, Tali .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1682-1691
[3]   A Revised Underwater Image Formation Model [J].
Akkaynak, Derya ;
Treibitz, Tali .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6723-6732
[4]   A Fully Automated Method to Detect and Segment a Manufactured Object in an Underwater Color Image [J].
Barat, Christian ;
Phlypo, Ronald .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
[5]   Saliency-based Detection for Maritime Object Tracking [J].
Cane, Tom ;
Ferryman, James .
PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, :1257-1264
[6]  
Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599
[7]   A Highly Efficient Model to Study the Semantics of Salient Object Detection [J].
Cheng, Ming-Ming ;
Gao, Shang-Hua ;
Borji, Ali ;
Tan, Yong-Qiang ;
Lin, Zheng ;
Wang, Meng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) :8006-8021
[8]  
Choromanski K., 2021, P INT C LEARN REPR, P1
[9]  
Cui Z., 2019, DATA SCI, P157
[10]   A tutorial on the cross-entropy method [J].
De Boer, PT ;
Kroese, DP ;
Mannor, S ;
Rubinstein, RY .
ANNALS OF OPERATIONS RESEARCH, 2005, 134 (01) :19-67