Context-Driven Detection of Invertebrate Species in Deep-Sea Video

Cited by: 9
Authors
McEver, R. Austin [1]
Zhang, Bowen [1]
Levenson, Connor [1]
Iftekhar, A. S. M. [1]
Manjunath, B. S. [1]
Affiliations
[1] Univ Calif Santa Barbara, Santa Barbara, CA 93106 USA
Funding
National Science Foundation (USA)
Keywords
Context driven; Substrate classification; Deep sea; Invertebrate classification; Underwater; Video dataset; CLASSIFICATION
DOI
10.1007/s11263-023-01755-4
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Each year, underwater remotely operated vehicles (ROVs) collect thousands of hours of video of unexplored ocean habitats revealing a plethora of information regarding biodiversity on Earth. However, fully utilizing this information remains a challenge as proper annotations and analysis require trained scientists' time, which is both limited and costly. To this end, we present a Dataset for Underwater Substrate and Invertebrate Analysis (DUSIA), a benchmark suite and growing large-scale dataset to train, validate, and test methods for temporally localizing four underwater substrates as well as temporally and spatially localizing 59 underwater invertebrate species. DUSIA currently includes over ten hours of footage across 25 videos captured in 1080p at 30 fps by an ROV following pre-planned transects across the ocean floor near the Channel Islands of California. Each video includes annotations indicating the start and end times of substrates across the video in addition to counts of species of interest. Some frames are annotated with precise bounding box locations for invertebrate species of interest, as seen in Fig. 1. To our knowledge, DUSIA is the first dataset of its kind for deep sea exploration, with video from a moving camera, that includes substrate annotations and invertebrate species that are present at significant depths where sunlight does not penetrate. Additionally, we present the novel context-driven object detector (CDD) where we use explicit substrate classification to influence an object detection network to simultaneously predict a substrate and species class influenced by that substrate. We also present a method for improving training on partially annotated bounding box frames. Finally, we offer a baseline method for automating the counting of invertebrate species of interest.
Pages: 1367-1388
Page count: 22