Simultaneous Localization and Segmentation of Fish Objects Using Multi-task CNN and Dense CRF

Cited by: 8

Authors:

Labao, Alfonso B. [1]

Naval, Prospero C., Jr. [1]

Affiliation:

[1] Univ Philippines, Coll Engn, Dept Comp Sci, Comp Vis & Machine Intelligence Grp, Quezon City, Philippines

Source:

INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2019, PT I | 2019 / Vol. 11431

Keywords:

Fish object localization;

DOI:

10.1007/978-3-030-14799-0_52

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose a deep learning tool that localizes fish objects in benthic underwater videos on a frame-by-frame basis. The deep network predicts the spatial coordinates of each fish object and simultaneously segments its pixels. The network follows a state-of-the-art Inception-ResNet-v2 architecture that automatically generates informative features for the object localization and mask segmentation tasks. Predicted masks are passed to dense Conditional Random Field (CRF) post-processing for contour and shape refinement. Unlike prior methods that rely on motion information to segment fish objects, the proposed method requires only RGB video frames to predict both box coordinates and object pixel masks. Independence from motion information makes the model more robust to camera movement and jitter, and better suited to processing underwater videos taken from unmanned water vehicles. We test the model on actual benthic underwater video frames taken from ten different sites. The proposed tool segments fish objects despite wide camera movements and blurred underwater imagery, and is robust to a wide variety of environments and fish species shapes.
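The dense CRF refinement step mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the kernel definitions or parameters, so the code below assumes the commonly used fully connected CRF formulation: a Gaussian appearance kernel over position and color, a Gaussian smoothness kernel over position, a Potts label compatibility, and mean-field inference. The function name `dense_crf_refine`, all weights, and all bandwidths are hypothetical choices for illustration, and the naive O(N^2) kernel is only practical for tiny images; real implementations rely on fast high-dimensional filtering.

```python
import math


def _softmax(logits):
    """Numerically stable softmax over a short tuple of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return tuple(e / total for e in exps)


def dense_crf_refine(prob_fg, image, n_iters=5, w_app=3.0, w_smooth=1.0,
                     theta_pos=3.0, theta_rgb=0.1, theta_smooth=1.0):
    """Naive mean-field inference for a two-label fully connected CRF.

    prob_fg: H x W nested list of foreground (fish) probabilities.
    image:   H x W nested list of (r, g, b) tuples scaled to [0, 1].
    Returns an H x W nested list of refined foreground probabilities.
    All default parameter values are illustrative assumptions.
    """
    h, w = len(prob_fg), len(prob_fg[0])
    pixels = [(y, x) for y in range(h) for x in range(w)]
    n = len(pixels)
    eps = 1e-6

    # Unary energies: negative log-probability of (background, foreground).
    unary = []
    for y, x in pixels:
        p = min(max(prob_fg[y][x], eps), 1.0 - eps)
        unary.append((-math.log(1.0 - p), -math.log(p)))

    # Pairwise kernel between every pair of pixels:
    # appearance term (position + color) plus smoothness term (position only).
    kernel = [[0.0] * n for _ in range(n)]
    for i, (yi, xi) in enumerate(pixels):
        for j, (yj, xj) in enumerate(pixels):
            if i == j:
                continue
            d_pos = (yi - yj) ** 2 + (xi - xj) ** 2
            d_rgb = sum((a - b) ** 2
                        for a, b in zip(image[yi][xi], image[yj][xj]))
            appearance = math.exp(-d_pos / (2 * theta_pos ** 2)
                                  - d_rgb / (2 * theta_rgb ** 2))
            smoothness = math.exp(-d_pos / (2 * theta_smooth ** 2))
            kernel[i][j] = w_app * appearance + w_smooth * smoothness

    # Initialise beliefs from the unaries, then run synchronous updates.
    q = [_softmax((-u0, -u1)) for u0, u1 in unary]
    for _ in range(n_iters):
        new_q = []
        for i in range(n):
            # msg[l]: kernel-weighted sum of the other pixels' belief in label l.
            msg = [sum(kernel[i][j] * q[j][l] for j in range(n))
                   for l in (0, 1)]
            # Potts compatibility: each label is penalised by the mass
            # that similar pixels place on the opposite label.
            new_q.append(_softmax((-unary[i][0] - msg[1],
                                   -unary[i][1] - msg[0])))
        q = new_q
    return [[q[y * w + x][1] for x in range(w)] for y in range(h)]


# Toy frame: dark background on the left, bright "fish" region on the right.
dark, bright = (0.1, 0.1, 0.1), (0.9, 0.9, 0.9)
image = [[dark, dark, bright, bright] for _ in range(4)]
prob_fg = [[0.2, 0.2, 0.8, 0.8] for _ in range(4)]
prob_fg[1][3] = 0.4  # a fish pixel the network under-scored
prob_fg[2][0] = 0.6  # a background pixel the network over-scored
refined = dense_crf_refine(prob_fg, image)
```

In this toy example the two mis-scored pixels are pulled toward the label of the similarly colored pixels around them, which is the contour and shape clean-up effect that this kind of post-processing is meant to provide.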


Pages: 600-612

Page count: 13
