DeepFish: Accurate underwater live fish recognition with a deep architecture

被引：192

作者：

Qin, Hongwei ^{[1
,2
]}

Li, Xiu ^{[1
,2
]}

Liang, Jian ^{[2
]}

Peng, Yigang ^{[2
]}

Zhang, Changshui ^{[2
]}

机构：

[1] Tsinghua Univ, Grad Sch Shenzhen, Shenzhen 518055, Peoples R China

[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

来源：

NEUROCOMPUTING | 2016年 / 187卷

基金：

中国国家自然科学基金;

关键词：

Deep learning; Object recognition; Underwater; Cascaded network; IMAGE; CLASSIFICATION; KERNEL;

D O I：

10.1016/j.neucom.2015.10.122

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Underwater object recognition is in great demand, while the research is far from enough. The unrestricted natural environment makes it a challenging task. We propose a framework to recognize fish from videos captured by underwater cameras deployed in the ocean observation network. First, we extract the foreground via sparse and low-rank matrix decomposition. Then, a deep architecture is used to extract features of the foreground fish images. In this architecture, principal component analysis (PCA) is used in two convolutional layers, followed by binary hashing in the non-linear layer and block-wise histograms in the feature pooling layer. Then spatial pyramid pooling (SPP) is used to extract information invariant to large poses. Finally, a linear SVM classifier is used for the classification. This deep network model can be trained efficiently. On a real-world fish recognition dataset, we achieve the state-of-the-art accuracy of 98.64%. (C) 2015 Elsevier B.V. All rights reserved.

引用

页码：49 / 58

页数：10

共 44 条

[1]

[Anonymous], 2010, P ADV NEUR INF PROC

[2]

[Anonymous], ARXIV13024389

[3]

[Anonymous], 2010, P 18 ACM INT C MULT, DOI [10.1145/1873951.1874249, 10.1145/1873951.1874249.2]

[4]

[Anonymous], ARXIV14043606

[5]

[Anonymous], ARXIV150201852

[6]

[Anonymous], ARXIV14053531

[7]

[Anonymous], 2007, MVA

[8]

[Anonymous], ARXIV14094842

[9] Subset based deep learning for RGB-D object recognition [J].

Bai, Jing ;

Wu, Yan ;

Zhang, Junming ;

Chen, Fuqiang .

NEUROCOMPUTING, 2015, 165 :280-292

[10]

Bengio Y., 2007, LARGE SCALE KERNEL M, V34, P1, DOI DOI 10.7551/MITPRESS/7496.003.0016

← 1 2 3 4 5 →