DeepFish: Accurate underwater live fish recognition with a deep architecture

被引:192
作者
Qin, Hongwei [1 ,2 ]
Li, Xiu [1 ,2 ]
Liang, Jian [2 ]
Peng, Yigang [2 ]
Zhang, Changshui [2 ]
机构
[1] Tsinghua Univ, Grad Sch Shenzhen, Shenzhen 518055, Peoples R China
[2] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Object recognition; Underwater; Cascaded network; IMAGE; CLASSIFICATION; KERNEL;
D O I
10.1016/j.neucom.2015.10.122
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Underwater object recognition is in great demand, while the research is far from enough. The unrestricted natural environment makes it a challenging task. We propose a framework to recognize fish from videos captured by underwater cameras deployed in the ocean observation network. First, we extract the foreground via sparse and low-rank matrix decomposition. Then, a deep architecture is used to extract features of the foreground fish images. In this architecture, principal component analysis (PCA) is used in two convolutional layers, followed by binary hashing in the non-linear layer and block-wise histograms in the feature pooling layer. Then spatial pyramid pooling (SPP) is used to extract information invariant to large poses. Finally, a linear SVM classifier is used for the classification. This deep network model can be trained efficiently. On a real-world fish recognition dataset, we achieve the state-of-the-art accuracy of 98.64%. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:49 / 58
页数:10
相关论文
共 44 条
[1]  
[Anonymous], 2010, P ADV NEUR INF PROC
[2]  
[Anonymous], ARXIV13024389
[3]  
[Anonymous], 2010, P 18 ACM INT C MULT, DOI [10.1145/1873951.1874249, 10.1145/1873951.1874249.2]
[4]  
[Anonymous], ARXIV14043606
[5]  
[Anonymous], ARXIV150201852
[6]  
[Anonymous], ARXIV14053531
[7]  
[Anonymous], 2007, MVA
[8]  
[Anonymous], ARXIV14094842
[9]   Subset based deep learning for RGB-D object recognition [J].
Bai, Jing ;
Wu, Yan ;
Zhang, Junming ;
Chen, Fuqiang .
NEUROCOMPUTING, 2015, 165 :280-292
[10]  
Bengio Y., 2007, LARGE SCALE KERNEL M, V34, P1, DOI DOI 10.7551/MITPRESS/7496.003.0016